MARS5 speech model (TTS) from CAMB.AI
$ npx skills add Camb-ai/MARS5-TTSAlternatives
Compare similar skills by workflow fit, trust score, quality, GitHub adoption, maintenance, and install readiness.
Current skill
Tacotron 2 - PyTorch implementation with faster-than-realtime inference modified to enable cross lingual voice cloning.
MARS5 speech model (TTS) from CAMB.AI
$ npx skills add Camb-ai/MARS5-TTS1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
$ npx skills add RVC-Boss/GPT-SoVITSđClone a voice in 5 seconds to generate arbitrary speech in real-time
$ npx skills add babysor/MockingBirdMOSSâTTS Family is an openâsource speech and sound generation model family from MOSI.AI and the OpenMOSS team. It is designed for highâfidelity, highâexpressiveness, and complex realâworld scenarios, covering stable longâform speech, multiâspeaker dialogue, voice/character design, environmental sound effects, and realâtime streaming TTS.
$ npx skills add OpenMOSS/MOSS-TTSA Python/Pytorch app for easily synthesising human voices
$ npx skills add voice-cloning-app/Voice-Cloning-AppGPT-SoVITS ONNX Inference Engine & Model Converter
$ npx skills add High-Logic/Genie-TTSPyTorch Implementation of FastSpeech 2 : Fast and High-Quality End-to-End Text to Speech
$ npx skills add rishikksh20/FastSpeech2VoxCPM2: Tokenizer-Free TTS for Multilingual Speech Generation, Creative Voice Design, and True-to-Life Cloning
$ npx skills add OpenBMB/VoxCPMVITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
$ npx skills add jaywalnut310/vitsTranslate the video from one language to another and embed dubbing & subtitles.
$ npx skills add jianchang512/pyvideotransA TTS model capable of generating ultra-realistic dialogue in one pass.
$ npx skills add nari-labs/diaA Flow Matching-based Text-to-Speech Model with Emoji-driven Style Control
$ npx skills add Aratako/Irodori-TTSeSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.
$ npx skills add espeak-ng/espeak-ngAn unofficial PyTorch implementation of the audio LM VALL-E
$ npx skills add enhuiz/vall-eđ¸đŹ - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
$ npx skills add coqui-ai/TTSDockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/multiplatform CPU, AMD, NVIDIA GPU PyTorch support, handling, and auto-stitching
$ npx skills add remsky/Kokoro-FastAPIHow to choose
Use an alternative when it has a clearer install path, higher trust score, fresher maintenance, or better platform fit for your current agent stack. Keep Cross Lingual Voice Cloning if it already passes your workflow test and repository review.
Next step
Open the compare page, test the install commands in a sandbox, and check each repository before using a skill in production.