VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
$ npx skills add jaywalnut310/vits
Alternatives
Compare similar skills by workflow fit, trust score, quality, GitHub adoption, maintenance, and install readiness.
Current skill
This is now the official location of the Merlin project.
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
$ npx skills add jaywalnut310/vits
🚀 Clone a voice in 5 seconds to generate arbitrary speech in real-time
$ npx skills add babysor/MockingBird
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
$ npx skills add coqui-ai/TTS
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
$ npx skills add rany2/edge-tts
DeepMind's Tacotron-2 Tensorflow implementation
$ npx skills add Rayhane-mamah/Tacotron-2
Converts text to speech in realtime
$ npx skills add KoljaB/RealtimeTTS
Vietnamese TTS with instant voice cloning • On-device • Real-time CPU inference • 24kHz audio quality • Convert text to Vietnamese speech • Vietnamese text to speech • Vietnamese TTS
$ npx skills add pnnbao97/VieNeu-TTS
Controllable and fast Text-to-Speech for over 7000 languages!
$ npx skills add DigitalPhonetics/IMS-Toucan
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
$ npx skills add RVC-Boss/GPT-SoVITS
VoxCPM2: Tokenizer-Free TTS for Multilingual Speech Generation, Creative Voice Design, and True-to-Life Cloning
$ npx skills add OpenBMB/VoxCPM
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
$ npx skills add netease-youdao/EmotiVoice
Translate the video from one language to another and embed dubbing & subtitles.
$ npx skills add jianchang512/pyvideotrans
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
$ npx skills add index-tts/index-tts
Foundational model for human-like, expressive TTS
$ npx skills add metavoiceio/metavoice-src
A TTS model capable of generating ultra-realistic dialogue in one pass.
$ npx skills add nari-labs/dia
A Flow Matching-based Text-to-Speech Model with Emoji-driven Style Control
$ npx skills add Aratako/Irodori-TTS
How to choose
Use an alternative when it has a clearer install path, higher trust score, fresher maintenance, or better platform fit for your current agent stack. Keep Merlin if it already passes your workflow test and repository review.
Next step
Open the compare page, test the install commands in a sandbox, and check each repository before using a skill in production.