StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
$ npx skills add yl4579/StyleTTS2Alternatives
Compare similar skills by workflow fit, trust score, quality, GitHub adoption, maintenance, and install readiness.
Current skill
[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
$ npx skills add yl4579/StyleTTS2:robot: :speech_balloon: Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
$ npx skills add mozilla/TTS🚀Clone a voice in 5 seconds to generate arbitrary speech in real-time
$ npx skills add babysor/MockingBirdA Flow Matching-based Text-to-Speech Model with Emoji-driven Style Control
$ npx skills add Aratako/Irodori-TTS🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
$ npx skills add coqui-ai/TTSVietnamese TTS with instant voice cloning • On-device • Real-time CPU inference • 24kHz audio quality • Chuyển văn bản thành giọng nói tiếng Việt • Text to speech tiếng Việt • TTS tiếng Việt
$ npx skills add pnnbao97/VieNeu-TTSSilero Models: pre-trained text-to-speech models made embarrassingly simple
$ npx skills add snakers4/silero-modelsControllable and fast Text-to-Speech for over 7000 languages!
$ npx skills add DigitalPhonetics/IMS-ToucanUnsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.
$ npx skills add unslothai/unsloth1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
$ npx skills add RVC-Boss/GPT-SoVITSVoxCPM2: Tokenizer-Free TTS for Multilingual Speech Generation, Creative Voice Design, and True-to-Life Cloning
$ npx skills add OpenBMB/VoxCPMEmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
$ npx skills add netease-youdao/EmotiVoiceOpenVINO™ is an open source toolkit for optimizing and deploying AI inference
$ npx skills add openvinotoolkit/openvinoVITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
$ npx skills add jaywalnut310/vitsTranslate the video from one language to another and embed dubbing & subtitles.
$ npx skills add jianchang512/pyvideotransAn Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
$ npx skills add index-tts/index-ttsHow to choose
Use an alternative when it has a clearer install path, higher trust score, fresher maintenance, or better platform fit for your current agent stack. Keep Matcha TTS if it already passes your workflow test and repository review.
Next step
Open the compare page, test the install commands in a sandbox, and check each repository before using a skill in production.