A Generative Flow for Text-to-Speech via Monotonic Alignment Search
$ npx skills add jaywalnut310/glow-ttsAlternatives
Compare similar skills by workflow fit, trust score, quality, GitHub adoption, maintenance, and install readiness.
Current skill
VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design
A Generative Flow for Text-to-Speech via Monotonic Alignment Search
$ npx skills add jaywalnut310/glow-ttsunofficial vits2-TTS implementation in pytorch
$ npx skills add p0p4k/vits2_pytorchAn Open-Sourced LLM-empowered Foundation TTS System
$ npx skills add FireRedTeam/FireRedTTS[AutoArk] GPA (General Purpose Audio) can do ASR, TTS and voice conversion with one tiny model!
$ npx skills add AutoArk/GPAA Pytorch Implementation of "Neural Speech Synthesis with Transformer Network"
$ npx skills add soobinseo/Transformer-TTSMake Azure natural TTS voices accessible to any SAPI 5-compatible application.
$ npx skills add gexgd0419/NaturalVoiceSAPIAdapterDeep learning for audio processing
$ npx skills add markovka17/dlaDiffWave is a fast, high-quality neural vocoder and waveform synthesizer.
$ npx skills add lmnt-com/diffwaveA lightweight, offline Android Text-to-Speech (TTS) engine enabling seamless system-wide voice cloning and high-fidelity text reading. / 运行在安卓本地的轻量级文字转语音 (TTS) 引擎,支持离线发音人提取、零门槛音色克隆与双擎系统级全局听书。
$ npx skills add sipeter/CloneTTSSimple Python script to interact with the TikTok TTS API
$ npx skills add oscie57/tiktok-voicePytorch实现的流式与非流式的自动语音识别框架,同时兼容在线和离线识别,目前支持Conformer、Squeezeformer、DeepSpeech2模型,支持多种数据增强方法。
$ npx skills add yeyupiaoling/MASR🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
$ npx skills add huggingface/diffusersImplementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in Pytorch
$ npx skills add lucidrains/e2-tts-pytorchComfyUI custom node for the VibeVoice TTS. Expressive, long-form, multi-speaker conversational audio
$ npx skills add wildminder/ComfyUI-VibeVoiceModified version of Chatterbox that accepts text files as input and no character restrictions. I use it to make audiobooks, especially for my kids.
$ npx skills add petermg/Chatterbox-TTS-ExtendedSuno AI's Bark model in C/C++ for fast text-to-speech generation
$ npx skills add PABannier/bark.cppHow to choose
Use an alternative when it has a clearer install path, higher trust score, fresher maintenance, or better platform fit for your current agent stack. Keep Vits2 if it already passes your workflow test and repository review.
Next step
Open the compare page, test the install commands in a sandbox, and check each repository before using a skill in production.