A Generative Flow for Text-to-Speech via Monotonic Alignment Search
$ npx skills add jaywalnut310/glow-tts
Alternatives
Compare similar skills by workflow fit, trust score, quality, GitHub adoption, maintenance, and install readiness.
Current skill
[AutoArk] GPA (General Purpose Audio) can do ASR, TTS and voice conversion with one tiny model!
A Generative Flow for Text-to-Speech via Monotonic Alignment Search
$ npx skills add jaywalnut310/glow-tts
The code for the bark-voicecloning model. Training and inference.
$ npx skills add gitmylo/bark-voice-cloning-HuBERT-quantizer
Implementation of F5-TTS in MLX
$ npx skills add lucasnewman/f5-tts-mlx
A Pytorch Implementation of "Neural Speech Synthesis with Transformer Network"
$ npx skills add soobinseo/Transformer-TTS
Run Orpheus 3B Locally With LM Studio
$ npx skills add isaiahbjork/orpheus-tts-local
unofficial vits2-TTS implementation in pytorch
$ npx skills add p0p4k/vits2_pytorch
Simple Python script to interact with the TikTok TTS API
$ npx skills add oscie57/tiktok-voice
Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in Pytorch
$ npx skills add lucidrains/e2-tts-pytorch
🚀 AI 全自动短视频引擎 | AI Fully Automated Short Video Engine
$ npx skills add AIDC-AI/Pixelle-Video
An Open-Sourced LLM-empowered Foundation TTS System
$ npx skills add FireRedTeam/FireRedTTS
LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis
$ npx skills add zhenye234/LLaSA_training
ComfyUI custom node for the VibeVoice TTS. Expressive, long-form, multi-speaker conversational audio
$ npx skills add wildminder/ComfyUI-VibeVoice
Modified version of Chatterbox that accepts text files as input and no character restrictions. I use it to make audiobooks, especially for my kids.
$ npx skills add petermg/Chatterbox-TTS-Extended
VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design
$ npx skills add daniilrobnikov/vits2
On-device streaming speech-to-text engine powered by deep learning
$ npx skills add Picovoice/cheetah
A SOTA Industrial-Grade All-in-One ASR system with ASR, VAD, LID, and Punc modules. FireRedASR2 supports Chinese (Mandarin, 20+ dialects/accents), English, code-switching, and both speech and singing ASR. FireRedVAD supports speech/singing/music in 100+ langs. FireRedLID supports 100+ langs and 20+ zh dialects. FireRedPunc supports zh and en.
$ npx skills add FireRedTeam/FireRedASR2S
How to choose
Use an alternative when it has a clearer install path, higher trust score, fresher maintenance, or better platform fit for your current agent stack. Keep GPA if it already passes your workflow test and repository review.
Next step
Open the compare page, test the install commands in a sandbox, and check each repository before using a skill in production.