A Generative Flow for Text-to-Speech via Monotonic Alignment Search
$ npx skills add jaywalnut310/glow-ttsAlternatives
Compare similar skills by workflow fit, trust score, quality, GitHub adoption, maintenance, and install readiness.
Current skill
unofficial vits2-TTS implementation in pytorch
A Generative Flow for Text-to-Speech via Monotonic Alignment Search
$ npx skills add jaywalnut310/glow-ttsImplementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in Pytorch
$ npx skills add lucidrains/e2-tts-pytorchVITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design
$ npx skills add daniilrobnikov/vits2[AutoArk] GPA (General Purpose Audio) can do ASR, TTS and voice conversion with one tiny model!
$ npx skills add AutoArk/GPAImplementation of Voicebox, new SOTA Text-to-speech network from MetaAI, in Pytorch
$ npx skills add lucidrains/voicebox-pytorchImplementation of F5-TTS in MLX
$ npx skills add lucasnewman/f5-tts-mlxA Pytorch Implementation of "Neural Speech Synthesis with Transformer Network"
$ npx skills add soobinseo/Transformer-TTSRun Orpheus 3B Locally With LM Studio
$ npx skills add isaiahbjork/orpheus-tts-localFireRed-Image-Edit is a powerful image editing foundation model achieving open-source state-of-the-art performance with precise instruction following, high-fidelity generation, superior identity consistency, and seamless multi-element fusion.
$ npx skills add FireRedTeam/FireRed-Image-EditSimple Python script to interact with the TikTok TTS API
$ npx skills add oscie57/tiktok-voice🚀 AI 全自动短视频引擎 | AI Fully Automated Short Video Engine
$ npx skills add AIDC-AI/Pixelle-VideoLLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis
$ npx skills add zhenye234/LLaSA_trainingA lightweight, offline Android Text-to-Speech (TTS) engine enabling seamless system-wide voice cloning and high-fidelity text reading. / 运行在安卓本地的轻量级文字转语音 (TTS) 引擎,支持离线发音人提取、零门槛音色克隆与双擎系统级全局听书。
$ npx skills add sipeter/CloneTTSDiffWave is a fast, high-quality neural vocoder and waveform synthesizer.
$ npx skills add lmnt-com/diffwaveThe code for the bark-voicecloning model. Training and inference.
$ npx skills add gitmylo/bark-voice-cloning-HuBERT-quantizer[ICLR24] Official implementation of the paper “MagicDrive: Street View Generation with Diverse 3D Geometry Control”
$ npx skills add cure-lab/MagicDriveHow to choose
Use an alternative when it has a clearer install path, higher trust score, fresher maintenance, or better platform fit for your current agent stack. Keep Vits2 Pytorch if it already passes your workflow test and repository review.
Next step
Open the compare page, test the install commands in a sandbox, and check each repository before using a skill in production.