Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in Pytorch
$ npx skills add lucidrains/e2-tts-pytorchAlternatives
Compare similar skills by workflow fit, trust score, quality, GitHub adoption, maintenance, and install readiness.
Current skill
Implementation of Voicebox, new SOTA Text-to-speech network from MetaAI, in Pytorch
Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in Pytorch
$ npx skills add lucidrains/e2-tts-pytorchA Generative Flow for Text-to-Speech via Monotonic Alignment Search
$ npx skills add jaywalnut310/glow-ttsunofficial vits2-TTS implementation in pytorch
$ npx skills add p0p4k/vits2_pytorchLLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis
$ npx skills add zhenye234/LLaSA_training[AutoArk] GPA (General Purpose Audio) can do ASR, TTS and voice conversion with one tiny model!
$ npx skills add AutoArk/GPAThe code for the bark-voicecloning model. Training and inference.
$ npx skills add gitmylo/bark-voice-cloning-HuBERT-quantizerImplementation of F5-TTS in MLX
$ npx skills add lucasnewman/f5-tts-mlxRun Orpheus 3B Locally With LM Studio
$ npx skills add isaiahbjork/orpheus-tts-localFireRed-Image-Edit is a powerful image editing foundation model achieving open-source state-of-the-art performance with precise instruction following, high-fidelity generation, superior identity consistency, and seamless multi-element fusion.
$ npx skills add FireRedTeam/FireRed-Image-EditSimple Python script to interact with the TikTok TTS API
$ npx skills add oscie57/tiktok-voice🚀 AI 全自动短视频引擎 | AI Fully Automated Short Video Engine
$ npx skills add AIDC-AI/Pixelle-VideoA talking LLM that runs on your own computer without needing the internet.
$ npx skills add vndee/local-talking-llmA lightweight, offline Android Text-to-Speech (TTS) engine enabling seamless system-wide voice cloning and high-fidelity text reading. / 运行在安卓本地的轻量级文字转语音 (TTS) 引擎,支持离线发音人提取、零门槛音色克隆与双擎系统级全局听书。
$ npx skills add sipeter/CloneTTSDiffWave is a fast, high-quality neural vocoder and waveform synthesizer.
$ npx skills add lmnt-com/diffwaveAI-powered multi-voice audiobook generator — LLM script annotation, voice cloning, voice design, LoRA training, per-line style control, and export to MP3, chaptered M4B, or Audacity multi-track. Built on Qwen3-TTS.
$ npx skills add Finrandojin/alexandria-audiobookFoundational Model for Speech Recognition Tasks
$ npx skills add salute-developers/GigaAMHow to choose
Use an alternative when it has a clearer install path, higher trust score, fresher maintenance, or better platform fit for your current agent stack. Keep Voicebox Pytorch if it already passes your workflow test and repository review.
Next step
Open the compare page, test the install commands in a sandbox, and check each repository before using a skill in production.