Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in Pytorch
$ npx skills add lucidrains/e2-tts-pytorchAlternatives
Compare similar skills by workflow fit, trust score, quality, GitHub adoption, maintenance, and install readiness.
Current skill
LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis
Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in Pytorch
$ npx skills add lucidrains/e2-tts-pytorchA talking LLM that runs on your own computer without needing the internet.
$ npx skills add vndee/local-talking-llm[AutoArk] GPA (General Purpose Audio) can do ASR, TTS and voice conversion with one tiny model!
$ npx skills add AutoArk/GPAImplementation of Voicebox, new SOTA Text-to-speech network from MetaAI, in Pytorch
$ npx skills add lucidrains/voicebox-pytorchA Generative Flow for Text-to-Speech via Monotonic Alignment Search
$ npx skills add jaywalnut310/glow-ttsThe code for the bark-voicecloning model. Training and inference.
$ npx skills add gitmylo/bark-voice-cloning-HuBERT-quantizerImplementation of F5-TTS in MLX
$ npx skills add lucasnewman/f5-tts-mlxRun Orpheus 3B Locally With LM Studio
$ npx skills add isaiahbjork/orpheus-tts-localunofficial vits2-TTS implementation in pytorch
$ npx skills add p0p4k/vits2_pytorchOpen source, local, and self-hosted highly optimized language inference server supporting ASR/STT, TTS, and LLM across WebRTC, REST, and WS
$ npx skills add toverainc/willow-inference-serverReal-time voice assistant — WebRTC streaming, faster-whisper ASR, local LLM, Vui Nano (300M) TTS. OpenAI Realtime API compatible. Voice cloning, barge-in, ~9× realtime on a 4090. Apache 2.0.
$ npx skills add fluxions-ai/vuiSimple Python script to interact with the TikTok TTS API
$ npx skills add oscie57/tiktok-voice🚀 AI 全自动短视频引擎 | AI Fully Automated Short Video Engine
$ npx skills add AIDC-AI/Pixelle-VideoAutoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
$ npx skills add FoundationVision/LlamaGenA lightweight, offline Android Text-to-Speech (TTS) engine enabling seamless system-wide voice cloning and high-fidelity text reading. / 运行在安卓本地的轻量级文字转语音 (TTS) 引擎,支持离线发音人提取、零门槛音色克隆与双擎系统级全局听书。
$ npx skills add sipeter/CloneTTSAI-powered multi-voice audiobook generator — LLM script annotation, voice cloning, voice design, LoRA training, per-line style control, and export to MP3, chaptered M4B, or Audacity multi-track. Built on Qwen3-TTS.
$ npx skills add Finrandojin/alexandria-audiobookHow to choose
Use an alternative when it has a clearer install path, higher trust score, fresher maintenance, or better platform fit for your current agent stack. Keep LLaSA Training if it already passes your workflow test and repository review.
Next step
Open the compare page, test the install commands in a sandbox, and check each repository before using a skill in production.