LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis
$ npx skills add zhenye234/LLaSA_training
Alternatives
Compare similar skills by workflow fit, trust score, quality, GitHub adoption, maintenance, and install readiness.
Current skill
Real-time voice assistant — WebRTC streaming, faster-whisper ASR, local LLM, Vui Nano (300M) TTS. OpenAI Realtime API compatible. Voice cloning, barge-in, ~9× realtime on a 4090. Apache 2.0.
LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis
$ npx skills add zhenye234/LLaSA_training
[AutoArk] GPA (General Purpose Audio) can do ASR, TTS and voice conversion with one tiny model!
$ npx skills add AutoArk/GPA
AI-powered multi-voice audiobook generator: LLM script annotation, voice cloning, voice design, LoRA training, per-line style control, and export to MP3, chaptered M4B, or Audacity multi-track. Built on Qwen3-TTS.
$ npx skills add Finrandojin/alexandria-audiobook
Turn PDFs and EPUBs into audiobooks; subtitles or videos into dubbed videos (including translation), and more. For free. Pandrator uses local models, including voice-cloning (instant, RVC-enhanced, XTTS fine-tuning) and LLM processing. It aspires to be a user-friendly app with a GUI, an installer and all-in-one packages.
$ npx skills add lukaszliniewicz/Pandrator
Open source, local, and self-hosted highly optimized language inference server supporting ASR/STT, TTS, and LLM across WebRTC, REST, and WS
$ npx skills add toverainc/willow-inference-server
Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in PyTorch
$ npx skills add lucidrains/e2-tts-pytorch
Local, OpenAI-compatible text-to-speech (TTS) API using Chatterbox, enabling users to generate voice cloned speech anywhere the OpenAI API is used (e.g. Open WebUI, AnythingLLM, etc.)
$ npx skills add travisvn/chatterbox-tts-api
An Open-Sourced LLM-empowered Foundation TTS System
$ npx skills add FireRedTeam/FireRedTTS
Run Qwen3-TTS text-to-speech locally on Mac (M1/M2/M3/M4). Voice cloning, voice design, custom voices. 100% offline using MLX.
$ npx skills add kapi2800/qwen3-tts-apple-silicon
ComfyUI custom node for the VibeVoice TTS. Expressive, long-form, multi-speaker conversational audio
$ npx skills add wildminder/ComfyUI-VibeVoice
Modified version of Chatterbox that accepts text files as input and has no character restrictions. I use it to make audiobooks, especially for my kids.
$ npx skills add petermg/Chatterbox-TTS-Extended
Vonage REST API client for PHP. API support for SMS, Voice, Text-to-Speech, Numbers, Verify (2FA) and more.
$ npx skills add Vonage/vonage-php-sdk-core
Captain's log and 3D star map for Elite Dangerous
$ npx skills add EDDiscovery/EDDiscovery
Augmentative and Alternative Communication (AAC) system with text-to-speech for the browser
$ npx skills add cboard-org/cboard
On-device Speech-to-Intent engine powered by deep learning
$ npx skills add Picovoice/rhino
A modular Swift SDK for audio processing with MLX on Apple Silicon
$ npx skills add Blaizzy/mlx-audio-swift
How to choose
Use an alternative when it has a clearer install path, higher trust score, fresher maintenance, or better platform fit for your current agent stack. Keep Vui if it already passes your workflow test and repository review.
Next step
Open the compare page, test the install commands in a sandbox, and check each repository before using a skill in production.