Instant voice cloning by MIT and MyShell. Audio foundation model.
$ npx skills add myshell-ai/OpenVoiceAlternatives
Compare similar skills by workflow fit, trust score, quality, GitHub adoption, maintenance, and install readiness.
Current skill
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
Instant voice cloning by MIT and MyShell. Audio foundation model.
$ npx skills add myshell-ai/OpenVoice1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
$ npx skills add RVC-Boss/GPT-SoVITSGPT-SoVITS ONNX Inference Engine & Model Converter
$ npx skills add High-Logic/Genie-TTS🚀Clone a voice in 5 seconds to generate arbitrary speech in real-time
$ npx skills add babysor/MockingBirdUse Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
$ npx skills add rany2/edge-ttsOmniVoice TTS nodes for ComfyUI - Zero-shot multilingual text-to-speech with voice cloning, voice design, and multi-speaker dialogue
$ npx skills add Saganaki22/ComfyUI-OmniVoice-TTSAutomatically translates the text of a video based on a subtitle file, and then uses AI voice services to create a new dubbed & translated audio track where the speech is synced using the subtitle's timings.
$ npx skills add ThioJoe/Auto-Synced-Translated-DubsInterface for OuteTTS models.
$ npx skills add edwko/OuteTTSVoxCPM2: Tokenizer-Free TTS for Multilingual Speech Generation, Creative Voice Design, and True-to-Life Cloning
$ npx skills add OpenBMB/VoxCPMVITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
$ npx skills add jaywalnut310/vitsPyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html
$ npx skills add lifeiteng/vall-eSoprano: Instant, Ultra-Realistic Text-to-Speech
$ npx skills add ekwek1/sopranoTranslate the video from one language to another and embed dubbing & subtitles.
$ npx skills add jianchang512/pyvideotransFoundational model for human-like, expressive TTS
$ npx skills add metavoiceio/metavoice-srcA TTS model capable of generating ultra-realistic dialogue in one pass.
$ npx skills add nari-labs/dia[AutoArk] GPA (General Purpose Audio) can do ASR, TTS and voice conversion with one tiny model!
$ npx skills add AutoArk/GPAHow to choose
Use an alternative when it has a clearer install path, higher trust score, fresher maintenance, or better platform fit for your current agent stack. Keep Index Tts if it already passes your workflow test and repository review.
Next step
Open the compare page, test the install commands in a sandbox, and check each repository before using a skill in production.