🚀Clone a voice in 5 seconds to generate arbitrary speech in real-time
$ npx skills add babysor/MockingBird
Alternatives
Compare similar skills by workflow fit, trust score, quality, GitHub adoption, maintenance, and install readiness.
Current skill
An unofficial PyTorch implementation of the audio LM VALL-E
🚀Clone a voice in 5 seconds to generate arbitrary speech in real-time
$ npx skills add babysor/MockingBird
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
$ npx skills add RVC-Boss/GPT-SoVITS
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
$ npx skills add jaywalnut310/vits
PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html
$ npx skills add lifeiteng/vall-e
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
$ npx skills add rany2/edge-tts
Instant voice cloning by MIT and MyShell. Audio foundation model.
$ npx skills add myshell-ai/OpenVoice
A Python/Pytorch app for easily synthesising human voices
$ npx skills add voice-cloning-app/Voice-Cloning-App
Automatically translates the text of a video based on a subtitle file, and then uses AI voice services to create a new dubbed & translated audio track where the speech is synced using the subtitle's timings.
$ npx skills add ThioJoe/Auto-Synced-Translated-Dubs
GPT-SoVITS ONNX Inference Engine & Model Converter
$ npx skills add High-Logic/Genie-TTS
Interface for OuteTTS models.
$ npx skills add edwko/OuteTTS
VoxCPM2: Tokenizer-Free TTS for Multilingual Speech Generation, Creative Voice Design, and True-to-Life Cloning
$ npx skills add OpenBMB/VoxCPM
Soprano: Instant, Ultra-Realistic Text-to-Speech
$ npx skills add ekwek1/soprano
Translate the video from one language to another and embed dubbing & subtitles.
$ npx skills add jianchang512/pyvideotrans
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
$ npx skills add index-tts/index-tts
A TTS model capable of generating ultra-realistic dialogue in one pass.
$ npx skills add nari-labs/dia
A Flow Matching-based Text-to-Speech Model with Emoji-driven Style Control
$ npx skills add Aratako/Irodori-TTS
How to choose
Use an alternative when it has a clearer install path, higher trust score, fresher maintenance, or better platform fit for your current agent stack. Keep Vall E if it already passes your workflow test and repository review.
Next step
Open the compare page, test the install commands in a sandbox, and check each repository before using a skill in production.
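One way to try an install command without touching your working project is to run it from a throwaway directory. The sketch below is only an illustration: `try_skill` and the `DRY_RUN` variable are hypothetical names, not part of the `skills` CLI, and it assumes the install writes into the current directory, which this page does not confirm. A scratch directory limits stray files only; a container or VM is a stronger sandbox.

```shell
#!/bin/sh
# Hypothetical helper (not part of the skills CLI): run an install command
# from a throwaway directory so it cannot clutter the current project.
# DRY_RUN defaults to 1, so the command is only printed until you opt in.
try_skill() {
    repo="$1"
    # Create a fresh scratch directory; stop if that fails.
    sandbox="$(mktemp -d)" || return 1
    echo "sandbox: $sandbox"
    if [ "${DRY_RUN:-1}" = "1" ]; then
        echo "would run: npx skills add $repo"
    else
        # Subshell keeps the caller's working directory unchanged.
        (cd "$sandbox" && npx skills add "$repo")
    fi
}
```

For example, `try_skill rany2/edge-tts` prints the command it would run, and `DRY_RUN=0 try_skill rany2/edge-tts` actually runs it inside the scratch directory, which is left in place so you can inspect what was written.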