An unofficial PyTorch implementation of the audio LM VALL-E
$ npx skills add enhuiz/vall-eAlternatives
Compare similar skills by workflow fit, trust score, quality, GitHub adoption, maintenance, and install readiness.
Current skill
PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html
An unofficial PyTorch implementation of the audio LM VALL-E
$ npx skills add enhuiz/vall-e1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
$ npx skills add RVC-Boss/GPT-SoVITS🚀Clone a voice in 5 seconds to generate arbitrary speech in real-time
$ npx skills add babysor/MockingBirdAn Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
$ npx skills add index-tts/index-ttsUse Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
$ npx skills add rany2/edge-ttsInstant voice cloning by MIT and MyShell. Audio foundation model.
$ npx skills add myshell-ai/OpenVoiceAutomatically translates the text of a video based on a subtitle file, and then uses AI voice services to create a new dubbed & translated audio track where the speech is synced using the subtitle's timings.
$ npx skills add ThioJoe/Auto-Synced-Translated-DubsGPT-SoVITS ONNX Inference Engine & Model Converter
$ npx skills add High-Logic/Genie-TTSInterface for OuteTTS models.
$ npx skills add edwko/OuteTTSVoxCPM2: Tokenizer-Free TTS for Multilingual Speech Generation, Creative Voice Design, and True-to-Life Cloning
$ npx skills add OpenBMB/VoxCPMVITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
$ npx skills add jaywalnut310/vitsSoprano: Instant, Ultra-Realistic Text-to-Speech
$ npx skills add ekwek1/sopranoTranslate the video from one language to another and embed dubbing & subtitles.
$ npx skills add jianchang512/pyvideotransA TTS model capable of generating ultra-realistic dialogue in one pass.
$ npx skills add nari-labs/dia[AutoArk] GPA (General Purpose Audio) can do ASR, TTS and voice conversion with one tiny model!
$ npx skills add AutoArk/GPAA Flow Matching-based Text-to-Speech Model with Emoji-driven Style Control
$ npx skills add Aratako/Irodori-TTSHow to choose
Use an alternative when it has a clearer install path, higher trust score, fresher maintenance, or better platform fit for your current agent stack. Keep Vall E if it already passes your workflow test and repository review.
Next step
Open the compare page, test the install commands in a sandbox, and check each repository before using a skill in production.