🚀 Clone a voice in 5 seconds to generate arbitrary speech in real-time
$ npx skills add babysor/MockingBird
Alternatives
Compare similar skills by workflow fit, trust score, quality, GitHub adoption, maintenance, and install readiness.
Current skill
A Python/Pytorch app for easily synthesising human voices
🚀 Clone a voice in 5 seconds to generate arbitrary speech in real-time
$ npx skills add babysor/MockingBird
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
$ npx skills add RVC-Boss/GPT-SoVITS
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
$ npx skills add jaywalnut310/vits
GPT-SoVITS ONNX Inference Engine & Model Converter
$ npx skills add High-Logic/Genie-TTS
A Flow Matching-based Text-to-Speech Model with Emoji-driven Style Control
$ npx skills add Aratako/Irodori-TTS
A Generative Flow for Text-to-Speech via Monotonic Alignment Search
$ npx skills add jaywalnut310/glow-tts
An unofficial PyTorch implementation of the audio LM VALL-E
$ npx skills add enhuiz/vall-e
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
$ npx skills add coqui-ai/TTS
unofficial vits2-TTS implementation in pytorch
$ npx skills add p0p4k/vits2_pytorch
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
$ npx skills add rany2/edge-tts
Instant voice cloning by MIT and MyShell. Audio foundation model.
$ npx skills add myshell-ai/OpenVoice
MOSS-TTS Family is an open-source speech and sound generation model family from MOSI.AI and the OpenMOSS team. It is designed for high-fidelity, high-expressiveness, and complex real-world scenarios, covering stable long-form speech, multi-speaker dialogue, voice/character design, environmental sound effects, and real-time streaming TTS.
$ npx skills add OpenMOSS/MOSS-TTS
Official Implementation of StyleTTS
$ npx skills add yl4579/StyleTTS
OmniVoice TTS nodes for ComfyUI - Zero-shot multilingual text-to-speech with voice cloning, voice design, and multi-speaker dialogue
$ npx skills add Saganaki22/ComfyUI-OmniVoice-TTS
Automatically translates the text of a video based on a subtitle file, and then uses AI voice services to create a new dubbed & translated audio track where the speech is synced using the subtitle's timings.
$ npx skills add ThioJoe/Auto-Synced-Translated-Dubs
Unofficial implementation of NVIDIA P-Flow TTS paper
$ npx skills add p0p4k/pflowtts_pytorch
How to choose
Use an alternative when it has a clearer install path, higher trust score, fresher maintenance, or better platform fit for your current agent stack. Keep Voice Cloning App if it already passes your workflow test and repository review.
Next step
Open the compare page, test the install commands in a sandbox, and check each repository before using a skill in production.