Unofficial PyTorch Implementation of UnivNet Vocoder (https://arxiv.org/abs/2106.07889)
$ npx skills add maum-ai/univnetAlternatives
Compare similar skills by workflow fit, trust score, quality, GitHub adoption, maintenance, and install readiness.
Current skill
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Unofficial PyTorch Implementation of UnivNet Vocoder (https://arxiv.org/abs/2106.07889)
$ npx skills add maum-ai/univnetControllable and fast Text-to-Speech for over 7000 languages!
$ npx skills add DigitalPhonetics/IMS-ToucanVITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
$ npx skills add jaywalnut310/vits🚀Clone a voice in 5 seconds to generate arbitrary speech in real-time
$ npx skills add babysor/MockingBirdFoundational model for human-like, expressive TTS
$ npx skills add metavoiceio/metavoice-srcVietnamese TTS with instant voice cloning • On-device • Real-time CPU inference • 24kHz audio quality • Chuyển văn bản thành giọng nói tiếng Việt • Text to speech tiếng Việt • TTS tiếng Việt
$ npx skills add pnnbao97/VieNeu-TTSStyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
$ npx skills add yl4579/StyleTTS2A Generative Flow for Text-to-Speech via Monotonic Alignment Search
$ npx skills add jaywalnut310/glow-tts🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
$ npx skills add coqui-ai/TTSUse Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
$ npx skills add rany2/edge-ttsWaveRNN Vocoder + TTS
$ npx skills add fatchord/WaveRNNA simple, high-quality voice conversion tool focused on ease of use and performance.
$ npx skills add IAHispano/ApplioOfficial Implementation of StyleTTS
$ npx skills add yl4579/StyleTTSA Python/Pytorch app for easily synthesising human voices
$ npx skills add voice-cloning-app/Voice-Cloning-AppHiFTNet: A Fast High-Quality Neural Vocoder with Harmonic-plus-Noise Filter and Inverse Short Time Fourier Transform
$ npx skills add yl4579/HiFTNetMultilingual TTS model with voice cloning and duration control, based on T5Gemma encoder-decoder LLM
$ npx skills add Aratako/T5Gemma-TTSHow to choose
Use an alternative when it has a clearer install path, higher trust score, fresher maintenance, or better platform fit for your current agent stack. Keep Hifi Gan if it already passes your workflow test and repository review.
Next step
Open the compare page, test the install commands in a sandbox, and check each repository before using a skill in production.