Alternatives

ComfyUI OmniVoice TTS alternatives for AI agents.

Compare similar skills by workflow fit, trust score, quality, GitHub adoption, maintenance, and install readiness.

Current skill

ComfyUI OmniVoice TTS

OmniVoice TTS nodes for ComfyUI - Zero-shot multilingual text-to-speech with voice cloning, voice design, and multi-speaker dialogue

84
Quality
83
Trust
432
Stars
#1

GPT SoVITS

Similarity 166Trust 95Excellent 100

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

59K starsJun 20, 2026 pushmedia-automationPythonVoice
$ npx skills add RVC-Boss/GPT-SoVITS
#2

Genie TTS

Similarity 159Trust 89Excellent 97

GPT-SoVITS ONNX Inference Engine & Model Converter

1.6K starsApr 18, 2026 pushmedia-automationPythonVoice
$ npx skills add High-Logic/Genie-TTS
#3

OpenVoice

Similarity 154Trust 86Excellent 87

Instant voice cloning by MIT and MyShell. Audio foundation model.

37K starsApr 19, 2025 pushmedia-automationPythonVoice
$ npx skills add myshell-ai/OpenVoice
#4

MockingBird

Similarity 149Trust 87Excellent 100

🚀Clone a voice in 5 seconds to generate arbitrary speech in real-time

37K starsMar 3, 2026 pushmedia-automationPythonVoice
$ npx skills add babysor/MockingBird
#5

Index Tts

Similarity 148Trust 90Excellent 100

An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

21K starsJun 16, 2026 pushmedia-automationPythonVoice
$ npx skills add index-tts/index-tts
#6

Irodori TTS

Similarity 147Trust 84Excellent 87

A Flow Matching-based Text-to-Speech Model with Emoji-driven Style Control

965 starsJun 4, 2026 pushmedia-automationPythonVoice
$ npx skills add Aratako/Irodori-TTS
#7

Edge Tts

Similarity 146Trust 84Excellent 99

Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key

11K starsMar 22, 2026 pushmedia-automationPythonVoice
$ npx skills add rany2/edge-tts
#8

MOSS TTS

Similarity 146Trust 93Excellent 100

MOSS‑TTS Family is an open‑source speech and sound generation model family from MOSI.AI and the OpenMOSS team. It is designed for high‑fidelity, high‑expressiveness, and complex real‑world scenarios, covering stable long‑form speech, multi‑speaker dialogue, voice/character design, environmental sound effects, and real‑time streaming TTS.

3.5K starsJun 18, 2026 pushmedia-automationPythonVoice
$ npx skills add OpenMOSS/MOSS-TTS
#9

Voice Cloning App

Similarity 145Trust 83Strong 72

A Python/Pytorch app for easily synthesising human voices

1.4K starsDec 2, 2024 pushmedia-automationPythonVoice
$ npx skills add voice-cloning-app/Voice-Cloning-App
#10

Auto Synced Translated Dubs

Similarity 144Trust 91Excellent 98

Automatically translates the text of a video based on a subtitle file, and then uses AI voice services to create a new dubbed & translated audio track where the speech is synced using the subtitle's timings.

1.7K starsMay 11, 2026 pushmedia-automationPythonVoice
$ npx skills add ThioJoe/Auto-Synced-Translated-Dubs
#11

Voxtream

Similarity 144Trust 83Strong 81

VoXtream is a Full-Stream Zero-shot TTS model with Extremely Low Latency and Speaking rate Control

241 starsMay 30, 2026 pushmedia-automationPythonVoice
$ npx skills add herimor/voxtream
#12

OuteTTS

Similarity 143Trust 87Excellent 97

Interface for OuteTTS models.

1.4K starsMar 23, 2026 pushmedia-automationPythonVoice
$ npx skills add edwko/OuteTTS
#13

Vits

Similarity 142Trust 85Strong 79

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

7.9K starsDec 6, 2023 pushmedia-automationPythonVoice
$ npx skills add jaywalnut310/vits
#14

Soprano

Similarity 141Trust 87Excellent 92

Soprano: Instant, Ultra-Realistic Text-to-Speech

1.2K starsJan 15, 2026 pushmedia-automationPythonVoice
$ npx skills add ekwek1/soprano
#15

Pyvideotrans

Similarity 141Trust 93Excellent 100

Translate the video from one language to another and embed dubbing & subtitles.

18K starsJun 19, 2026 pushmedia-automationPythonVoice
$ npx skills add jianchang512/pyvideotrans
#16

Dia

Similarity 140Trust 89Excellent 98

A TTS model capable of generating ultra-realistic dialogue in one pass.

19K starsNov 19, 2025 pushmedia-automationPythonVoice
$ npx skills add nari-labs/dia

How to choose

When should you switch?

Use an alternative when it has a clearer install path, higher trust score, fresher maintenance, or better platform fit for your current agent stack. Keep ComfyUI OmniVoice TTS if it already passes your workflow test and repository review.

Next step

Compare top candidates side by side

Open the compare page, test the install commands in a sandbox, and check each repository before using a skill in production.