Alternatives

Voxtream alternatives for AI agents.

Compare similar skills by workflow fit, trust score, quality, GitHub adoption, maintenance, and install readiness.

Current skill

Voxtream

VoXtream is a Full-Stream Zero-shot TTS model with Extremely Low Latency and Speaking rate Control

81
Quality
83
Trust
241
Stars
#1

GPT SoVITS

Similarity 158Trust 95Excellent 100

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

59K starsJun 20, 2026 pushmedia-automationPythonVoice
$ npx skills add RVC-Boss/GPT-SoVITS
#2

Irodori TTS

Similarity 155Trust 84Excellent 87

A Flow Matching-based Text-to-Speech Model with Emoji-driven Style Control

965 starsJun 4, 2026 pushmedia-automationPythonVoice
$ npx skills add Aratako/Irodori-TTS
#3

Edge Tts

Similarity 154Trust 84Excellent 99

Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key

11K starsMar 22, 2026 pushmedia-automationPythonVoice
$ npx skills add rany2/edge-tts
#4

Genie TTS

Similarity 151Trust 89Excellent 97

GPT-SoVITS ONNX Inference Engine & Model Converter

1.6K starsApr 18, 2026 pushmedia-automationPythonVoice
$ npx skills add High-Logic/Genie-TTS
#5

Vits

Similarity 150Trust 85Strong 79

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

7.9K starsDec 6, 2023 pushmedia-automationPythonVoice
$ npx skills add jaywalnut310/vits
#6

MockingBird

Similarity 149Trust 87Excellent 100

🚀Clone a voice in 5 seconds to generate arbitrary speech in real-time

37K starsMar 3, 2026 pushmedia-automationPythonVoice
$ npx skills add babysor/MockingBird
#7

OpenVoice

Similarity 146Trust 86Excellent 87

Instant voice cloning by MIT and MyShell. Audio foundation model.

37K starsApr 19, 2025 pushmedia-automationPythonVoice
$ npx skills add myshell-ai/OpenVoice
#8

MOSS TTS

Similarity 146Trust 93Excellent 100

MOSS‑TTS Family is an open‑source speech and sound generation model family from MOSI.AI and the OpenMOSS team. It is designed for high‑fidelity, high‑expressiveness, and complex real‑world scenarios, covering stable long‑form speech, multi‑speaker dialogue, voice/character design, environmental sound effects, and real‑time streaming TTS.

3.5K starsJun 18, 2026 pushmedia-automationPythonVoice
$ npx skills add OpenMOSS/MOSS-TTS
#9

RealtimeTTS

Similarity 146Trust 91Excellent 100

Converts text to speech in realtime

4.0K starsMay 31, 2026 pushmedia-automationPythonVoice
$ npx skills add KoljaB/RealtimeTTS
#10

ComfyUI OmniVoice TTS

Similarity 145Trust 83Strong 84

OmniVoice TTS nodes for ComfyUI - Zero-shot multilingual text-to-speech with voice cloning, voice design, and multi-speaker dialogue

432 starsJun 11, 2026 pushmedia-automationPythonVoice
$ npx skills add Saganaki22/ComfyUI-OmniVoice-TTS
#11

Voice Cloning App

Similarity 145Trust 83Strong 72

A Python/Pytorch app for easily synthesising human voices

1.4K starsDec 2, 2024 pushmedia-automationPythonVoice
$ npx skills add voice-cloning-app/Voice-Cloning-App
#12

Auto Synced Translated Dubs

Similarity 144Trust 91Excellent 98

Automatically translates the text of a video based on a subtitle file, and then uses AI voice services to create a new dubbed & translated audio track where the speech is synced using the subtitle's timings.

1.7K starsMay 11, 2026 pushmedia-automationPythonVoice
$ npx skills add ThioJoe/Auto-Synced-Translated-Dubs
#13

OuteTTS

Similarity 143Trust 87Excellent 97

Interface for OuteTTS models.

1.4K starsMar 23, 2026 pushmedia-automationPythonVoice
$ npx skills add edwko/OuteTTS
#14

Soprano

Similarity 141Trust 87Excellent 92

Soprano: Instant, Ultra-Realistic Text-to-Speech

1.2K starsJan 15, 2026 pushmedia-automationPythonVoice
$ npx skills add ekwek1/soprano
#15

Pyvideotrans

Similarity 141Trust 93Excellent 100

Translate the video from one language to another and embed dubbing & subtitles.

18K starsJun 19, 2026 pushmedia-automationPythonVoice
$ npx skills add jianchang512/pyvideotrans
#16

Index Tts

Similarity 140Trust 90Excellent 100

An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

21K starsJun 16, 2026 pushmedia-automationPythonVoice
$ npx skills add index-tts/index-tts

How to choose

When should you switch?

Use an alternative when it has a clearer install path, higher trust score, fresher maintenance, or better platform fit for your current agent stack. Keep Voxtream if it already passes your workflow test and repository review.

Next step

Compare top candidates side by side

Open the compare page, test the install commands in a sandbox, and check each repository before using a skill in production.