Alternatives

ComfyUI VibeVoice alternatives for AI agents.

Compare similar skills by workflow fit, trust score, quality, GitHub adoption, maintenance, and install readiness.

Current skill

ComfyUI VibeVoice

ComfyUI custom node for the VibeVoice TTS. Expressive, long-form, multi-speaker conversational audio

67
Quality
83
Trust
586
Stars
#1

GPA

Similarity 124Trust 88Excellent 87

[AutoArk] GPA (General Purpose Audio) can do ASR, TTS and voice conversion with one tiny model!

866 starsMay 25, 2026 pushmedia-automationPythonVoice
$ npx skills add AutoArk/GPA
#2

FireRedTTS

Similarity 119Trust 81Promising 69

An Open-Sourced LLM-empowered Foundation TTS System

912 starsSep 28, 2025 pushmedia-automationPythonVoice
$ npx skills add FireRedTeam/FireRedTTS
#3

Chatterbox TTS Extended

Similarity 118Trust 83Promising 67

Modified version of Chatterbox that accepts text files as input and no character restrictions. I use it to make audiobooks, especially for my kids.

565 starsAug 23, 2025 pushmedia-automationPythonVoice
$ npx skills add petermg/Chatterbox-TTS-Extended
#4

CloneTTS

Similarity 116Trust 84Strong 81

A lightweight, offline Android Text-to-Speech (TTS) engine enabling seamless system-wide voice cloning and high-fidelity text reading. / 运行在安卓本地的轻量级文字转语音 (TTS) 引擎,支持离线发音人提取、零门槛音色克隆与双擎系统级全局听书。

692 starsMay 18, 2026 pushmedia-automationVoiceClaude Code
$ npx skills add sipeter/CloneTTS
#5

Voice AI

Similarity 115Trust 84Strong 81

Rapida is an open-source, end-to-end voice AI orchestration platform for building real-time conversational voice agents with audio streaming, STT, TTS, VAD, multi-channel integration, agent state management, and observability.

666 starsJun 16, 2026 pushmedia-automationGoVoice
$ npx skills add rapidaai/voice-ai
#6

Alexandria Audiobook

Similarity 115Trust 88Excellent 86

AI-powered multi-voice audiobook generator — LLM script annotation, voice cloning, voice design, LoRA training, per-line style control, and export to MP3, chaptered M4B, or Audacity multi-track. Built on Qwen3-TTS.

682 starsJun 4, 2026 pushmedia-automationPythonVoice
$ npx skills add Finrandojin/alexandria-audiobook
#7

Glow Tts

Similarity 115Trust 75Promising 55

A Generative Flow for Text-to-Speech via Monotonic Alignment Search

714 starsJul 12, 2022 pushmedia-automationPythonVoice
$ npx skills add jaywalnut310/glow-tts
#8

Pandrator

Similarity 115Trust 89Excellent 85

Turn PDFs and EPUBs into audiobooks; subtitles or videos into dubbed videos (including translation), and more. For free. Pandrator uses local models, including voice-cloning (instant, RVC-enhanced, XTTS fine-tuning) and LLM processing. It aspires to be a user-friendly app with a GUI, an installer and all-in-one packages.

572 starsJun 14, 2026 pushmedia-automationPythonVoice
$ npx skills add lukaszliniewicz/Pandrator
#9

F5 Tts Mlx

Similarity 115Trust 77Needs review 54

Implementation of F5-TTS in MLX

635 starsMar 19, 2025 pushmedia-automationPythonVoice
$ npx skills add lucasnewman/f5-tts-mlx
#10

Transformer TTS

Similarity 115Trust 74Promising 55

A Pytorch Implementation of "Neural Speech Synthesis with Transformer Network"

690 starsNov 8, 2023 pushmedia-automationPythonVoice
$ npx skills add soobinseo/Transformer-TTS
#11

Orpheus Tts Local

Similarity 115Trust 77Needs review 54

Run Orpheus 3B Locally With LM Studio

543 starsMar 20, 2025 pushmedia-automationPythonVoice
$ npx skills add isaiahbjork/orpheus-tts-local
#12

NaturalVoiceSAPIAdapter

Similarity 114Trust 82Strong 76

Make Azure natural TTS voices accessible to any SAPI 5-compatible application.

799 starsJan 2, 2026 pushmedia-automationC++Voice
$ npx skills add gexgd0419/NaturalVoiceSAPIAdapter
#13

Vits2 Pytorch

Similarity 114Trust 75Needs review 54

unofficial vits2-TTS implementation in pytorch

549 starsMar 28, 2024 pushmedia-automationPythonVoice
$ npx skills add p0p4k/vits2_pytorch
#14

Vui

Similarity 113Trust 82Strong 81

Real-time voice assistant — WebRTC streaming, faster-whisper ASR, local LLM, Vui Nano (300M) TTS. OpenAI Realtime API compatible. Voice cloning, barge-in, ~9× realtime on a 4090. Apache 2.0.

701 starsJun 12, 2026 pushmedia-automationPythonVoice
$ npx skills add fluxions-ai/vui
#15

Tiktok Voice

Similarity 113Trust 72Needs review 49

Simple Python script to interact with the TikTok TTS API

602 starsOct 12, 2024 pushmedia-automationPythonVoice
$ npx skills add oscie57/tiktok-voice
#16

E2 Tts Pytorch

Similarity 112Trust 85Strong 74

Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in Pytorch

517 starsDec 20, 2025 pushmedia-automationPythonVoice
$ npx skills add lucidrains/e2-tts-pytorch

How to choose

When should you switch?

Use an alternative when it has a clearer install path, higher trust score, fresher maintenance, or better platform fit for your current agent stack. Keep ComfyUI VibeVoice if it already passes your workflow test and repository review.

Next step

Compare top candidates side by side

Open the compare page, test the install commands in a sandbox, and check each repository before using a skill in production.