Alternatives

Voice Pro alternatives for AI agents.

Compare similar skills by workflow fit, trust score, quality, GitHub adoption, maintenance, and install readiness.

Current skill

Voice Pro

Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isolation, and multilingual translation.

Quality 95 · Trust 91 · 11K stars
#1

Mlx Audio

Similarity 139 · Trust 94 · Excellent 100

A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.

7.4K stars · Jun 19, 2026 push · media-automation · Python · Speech
$ npx skills add Blaizzy/mlx-audio
#2

AI Waifu Vtuber

Similarity 134 · Trust 87 · Excellent 97

AI Vtuber for Streaming on Youtube/Twitch

1.1K stars · May 31, 2026 push · media-automation · Python · Speech
$ npx skills add ardha27/AI-Waifu-Vtuber
#3

WhisperX

Similarity 133 · Trust 95 · Excellent 100

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

23K stars · Jun 3, 2026 push · media-automation · Python · Speech
$ npx skills add m-bain/whisperX
#4

Faster Whisper

Similarity 132 · Trust 89 · Excellent 99

Faster Whisper transcription with CTranslate2

24K stars · Nov 19, 2025 push · media-automation · Python · Speech
$ npx skills add SYSTRAN/faster-whisper
#5

Speech Recognition

Similarity 131 · Trust 94 · Excellent 100

Speech recognition module for Python, supporting several engines and APIs, online and offline.

9.0K stars · Jun 16, 2026 push · media-automation · Python · Speech
$ npx skills add Uberi/speech_recognition
#6

ComfyUI Custom Nodes AlekPet

Similarity 129 · Trust 89 · Excellent 97

Custom nodes that extend the capabilities of ComfyUI

1.5K stars · May 9, 2026 push · media-automation · JavaScript · Speech
$ npx skills add AlekPet/ComfyUI_Custom_Nodes_AlekPet
#7

Stt

Similarity 129 · Trust 88 · Excellent 98

Voice Recognition to Text Tool / An offline, locally run tool that converts audio and video to subtitles, with output as JSON, SRT subtitles, or plain text

4.6K stars · Jan 22, 2026 push · media-automation · Python · Speech
$ npx skills add jianchang512/stt
#8

Whisper Writer

Similarity 128 · Trust 81 · Strong 71

💬📝 A small dictation app using OpenAI's Whisper speech recognition model.

1.1K stars · Aug 24, 2024 push · media-automation · Python · Speech
$ npx skills add savbell/whisper-writer
#9

Whisper.Cpp

Similarity 128 · Trust 93 · Excellent 100

Port of OpenAI's Whisper model in C/C++

51K stars · Jun 22, 2026 push · media-automation · C++ · Speech
$ npx skills add ggml-org/whisper.cpp
#10

Quillman

Similarity 127 · Trust 90 · Excellent 100

A voice chat app

1.2K stars · May 28, 2026 push · media-automation · Python · Speech
$ npx skills add modal-labs/quillman
#11

Whisper Standalone Win

Similarity 127 · Trust 84 · Strong 83

Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.

3.1K stars · Nov 7, 2025 push · media-automation · Speech · Claude Code
$ npx skills add Purfview/whisper-standalone-win
#12

Irene Voice Assistant

Similarity 126 · Trust 89 · Excellent 97

Irina, a Russian voice assistant for offline use. Supports skills through plugins.

1.1K stars · Jun 13, 2026 push · media-automation · Python · Speech
$ npx skills add janvarev/Irene-Voice-Assistant
#13

Whisper Ctranslate2

Similarity 126 · Trust 88 · Excellent 93

Whisper command line client compatible with original OpenAI client based on CTranslate2.

1.3K stars · Feb 14, 2026 push · media-automation · Python · Speech
$ npx skills add Softcatala/whisper-ctranslate2
#14

Whisper Asr Webservice

Similarity 126 · Trust 83 · Excellent 89

OpenAI Whisper ASR Webservice API

3.3K stars · Nov 23, 2025 push · media-automation · Python · Speech
$ npx skills add ahmetoner/whisper-asr-webservice
#15

Annyang

Similarity 125 · Trust 92 · Excellent 100

💬 Speech recognition for your site

6.8K stars · Jun 11, 2026 push · media-automation · TypeScript · Speech
$ npx skills add TalAter/annyang
#16

PaddleSpeech

Similarity 124 · Trust 95 · Excellent 100

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

13K stars · Jun 21, 2026 push · media-automation · Python · Speech
$ npx skills add PaddlePaddle/PaddleSpeech

How to choose

When should you switch?

Use an alternative when it has a clearer install path, higher trust score, fresher maintenance, or better platform fit for your current agent stack. Keep Voice Pro if it already passes your workflow test and repository review.
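
The trust and maintenance criteria above can be applied mechanically as a first pass. The following is a minimal sketch, not part of this directory's tooling: the `Skill` class, the `shortlist` function, the 180-day freshness window, and the reference date are all assumptions chosen for illustration. The trust scores and push dates are copied from the listing above.

```python
from dataclasses import dataclass
from datetime import date


@dataclass
class Skill:
    name: str
    trust: int        # trust score as shown in the listing
    last_push: date   # most recent push date as shown in the listing


def shortlist(candidates, current_trust, today, max_age_days=180):
    """Keep candidates with a higher trust score than the current skill
    and a push within max_age_days (an assumed freshness window),
    ordered by trust score, highest first."""
    keep = [
        c for c in candidates
        if c.trust > current_trust
        and (today - c.last_push).days <= max_age_days
    ]
    return sorted(keep, key=lambda c: c.trust, reverse=True)


candidates = [
    Skill("Mlx Audio", 94, date(2026, 6, 19)),
    Skill("WhisperX", 95, date(2026, 6, 3)),
    Skill("Faster Whisper", 89, date(2025, 11, 19)),
    Skill("Whisper Writer", 81, date(2024, 8, 24)),
]

# Voice Pro's trust score is 91; the reference date is an assumed example.
result = shortlist(candidates, current_trust=91, today=date(2026, 7, 1))
print([c.name for c in result])  # ['WhisperX', 'Mlx Audio']
```

A filter like this only narrows the list. Install path and platform fit still need a manual check against your agent stack.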

Next step

Compare top candidates side by side

Open the compare page, test the install commands in a sandbox, and check each repository before using a skill in production.