Alternatives

Whishper alternatives for AI agents.

Compare similar skills by workflow fit, trust score, quality, GitHub adoption, maintenance, and install readiness.

Current skill

Whishper

Transcribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by whisper models!

88
Quality
88
Trust
3.0K
Stars
#1

Stt

Similarity 131Trust 88Excellent 98

Voice Recognition to Text Tool / 一个离线运行的本地音视频转字幕工具,输出json、srt字幕、纯文字格式

4.6K starsJan 22, 2026 pushmedia-automationPythonSpeech
$ npx skills add jianchang512/stt
#2

Quillman

Similarity 129Trust 90Excellent 100

A voice chat app

1.2K starsMay 28, 2026 pushmedia-automationPythonSpeech
$ npx skills add modal-labs/quillman
#3

Whisper.Cpp

Similarity 128Trust 93Excellent 100

Port of OpenAI's Whisper model in C/C++

51K starsJun 22, 2026 pushmedia-automationC++Speech
$ npx skills add ggml-org/whisper.cpp
#4

WhisperX

Similarity 127Trust 95Excellent 100

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

23K starsJun 3, 2026 pushmedia-automationPythonSpeech
$ npx skills add m-bain/whisperX
#5

Faster Whisper

Similarity 126Trust 89Excellent 99

Faster Whisper transcription with CTranslate2

24K starsNov 19, 2025 pushmedia-automationPythonSpeech
$ npx skills add SYSTRAN/faster-whisper
#6

Openvino

Similarity 126Trust 93Excellent 100

OpenVINO™ is an open source toolkit for optimizing and deploying AI inference

10K starsJun 23, 2026 pushmedia-automationC++Speech
$ npx skills add openvinotoolkit/openvino
#7

Speech Recognition

Similarity 125Trust 94Excellent 100

Speech recognition module for Python, supporting several engines and APIs, online and offline.

9.0K starsJun 16, 2026 pushmedia-automationPythonSpeech
$ npx skills add Uberi/speech_recognition
#8

Whisperboard

Similarity 125Trust 88Strong 83

The open-source iOS app that's making quality voice transcription more accessible on mobile devices.

1.0K starsDec 18, 2025 pushmedia-automationSwiftSpeech
$ npx skills add Saik0s/Whisperboard
#9

Mlx Audio

Similarity 125Trust 94Excellent 100

A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.

7.4K starsJun 19, 2026 pushmedia-automationPythonSpeech
$ npx skills add Blaizzy/mlx-audio
#10

Annyang

Similarity 125Trust 92Excellent 100

💬 Speech recognition for your site

6.8K starsJun 11, 2026 pushmedia-automationTypeScriptSpeech
$ npx skills add TalAter/annyang
#11

Voice Pro

Similarity 124Trust 91Excellent 95

Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isolation, and multilingual translation.

11K starsDec 5, 2025 pushmedia-automationPythonSpeech
$ npx skills add abus-aikorea/voice-pro
#12

Cheetah

Similarity 124Trust 84Excellent 85

On-device streaming speech-to-text engine powered by deep learning

665 starsJun 10, 2026 pushmedia-automationPythonSpeech
$ npx skills add Picovoice/cheetah
#13

STT

Similarity 124Trust 81Strong 75

🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.

2.6K starsMar 11, 2024 pushmedia-automationC++Speech
$ npx skills add coqui-ai/STT
#14

Whisper Diarization

Similarity 124Trust 89Excellent 99

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

5.6K starsFeb 23, 2026 pushmedia-automationJupyter NotebookSpeech
$ npx skills add MahmoudAshraf97/whisper-diarization
#15

Open Speech Corpora

Similarity 123Trust 83Strong 72

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

1.4K starsJun 6, 2024 pushmedia-automationSpeechClaude Code
$ npx skills add coqui-ai/open-speech-corpora
#16

Tensorflow Speech Recognition

Similarity 122Trust 79Promising 69

🎙Speech recognition using the tensorflow deep learning framework, sequence-to-sequence neural networks

2.2K starsJan 17, 2024 pushmedia-automationPythonSpeech
$ npx skills add pannous/tensorflow-speech-recognition

How to choose

When should you switch?

Use an alternative when it has a clearer install path, higher trust score, fresher maintenance, or better platform fit for your current agent stack. Keep Whishper if it already passes your workflow test and repository review.

Next step

Compare top candidates side by side

Open the compare page, test the install commands in a sandbox, and check each repository before using a skill in production.