Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
$ npx skills add MahmoudAshraf97/whisper-diarizationAlternatives
Compare similar skills by workflow fit, trust score, quality, GitHub adoption, maintenance, and install readiness.
Current skill
Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
$ npx skills add MahmoudAshraf97/whisper-diarizationPort of OpenAI's Whisper model in C/C++
$ npx skills add ggml-org/whisper.cppWhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
$ npx skills add m-bain/whisperXFaster Whisper transcription with CTranslate2
$ npx skills add SYSTRAN/faster-whisperGradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isolation, and multilingual translation.
$ npx skills add abus-aikorea/voice-pro💬📝 A small dictation app using OpenAI's Whisper speech recognition model.
$ npx skills add savbell/whisper-writerFine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training without speech data. Accelerate inference and support Web deployment, Windows desktop deployment, and Android deployment
$ npx skills add yeyupiaoling/Whisper-FinetuneAI Vtuber for Streaming on Youtube/Twitch
$ npx skills add ardha27/AI-Waifu-VtuberOpenAI Whisper ASR Webservice API
$ npx skills add ahmetoner/whisper-asr-webserviceEasy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
$ npx skills add PaddlePaddle/PaddleSpeechOffline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
$ npx skills add alphacep/vosk-apiA PyTorch-based Speech Toolkit
$ npx skills add speechbrain/speechbrainSpeech recognition module for Python, supporting several engines and APIs, online and offline.
$ npx skills add Uberi/speech_recognitionThe open-source iOS app that's making quality voice transcription more accessible on mobile devices.
$ npx skills add Saik0s/WhisperboardA text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.
$ npx skills add Blaizzy/mlx-audioRunning speech to text model (whisper.cpp) in Unity3d on your local machine.
$ npx skills add Macoron/whisper.unityHow to choose
Use an alternative when it has a clearer install path, higher trust score, fresher maintenance, or better platform fit for your current agent stack. Keep Whisper Standalone Win if it already passes your workflow test and repository review.
Next step
Open the compare page, test the install commands in a sandbox, and check each repository before using a skill in production.