WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
$ npx skills add m-bain/whisperX
Alternatives
Compare similar skills by workflow fit, trust score, quality, GitHub adoption, maintenance, and install readiness.
Current skill
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
$ npx skills add m-bain/whisperX
JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.
$ npx skills add sanchit-gandhi/whisper-jax
Port of OpenAI's Whisper model in C/C++
$ npx skills add ggml-org/whisper.cpp
💬 Speech recognition for your site
$ npx skills add TalAter/annyang
Voice Recognition to Text Tool / An offline, locally run tool that converts audio and video to subtitles, with output in JSON, SRT subtitle, and plain-text formats
$ npx skills add jianchang512/stt
Whisper command line client compatible with original OpenAI client based on CTranslate2.
$ npx skills add Softcatala/whisper-ctranslate2
OpenAI Whisper ASR Webservice API
$ npx skills add ahmetoner/whisper-asr-webservice
Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.
$ npx skills add Purfview/whisper-standalone-win
Faster Whisper transcription with CTranslate2
$ npx skills add SYSTRAN/faster-whisper
Speech recognition module for Python, supporting several engines and APIs, online and offline.
$ npx skills add Uberi/speech_recognition
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
$ npx skills add alphacep/vosk-api
A voice chat app
$ npx skills add modal-labs/quillman
Build real time speech2text web apps using OpenAI's Whisper https://openai.com/blog/whisper/
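$ npx skills add saharmor/whisper-playground
A PyTorch implementation of a streaming and non-streaming automatic speech recognition framework, compatible with both online and offline recognition. It currently supports the Conformer, Squeezeformer, and DeepSpeech2 models, along with multiple data augmentation methods.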
$ npx skills add yeyupiaoling/MASR
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
$ npx skills add PaddlePaddle/PaddleSpeech
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
$ npx skills add huggingface/distil-whisper
How to choose
Use an alternative when it has a clearer install path, higher trust score, fresher maintenance, or better platform fit for your current agent stack. Keep Whisper Diarization if it already passes your workflow test and repository review.
Next step
Open the compare page, test the install commands in a sandbox, and check each repository before using a skill in production.