WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
$ npx skills add m-bain/whisperX
Alternatives
Compare similar skills by workflow fit, trust score, quality, GitHub adoption, maintenance, and install readiness.
Current skill
💬 Speech recognition for your site
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
$ npx skills add m-bain/whisperX
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
$ npx skills add MahmoudAshraf97/whisper-diarization
Voice Recognition to Text Tool / An offline, locally run audio and video to subtitle tool that outputs JSON, SRT subtitles, and plain text
$ npx skills add jianchang512/stt
Speech Recognition for React Native Expo projects
$ npx skills add jamsch/expo-speech-recognition
Port of OpenAI's Whisper model in C/C++
$ npx skills add ggml-org/whisper.cpp
Speech recognition module for Python, supporting several engines and APIs, online and offline.
$ npx skills add Uberi/speech_recognition
A simple Azure Speech Service module that uses the Microsoft Edge Read Aloud API. https://www.npmjs.com/package/msedge-tts
$ npx skills add Migushthe2nd/MsEdgeTTS
A voice chat app
$ npx skills add modal-labs/quillman
Whisper command line client compatible with original OpenAI client based on CTranslate2.
$ npx skills add Softcatala/whisper-ctranslate2
OpenAI Whisper ASR Webservice API
$ npx skills add ahmetoner/whisper-asr-webservice
Faster Whisper transcription with CTranslate2
$ npx skills add SYSTRAN/faster-whisper
JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.
$ npx skills add sanchit-gandhi/whisper-jax
A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.
$ npx skills add Blaizzy/mlx-audio
Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isolation, and multilingual translation.
$ npx skills add abus-aikorea/voice-pro
Open-Source Large Vocabulary Continuous Speech Recognition Engine
$ npx skills add julius-speech/julius
A small speech recognizer
$ npx skills add cmusphinx/pocketsphinx
How to choose
Use an alternative when it has a clearer install path, a higher trust score, more recent maintenance, or a better platform fit for your current agent stack. Keep Annyang if it already passes your workflow test and repository review.
Next step
Open the compare page, test the install commands in a sandbox, and check each repository before using a skill in production.