WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
$ npx skills add m-bain/whisperX
Alternatives
Compare similar skills by workflow fit, trust score, quality, GitHub adoption, maintenance, and install readiness.
Current skill
A Deep-Learning-Based Chinese Speech Recognition System
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
$ npx skills add m-bain/whisperX
Speech recognition module for Python, supporting several engines and APIs, online and offline.
$ npx skills add Uberi/speech_recognition
End-to-end Automatic Speech Recognition for Mandarin and English in TensorFlow
$ npx skills add zzw922cn/Automatic_Speech_Recognition
Voice Recognition to Text Tool: a local audio/video-to-subtitle tool that runs offline and outputs JSON, SRT subtitles, and plain text
$ npx skills add jianchang512/stt
Chinese speech recognition; Mandarin Automatic Speech Recognition
$ npx skills add nobody132/masr
Tools for handling multimodal data in machine learning projects.
$ npx skills add lhotse-speech/lhotse
A voice chat app
$ npx skills add modal-labs/quillman
Irina: a Russian voice assistant for offline use. Supports skills through plugins.
$ npx skills add janvarev/Irene-Voice-Assistant
Whisper command line client compatible with original OpenAI client based on CTranslate2.
$ npx skills add Softcatala/whisper-ctranslate2
OpenAI Whisper ASR Webservice API
$ npx skills add ahmetoner/whisper-asr-webservice
Automatic Speech Recognition (ASR) and Text-To-Speech (TTS) engine. Chinese and English speech recognition, multi-speaker speech synthesis, multilingual support, high accuracy
$ npx skills add shibing624/parrots
[Unofficial] PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)
$ npx skills add sooftware/conformer
Speech-to-text server framework with next-gen Kaldi
$ npx skills add k2-fsa/sherpa
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
$ npx skills add PaddlePaddle/PaddleSpeech
Faster Whisper transcription with CTranslate2
$ npx skills add SYSTRAN/faster-whisper
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
$ npx skills add huggingface/distil-whisper
How to choose
Use an alternative when it has a clearer install path, higher trust score, fresher maintenance, or better platform fit for your current agent stack. Keep ASRT SpeechRecognition if it already passes your workflow test and repository review.
Next step
Open the compare page, test the install commands in a sandbox, and check each repository before using a skill in production.