A small speech recognizer
$ npx skills add cmusphinx/pocketsphinx
Alternatives
Compare similar skills by workflow fit, trust score, quality, GitHub adoption, maintenance, and install readiness.
Current skill
Open-Source Large Vocabulary Continuous Speech Recognition Engine
A small speech recognizer
$ npx skills add cmusphinx/pocketsphinx
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
$ npx skills add m-bain/whisperX
💬 Speech recognition for your site
$ npx skills add TalAter/annyang
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
$ npx skills add MahmoudAshraf97/whisper-diarization
Voice Recognition to Text Tool: an offline, locally run tool that converts audio and video to subtitles, with output in JSON, SRT subtitle, and plain-text formats
$ npx skills add jianchang512/stt
Port of OpenAI's Whisper model in C/C++
$ npx skills add ggml-org/whisper.cpp
A PyTorch-based Speech Toolkit
$ npx skills add speechbrain/speechbrain
Speech recognition module for Python, supporting several engines and APIs, online and offline.
$ npx skills add Uberi/speech_recognition
A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.
$ npx skills add Blaizzy/mlx-audio
eSpeak NG is an open source speech synthesizer that supports more than a hundred languages and accents.
$ npx skills add espeak-ng/espeak-ng
kaldi-asr/kaldi is the official location of the Kaldi project.
$ npx skills add kaldi-asr/kaldi
Multilingual speech understanding: ASR + emotion recognition + audio event detection. 50+ languages, 15x faster than Whisper, non-autoregressive.
$ npx skills add FunAudioLLM/SenseVoice
A voice control, voice commands, speech recognition and speech synthesis JavaScript library. Create your own Siri, Google Now or Cortana with Google Chrome within your website.
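$ npx skills add sdkcarlos/artyom.js
A C++ implementation of ASR inference with few dependencies, simple installation, and fast inference; it also runs smoothly on ARM platforms such as the Raspberry Pi 4B. The supported models are optimized from Google's Transformer model and trained on the open-source wenetspeech dataset (10,000+ hours) or Alibaba's private dataset (60,000+ hours), so recognition quality is good and comparable to many commercial ASR products.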
$ npx skills add chenkui164/FastASR
Facebook AI Research's Automatic Speech Recognition Toolkit
$ npx skills add flashlight/wav2letter
SALMONN family: A suite of advanced multi-modal LLMs
$ npx skills add bytedance/SALMONN
How to choose
Use an alternative when it has a clearer install path, higher trust score, fresher maintenance, or better platform fit for your current agent stack. Keep Julius if it already passes your workflow test and repository review.
Next step
Open the compare page, test the install commands in a sandbox, and check each repository before using a skill in production.