On-device streaming speech-to-text engine powered by deep learning
$ npx skills add Picovoice/cheetahAlternatives
Compare similar skills by workflow fit, trust score, quality, GitHub adoption, maintenance, and install readiness.
Current skill
🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.
On-device streaming speech-to-text engine powered by deep learning
$ npx skills add Picovoice/cheetahOpenAI Whisper ASR Webservice API
$ npx skills add ahmetoner/whisper-asr-webserviceWhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
$ npx skills add m-bain/whisperXOffline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
$ npx skills add alphacep/vosk-apiFaster Whisper transcription with CTranslate2
$ npx skills add SYSTRAN/faster-whisperA PyTorch-based Speech Toolkit
$ npx skills add speechbrain/speechbrainPort of OpenAI's Whisper model in C/C++
$ npx skills add ggml-org/whisper.cppProduction First and Production Ready End-to-End Speech Recognition Toolkit
$ npx skills add wenet-e2e/wenetAutomatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
$ npx skills add MahmoudAshraf97/whisper-diarizationOpenVINO™ is an open source toolkit for optimizing and deploying AI inference
$ npx skills add openvinotoolkit/openvinoVoice Recognition to Text Tool / 一个离线运行的本地音视频转字幕工具,输出json、srt字幕、纯文字格式
$ npx skills add jianchang512/stt🎙Speech recognition using the tensorflow deep learning framework, sequence-to-sequence neural networks
$ npx skills add pannous/tensorflow-speech-recognitionOpen-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR benchmarks, while also offering outstanding singing lyrics recognition capability.
$ npx skills add FireRedTeam/FireRedASRPytorch实现的流式与非流式的自动语音识别框架,同时兼容在线和离线识别,目前支持Conformer、Squeezeformer、DeepSpeech2模型,支持多种数据增强方法。
$ npx skills add yeyupiaoling/MASRFacebook AI Research's Automatic Speech Recognition Toolkit
$ npx skills add flashlight/wav2letterTranscribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by whisper models!
$ npx skills add pluja/whishperHow to choose
Use an alternative when it has a clearer install path, higher trust score, fresher maintenance, or better platform fit for your current agent stack. Keep STT if it already passes your workflow test and repository review.
Next step
Open the compare page, test the install commands in a sandbox, and check each repository before using a skill in production.