Voice Recognition to Text Tool / 一个离线运行的本地音视频转字幕工具,输出json、srt字幕、纯文字格式
$ npx skills add jianchang512/sttAlternatives
Compare similar skills by workflow fit, trust score, quality, GitHub adoption, maintenance, and install readiness.
Current skill
Transcribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by whisper models!
Voice Recognition to Text Tool / 一个离线运行的本地音视频转字幕工具,输出json、srt字幕、纯文字格式
$ npx skills add jianchang512/sttA voice chat app
$ npx skills add modal-labs/quillmanPort of OpenAI's Whisper model in C/C++
$ npx skills add ggml-org/whisper.cppWhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
$ npx skills add m-bain/whisperXFaster Whisper transcription with CTranslate2
$ npx skills add SYSTRAN/faster-whisperOpenVINO™ is an open source toolkit for optimizing and deploying AI inference
$ npx skills add openvinotoolkit/openvinoSpeech recognition module for Python, supporting several engines and APIs, online and offline.
$ npx skills add Uberi/speech_recognitionThe open-source iOS app that's making quality voice transcription more accessible on mobile devices.
$ npx skills add Saik0s/WhisperboardA text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.
$ npx skills add Blaizzy/mlx-audio💬 Speech recognition for your site
$ npx skills add TalAter/annyangGradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isolation, and multilingual translation.
$ npx skills add abus-aikorea/voice-proOn-device streaming speech-to-text engine powered by deep learning
$ npx skills add Picovoice/cheetah🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.
$ npx skills add coqui-ai/STTAutomatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
$ npx skills add MahmoudAshraf97/whisper-diarization💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
$ npx skills add coqui-ai/open-speech-corpora🎙Speech recognition using the tensorflow deep learning framework, sequence-to-sequence neural networks
$ npx skills add pannous/tensorflow-speech-recognitionHow to choose
Use an alternative when it has a clearer install path, higher trust score, fresher maintenance, or better platform fit for your current agent stack. Keep Whishper if it already passes your workflow test and repository review.
Next step
Open the compare page, test the install commands in a sandbox, and check each repository before using a skill in production.