Voice Recognition to Text Tool / 一个离线运行的本地音视频转字幕工具,输出json、srt字幕、纯文字格式
$ npx skills add jianchang512/sttAlternatives
Compare similar skills by workflow fit, trust score, quality, GitHub adoption, maintenance, and install readiness.
Current skill
🎙Speech recognition using the tensorflow deep learning framework, sequence-to-sequence neural networks
Voice Recognition to Text Tool / 一个离线运行的本地音视频转字幕工具,输出json、srt字幕、纯文字格式
$ npx skills add jianchang512/sttWhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
$ npx skills add m-bain/whisperXSpeech recognition module for Python, supporting several engines and APIs, online and offline.
$ npx skills add Uberi/speech_recognitionA voice chat app
$ npx skills add modal-labs/quillmanWhisper command line client compatible with original OpenAI client based on CTranslate2.
$ npx skills add Softcatala/whisper-ctranslate2Faster Whisper transcription with CTranslate2
$ npx skills add SYSTRAN/faster-whisperPort of OpenAI's Whisper model in C/C++
$ npx skills add ggml-org/whisper.cppTools for handling multimodal data in machine learning projects.
$ npx skills add lhotse-speech/lhotseBuild real time speech2text web apps using OpenAI's Whisper https://openai.com/blog/whisper/
$ npx skills add saharmor/whisper-playgroundTranslate the video from one language to another and embed dubbing & subtitles.
$ npx skills add jianchang512/pyvideotrans🚀Clone a voice in 5 seconds to generate arbitrary speech in real-time
$ npx skills add babysor/MockingBirdИрина - русский голосовой ассистент для работы оффлайн. Поддерживает скиллы через плагины.
$ npx skills add janvarev/Irene-Voice-AssistantOpenAI Whisper ASR Webservice API
$ npx skills add ahmetoner/whisper-asr-webserviceJAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.
$ npx skills add sanchit-gandhi/whisper-jax💬 Speech recognition for your site
$ npx skills add TalAter/annyang🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.
$ npx skills add coqui-ai/STTHow to choose
Use an alternative when it has a clearer install path, higher trust score, fresher maintenance, or better platform fit for your current agent stack. Keep Tensorflow Speech Recognition if it already passes your workflow test and repository review.
Next step
Open the compare page, test the install commands in a sandbox, and check each repository before using a skill in production.