Alternatives
Compare similar skills by workflow fit, trust score, quality, GitHub adoption, maintenance, and install readiness.
Current skill
Android Input Method Editor (IME) based on Whisper
Whisper.net. Speech to text made simple using Whisper Models
$ npx skills add sandrohanea/whisper.net
Optimized Whisper models for streaming and on-device use
$ npx skills add TheStageAI/TheWhisper
A 100% private AI voice transcription app that converts speech to text in 100+ languages. Built with Compose Multiplatform for Android & iOS using Whisper AI - no cloud uploads, all processing happens on-device for complete privacy.
$ npx skills add Notely-Voice/NotelyVoice
React Native binding of whisper.cpp.
$ npx skills add mybigday/whisper.rn
🎤 The easiest way to transcribe audio in Swift
$ npx skills add exPHAT/SwiftWhisper
On-device streaming speech-to-text engine powered by deep learning
$ npx skills add Picovoice/cheetah
turnkey self-hosted offline transcription and diarization service with llm summary
$ npx skills add transcriptionstream/transcriptionstream
The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.
$ npx skills add lkuza2/java-speech-api
Offline Speech Recognition with OpenAI Whisper and TensorFlow Lite for Android
$ npx skills add vilassn/whisper_android
Speech API examples
$ npx skills add Baidu-AIP/speech-demo
Build real time speech2text web apps using OpenAI's Whisper https://openai.com/blog/whisper/
$ npx skills add saharmor/whisper-playground
Private and on-device speech recognition keyboard and service for Android.
$ npx skills add soupslurpr/Transcribro
AI speech toolkit for Apple Silicon — ASR, TTS, speech-to-speech, VAD, and diarization powered by MLX and CoreML
$ npx skills add soniqo/speech-swift
Speech-to-text server framework with next-gen Kaldi
$ npx skills add k2-fsa/sherpa
Speech to Text to Speech. Song now playing. Sends text as OSC messages to VRChat to display on avatar. (STTTS) (Speech to TTS) (VRC STT System) (VTuber TTS)
$ npx skills add VRCWizard/TTS-Voice-Wizard
Voice-to-text with push-to-talk for Wayland compositors
$ npx skills add peteonrails/voxtype
How to choose
Use an alternative when it has a clearer install path, higher trust score, fresher maintenance, or better platform fit for your current agent stack. Keep WhisperIME if it already passes your workflow test and repository review.
Next step
Open the compare page, test the install commands in a sandbox, and check each repository before using a skill in production.