WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
$ npx skills add m-bain/whisperXAlternatives
Compare similar skills by workflow fit, trust score, quality, GitHub adoption, maintenance, and install readiness.
Current skill
Ирина - русский голосовой ассистент для работы оффлайн. Поддерживает скиллы через плагины.
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
$ npx skills add m-bain/whisperXUse Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
$ npx skills add rany2/edge-ttsSpeech recognition module for Python, supporting several engines and APIs, online and offline.
$ npx skills add Uberi/speech_recognitionA talking LLM that runs on your own computer without needing the internet.
$ npx skills add vndee/local-talking-llmVoice Recognition to Text Tool / 一个离线运行的本地音视频转字幕工具,输出json、srt字幕、纯文字格式
$ npx skills add jianchang512/stt1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
$ npx skills add RVC-Boss/GPT-SoVITSVITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
$ npx skills add jaywalnut310/vitsA voice chat app
$ npx skills add modal-labs/quillman🚀Clone a voice in 5 seconds to generate arbitrary speech in real-time
$ npx skills add babysor/MockingBirdAI Vtuber for Streaming on Youtube/Twitch
$ npx skills add ardha27/AI-Waifu-VtuberWhisper command line client compatible with original OpenAI client based on CTranslate2.
$ npx skills add Softcatala/whisper-ctranslate2A Flow Matching-based Text-to-Speech Model with Emoji-driven Style Control
$ npx skills add Aratako/Irodori-TTSDistilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
$ npx skills add huggingface/distil-whisperInstant voice cloning by MIT and MyShell. Audio foundation model.
$ npx skills add myshell-ai/OpenVoiceConverts text to speech in realtime
$ npx skills add KoljaB/RealtimeTTSA text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.
$ npx skills add Blaizzy/mlx-audioHow to choose
Use an alternative when it has a clearer install path, higher trust score, fresher maintenance, or better platform fit for your current agent stack. Keep Irene Voice Assistant if it already passes your workflow test and repository review.
Next step
Open the compare page, test the install commands in a sandbox, and check each repository before using a skill in production.