A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.
$ npx skills add Blaizzy/mlx-audioAlternatives
Compare similar skills by workflow fit, trust score, quality, GitHub adoption, maintenance, and install readiness.
Current skill
Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isolation, and multilingual translation.
A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.
$ npx skills add Blaizzy/mlx-audioAI Vtuber for Streaming on Youtube/Twitch
$ npx skills add ardha27/AI-Waifu-VtuberWhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
$ npx skills add m-bain/whisperXFaster Whisper transcription with CTranslate2
$ npx skills add SYSTRAN/faster-whisperSpeech recognition module for Python, supporting several engines and APIs, online and offline.
$ npx skills add Uberi/speech_recognitionCustom nodes that extend the capabilities of Comfyui
$ npx skills add AlekPet/ComfyUI_Custom_Nodes_AlekPetVoice Recognition to Text Tool / 一个离线运行的本地音视频转字幕工具,输出json、srt字幕、纯文字格式
$ npx skills add jianchang512/stt💬📝 A small dictation app using OpenAI's Whisper speech recognition model.
$ npx skills add savbell/whisper-writerPort of OpenAI's Whisper model in C/C++
$ npx skills add ggml-org/whisper.cppA voice chat app
$ npx skills add modal-labs/quillmanWhisper & Faster-Whisper standalone executables for those who don't want to bother with Python.
$ npx skills add Purfview/whisper-standalone-winИрина - русский голосовой ассистент для работы оффлайн. Поддерживает скиллы через плагины.
$ npx skills add janvarev/Irene-Voice-AssistantWhisper command line client compatible with original OpenAI client based on CTranslate2.
$ npx skills add Softcatala/whisper-ctranslate2OpenAI Whisper ASR Webservice API
$ npx skills add ahmetoner/whisper-asr-webservice💬 Speech recognition for your site
$ npx skills add TalAter/annyangEasy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
$ npx skills add PaddlePaddle/PaddleSpeechHow to choose
Use an alternative when it has a clearer install path, higher trust score, fresher maintenance, or better platform fit for your current agent stack. Keep Voice Pro if it already passes your workflow test and repository review.
Next step
Open the compare page, test the install commands in a sandbox, and check each repository before using a skill in production.