A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.
$ npx skills add Blaizzy/mlx-audioAlternatives
Compare similar skills by workflow fit, trust score, quality, GitHub adoption, maintenance, and install readiness.
Current skill
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.
$ npx skills add Blaizzy/mlx-audioGradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isolation, and multilingual translation.
$ npx skills add abus-aikorea/voice-proVoice Recognition to Text Tool / 一个离线运行的本地音视频转字幕工具,输出json、srt字幕、纯文字格式
$ npx skills add jianchang512/sttCustom nodes that extend the capabilities of Comfyui
$ npx skills add AlekPet/ComfyUI_Custom_Nodes_AlekPetTranscribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by whisper models!
$ npx skills add pluja/whishperAI Vtuber for Streaming on Youtube/Twitch
$ npx skills add ardha27/AI-Waifu-VtuberPort of OpenAI's Whisper model in C/C++
$ npx skills add ggml-org/whisper.cppWhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
$ npx skills add m-bain/whisperXFaster Whisper transcription with CTranslate2
$ npx skills add SYSTRAN/faster-whisperSpeech recognition module for Python, supporting several engines and APIs, online and offline.
$ npx skills add Uberi/speech_recognition💬 Speech recognition for your site
$ npx skills add TalAter/annyangOn-device streaming speech-to-text engine powered by deep learning
$ npx skills add Picovoice/cheetah🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.
$ npx skills add coqui-ai/STTAutomatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
$ npx skills add MahmoudAshraf97/whisper-diarizationControllable and fast Text-to-Speech for over 7000 languages!
$ npx skills add DigitalPhonetics/IMS-ToucanA voice control - voice commands - speech recognition and speech synthesis javascript library. Create your own siri,google now or cortana with Google Chrome within your website.
$ npx skills add sdkcarlos/artyom.jsHow to choose
Use an alternative when it has a clearer install path, higher trust score, fresher maintenance, or better platform fit for your current agent stack. Keep Open Speech Corpora if it already passes your workflow test and repository review.
Next step
Open the compare page, test the install commands in a sandbox, and check each repository before using a skill in production.