AI speech toolkit for Apple Silicon — ASR, TTS, speech-to-speech, VAD, and diarization powered by MLX and CoreML
$ npx skills add soniqo/speech-swiftAlternatives
Compare similar skills by workflow fit, trust score, quality, GitHub adoption, maintenance, and install readiness.
Current skill
On-device Speech AI for Apple Silicon
AI speech toolkit for Apple Silicon — ASR, TTS, speech-to-speech, VAD, and diarization powered by MLX and CoreML
$ npx skills add soniqo/speech-swiftPort of OpenAI's Whisper model in C/C++
$ npx skills add ggml-org/whisper.cppOffline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
$ npx skills add alphacep/vosk-apiFaster Whisper transcription with CTranslate2
$ npx skills add SYSTRAN/faster-whisperAutomatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
$ npx skills add MahmoudAshraf97/whisper-diarizationThe open-source iOS app that's making quality voice transcription more accessible on mobile devices.
$ npx skills add Saik0s/Whisperboard🎤 The easiest way to transcribe audio in Swift
$ npx skills add exPHAT/SwiftWhisperWhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
$ npx skills add m-bain/whisperXWhisper & Faster-Whisper standalone executables for those who don't want to bother with Python.
$ npx skills add Purfview/whisper-standalone-winEasy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
$ npx skills add PaddlePaddle/PaddleSpeechA PyTorch-based Speech Toolkit
$ npx skills add speechbrain/speechbrainLightning-Fast, On-Device, Multilingual TTS — running natively via ONNX.
$ npx skills add supertone-inc/supertonicOpenVINO™ is an open source toolkit for optimizing and deploying AI inference
$ npx skills add openvinotoolkit/openvinoSpeech recognition module for Python, supporting several engines and APIs, online and offline.
$ npx skills add Uberi/speech_recognitionEnd-to-End Speech Processing Toolkit
$ npx skills add espnet/espnetA text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.
$ npx skills add Blaizzy/mlx-audioHow to choose
Use an alternative when it has a clearer install path, higher trust score, fresher maintenance, or better platform fit for your current agent stack. Keep Argmax Oss Swift if it already passes your workflow test and repository review.
Next step
Open the compare page, test the install commands in a sandbox, and check each repository before using a skill in production.