Alternatives

Speech Swift alternatives for AI agents.

Compare similar skills by workflow fit, trust score, quality, GitHub adoption, maintenance, and install readiness.

Current skill

Speech Swift

AI speech toolkit for Apple Silicon — ASR, TTS, speech-to-speech, VAD, and diarization powered by MLX and CoreML

87
Quality
88
Trust
894
Stars
#1

TheWhisper

Similarity 134Trust 86Excellent 87

Optimized Whisper models for streaming and on-device use

888 starsJun 15, 2026 pushmedia-automationPythonSpeech
$ npx skills add TheStageAI/TheWhisper
#2

SwiftWhisper

Similarity 123Trust 75Promising 55

🎤 The easiest way to transcribe audio in Swift

781 starsMay 23, 2024 pushmedia-automationSwiftSpeech
$ npx skills add exPHAT/SwiftWhisper
#3

Sherpa

Similarity 118Trust 86Excellent 87

Speech-to-text server framework with next-gen Kaldi

940 starsJun 16, 2026 pushmedia-automationC++Speech
$ npx skills add k2-fsa/sherpa
#4

NotelyVoice

Similarity 117Trust 88Excellent 86

A 100% private AI voice transcription app that converts speech to text in 100+ languages. Built with Compose Multiplatform for Android & iOS using Whisper AI - no cloud uploads, all processing happens on-device for complete privacy.

725 starsMay 19, 2026 pushmedia-automationC++Speech
$ npx skills add Notely-Voice/NotelyVoice
#5

Cheetah

Similarity 117Trust 86Excellent 85

On-device streaming speech-to-text engine powered by deep learning

665 starsJun 10, 2026 pushmedia-automationPythonSpeech
$ npx skills add Picovoice/cheetah
#6

Hear

Similarity 117Trust 86Excellent 85

Command line interface for the built-in speech recognition and transcription capabilities in macOS.

661 starsMay 19, 2026 pushmedia-automationObjective-CSpeech
$ npx skills add sveinbjornt/hear
#7

FireRedASR2S

Similarity 116Trust 87Excellent 85

A SOTA Industrial-Grade All-in-One ASR system with ASR, VAD, LID, and Punc modules. FireRedASR2 supports Chinese (Mandarin, 20+ dialects/accents), English, code-switching, and both speech and singing ASR. FireRedVAD supports speech/singing/music in 100+ langs. FireRedLID supports 100+ langs and 20+ zh dialects. FireRedPunc supports zh and en.

551 starsJun 2, 2026 pushmedia-automationPythonSpeech
$ npx skills add FireRedTeam/FireRedASR2S
#8

Cn2an

Similarity 116Trust 84Strong 80

📦 快速转化「中文数字」和「阿拉伯数字」~ (最新特性:分数,日期、温度等转化)

762 starsApr 23, 2026 pushmedia-automationPythonSpeech
$ npx skills add Ailln/cn2an
#9

Qwen3 Tts Apple Silicon

Similarity 115Trust 82Strong 73

Run Qwen3-TTS text-to-speech locally on Mac (M1/M2/M3/M4). Voice cloning, voice design, custom voices. 100% offline using MLX.

516 starsMay 2, 2026 pushmedia-automationPythonVoice
$ npx skills add kapi2800/qwen3-tts-apple-silicon
#10

Whisper Android

Similarity 114Trust 82Strong 76

Offline Speech Recognition with OpenAI Whisper and TensorFlow Lite for Android

669 starsMar 18, 2026 pushmedia-automationC++Speech
$ npx skills add vilassn/whisper_android
#11

SwiftSpeech

Similarity 114Trust 75Needs review 53

A speech recognition framework designed for SwiftUI.

530 starsJul 27, 2021 pushmedia-automationSwiftSpeech
$ npx skills add Cay-Zhang/SwiftSpeech
#12

PPASR

Similarity 113Trust 83Promising 69

基于PaddlePaddle实现端到端中文语音识别,从入门到实战,超简单的入门案例,超实用的企业项目。支持当前最流行的DeepSpeech2、Conformer、Squeezeformer模型

873 starsDec 17, 2025 pushmedia-automationPythonSpeech
$ npx skills add yeyupiaoling/PPASR
#13

Voice Overlay Ios

Similarity 113Trust 86Strong 79

🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI

557 starsApr 8, 2026 pushmedia-automationSwiftSpeech
$ npx skills add algolia/voice-overlay-ios
#14

MASR

Similarity 113Trust 83Promising 68

Pytorch实现的流式与非流式的自动语音识别框架,同时兼容在线和离线识别,目前支持Conformer、Squeezeformer、DeepSpeech2模型,支持多种数据增强方法。

723 starsDec 17, 2025 pushmedia-automationPythonSpeech
$ npx skills add yeyupiaoling/MASR
#15

PaddlePaddle DeepSpeech

Similarity 113Trust 82Promising 68

基于PaddlePaddle实现的语音识别,中文语音识别。项目完善,识别效果好。支持Windows,Linux下训练和预测,支持Nvidia Jetson开发板预测。

762 starsDec 17, 2025 pushmedia-automationPythonSpeech
$ npx skills add yeyupiaoling/PaddlePaddle-DeepSpeech
#16

Vosk Browser

Similarity 112Trust 81Promising 66

A speech recognition library running in the browser thanks to a WebAssembly build of Vosk

519 starsDec 7, 2025 pushmedia-automationJavaScriptSpeech
$ npx skills add ccoreilly/vosk-browser

How to choose

When should you switch?

Use an alternative when it has a clearer install path, higher trust score, fresher maintenance, or better platform fit for your current agent stack. Keep Speech Swift if it already passes your workflow test and repository review.

Next step

Compare top candidates side by side

Open the compare page, test the install commands in a sandbox, and check each repository before using a skill in production.