Alternatives

Mlx Audio Swift alternatives for AI agents.

Compare similar skills by workflow fit, trust score, quality, GitHub adoption, maintenance, and install readiness.

Current skill

Mlx Audio Swift

A modular Swift SDK for audio processing with MLX on Apple Silicon

86
Quality
86
Trust
675
Stars
#1

Qwen3 Tts Apple Silicon

Similarity 113Trust 82Strong 73

Run Qwen3-TTS text-to-speech locally on Mac (M1/M2/M3/M4). Voice cloning, voice design, custom voices. 100% offline using MLX.

516 starsMay 2, 2026 pushmedia-automationPythonVoice
$ npx skills add kapi2800/qwen3-tts-apple-silicon
#2

TheWhisper

Similarity 112Trust 86Excellent 87

Optimized Whisper models for streaming and on-device use

888 starsJun 15, 2026 pushmedia-automationPythonSpeech
$ npx skills add TheStageAI/TheWhisper
#3

Cheetah

Similarity 111Trust 86Excellent 85

On-device streaming speech-to-text engine powered by deep learning

665 starsJun 10, 2026 pushmedia-automationPythonSpeech
$ npx skills add Picovoice/cheetah
#4

Speech Swift

Similarity 110Trust 88Excellent 87

AI speech toolkit for Apple Silicon — ASR, TTS, speech-to-speech, VAD, and diarization powered by MLX and CoreML

894 starsJun 14, 2026 pushmedia-automationSwiftSpeech
$ npx skills add soniqo/speech-swift
#5

GPA

Similarity 110Trust 88Excellent 87

[AutoArk] GPA (General Purpose Audio) can do ASR, TTS and voice conversion with one tiny model!

866 starsMay 25, 2026 pushmedia-automationPythonVoice
$ npx skills add AutoArk/GPA
#6

Vonage Php Sdk Core

Similarity 110Trust 86Excellent 87

Vonage REST API client for PHP. API support for SMS, Voice, Text-to-Speech, Numbers, Verify (2FA) and more.

931 starsMay 28, 2026 pushmedia-automationPHPVoice
$ npx skills add Vonage/vonage-php-sdk-core
#7

EDDiscovery

Similarity 110Trust 86Excellent 87

Captains log and 3d star map for Elite Dangerous

886 starsJun 15, 2026 pushmedia-automationC#Voice
$ npx skills add EDDiscovery/EDDiscovery
#8

Alexandria Audiobook

Similarity 109Trust 88Excellent 86

AI-powered multi-voice audiobook generator — LLM script annotation, voice cloning, voice design, LoRA training, per-line style control, and export to MP3, chaptered M4B, or Audacity multi-track. Built on Qwen3-TTS.

682 starsJun 4, 2026 pushmedia-automationPythonVoice
$ npx skills add Finrandojin/alexandria-audiobook
#9

Cboard

Similarity 109Trust 86Excellent 86

Augmentative and Alternative Communication (AAC) system with text-to-speech for the browser

738 starsJun 12, 2026 pushmedia-automationJavaScriptVoice
$ npx skills add cboard-org/cboard
#10

Pandrator

Similarity 109Trust 89Excellent 85

Turn PDFs and EPUBs into audiobooks; subtitles or videos into dubbed videos (including translation), and more. For free. Pandrator uses local models, including voice-cloning (instant, RVC-enhanced, XTTS fine-tuning) and LLM processing. It aspires to be a user-friendly app with a GUI, an installer and all-in-one packages.

572 starsJun 14, 2026 pushmedia-automationPythonVoice
$ npx skills add lukaszliniewicz/Pandrator
#11

F5 Tts Mlx

Similarity 109Trust 77Needs review 54

Implementation of F5-TTS in MLX

635 starsMar 19, 2025 pushmedia-automationPythonVoice
$ npx skills add lucasnewman/f5-tts-mlx
#12

Dictionariez

Similarity 108Trust 85Excellent 85

📚 A customizable dictionary extension that supports double-click lookups in 20+ languages, 1000+ dictionaries, text-to-speech, translation and Anki integration.

647 starsJun 10, 2026 pushmedia-automationJavaScriptVoice
$ npx skills add pnlpal/dictionariez
#13

Willow Inference Server

Similarity 108Trust 85Strong 74

Open source, local, and self-hosted highly optimized language inference server supporting ASR/STT, TTS, and LLM across WebRTC, REST, and WS

504 starsFeb 12, 2026 pushmedia-automationPythonSpeech
$ npx skills add toverainc/willow-inference-server
#14

CloneTTS

Similarity 108Trust 84Strong 81

A lightweight, offline Android Text-to-Speech (TTS) engine enabling seamless system-wide voice cloning and high-fidelity text reading. / 运行在安卓本地的轻量级文字转语音 (TTS) 引擎,支持离线发音人提取、零门槛音色克隆与双擎系统级全局听书。

692 starsMay 18, 2026 pushmedia-automationVoiceClaude Code
$ npx skills add sipeter/CloneTTS
#15

Voice AI

Similarity 107Trust 84Strong 81

Rapida is an open-source, end-to-end voice AI orchestration platform for building real-time conversational voice agents with audio streaming, STT, TTS, VAD, multi-channel integration, agent state management, and observability.

666 starsJun 16, 2026 pushmedia-automationGoVoice
$ npx skills add rapidaai/voice-ai
#16

Vui

Similarity 107Trust 82Strong 81

Real-time voice assistant — WebRTC streaming, faster-whisper ASR, local LLM, Vui Nano (300M) TTS. OpenAI Realtime API compatible. Voice cloning, barge-in, ~9× realtime on a 4090. Apache 2.0.

701 starsJun 12, 2026 pushmedia-automationPythonVoice
$ npx skills add fluxions-ai/vui

How to choose

When should you switch?

Use an alternative when it has a clearer install path, higher trust score, fresher maintenance, or better platform fit for your current agent stack. Keep Mlx Audio Swift if it already passes your workflow test and repository review.

Next step

Compare top candidates side by side

Open the compare page, test the install commands in a sandbox, and check each repository before using a skill in production.