Alternatives

Speech Recognition Uk alternatives for AI agents.

Compare similar skills by workflow fit, trust score, quality, GitHub adoption, maintenance, and install readiness.

Current skill

Speech Recognition Uk

🇺🇦 Speech Recognition & Synthesis for Ukrainian

Quality 61 | Trust 73 | Stars 438
#1

MockingBird

Similarity 149 | Trust 87 | Excellent 100

🚀Clone a voice in 5 seconds to generate arbitrary speech in real-time

37K stars | Mar 3, 2026 push | media-automation | Python | Voice
$ npx skills add babysor/MockingBird
#2

Edge Tts

Similarity 146 | Trust 84 | Excellent 99

Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key

11K stars | Mar 22, 2026 push | media-automation | Python | Voice
$ npx skills add rany2/edge-tts
#3

WhisperX

Similarity 143 | Trust 95 | Excellent 100

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

23K stars | Jun 3, 2026 push | media-automation | Python | Speech
$ npx skills add m-bain/whisperX
#4

IMS Toucan

Similarity 143 | Trust 88 | Excellent 95

Controllable and fast Text-to-Speech for over 7000 languages!

2.2K stars | Jan 25, 2026 push | media-automation | Python | Voice
$ npx skills add DigitalPhonetics/IMS-Toucan
#5

GPT SoVITS

Similarity 142 | Trust 95 | Excellent 100

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

59K stars | Jun 20, 2026 push | media-automation | Python | Voice
$ npx skills add RVC-Boss/GPT-SoVITS
#6

VoxCPM

Similarity 142 | Trust 95 | Excellent 100

VoxCPM2: Tokenizer-Free TTS for Multilingual Speech Generation, Creative Voice Design, and True-to-Life Cloning

31K stars | Jun 10, 2026 push | media-automation | Python | Voice
$ npx skills add OpenBMB/VoxCPM
#7

Vits

Similarity 142 | Trust 85 | Strong 79

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

7.9K stars | Dec 6, 2023 push | media-automation | Python | Voice
$ npx skills add jaywalnut310/vits
#8

Mlx Audio

Similarity 141 | Trust 94 | Excellent 100

A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.

7.4K stars | Jun 19, 2026 push | media-automation | Python | Speech
$ npx skills add Blaizzy/mlx-audio
#9

Pyvideotrans

Similarity 141 | Trust 93 | Excellent 100

Translate the video from one language to another and embed dubbing & subtitles.

18K stars | Jun 19, 2026 push | media-automation | Python | Voice
$ npx skills add jianchang512/pyvideotrans
#10

Index Tts

Similarity 140 | Trust 90 | Excellent 100

An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

21K stars | Jun 16, 2026 push | media-automation | Python | Voice
$ npx skills add index-tts/index-tts
#11

Metavoice Src

Similarity 140 | Trust 83 | Strong 77

Foundational model for human-like, expressive TTS

4.2K stars | Jul 30, 2024 push | media-automation | Python | Voice
$ npx skills add metavoiceio/metavoice-src
#12

Irodori TTS

Similarity 139 | Trust 84 | Excellent 87

A Flow Matching-based Text-to-Speech Model with Emoji-driven Style Control

965 stars | Jun 4, 2026 push | media-automation | Python | Voice
$ npx skills add Aratako/Irodori-TTS
#13

Stt

Similarity 139 | Trust 88 | Excellent 98

Voice Recognition to Text Tool / An offline, locally run audio/video-to-subtitle tool that outputs JSON, SRT subtitles, and plain text

4.6K stars | Jan 22, 2026 push | media-automation | Python | Speech
$ npx skills add jianchang512/stt
#14

MsEdgeTTS

Similarity 138 | Trust 82 | Strong 82

A simple Azure Speech Service module that uses the Microsoft Edge Read Aloud API. https://www.npmjs.com/package/msedge-tts

334 stars | Jun 18, 2026 push | media-automation | TypeScript | Voice
$ npx skills add Migushthe2nd/MsEdgeTTS
#15

OpenVoice

Similarity 138 | Trust 86 | Excellent 87

Instant voice cloning by MIT and MyShell. Audio foundation model.

37K stars | Apr 19, 2025 push | media-automation | Python | Voice
$ npx skills add myshell-ai/OpenVoice
#16

WaveRNN

Similarity 138 | Trust 81 | Strong 74

WaveRNN Vocoder + TTS

2.2K stars | Jul 2, 2022 push | media-automation | Python | Voice
$ npx skills add fatchord/WaveRNN

How to choose

When should you switch?

Use an alternative when it has a clearer install path, higher trust score, fresher maintenance, or better platform fit for your current agent stack. Keep Speech Recognition Uk if it already passes your workflow test and repository review.
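
As a rough illustration of the "higher trust score" and "fresher maintenance" criteria, the sketch below shortlists a few of the listed alternatives. The trust scores and push dates are copied from the listing above. The `Skill` class, the `shortlist` helper, the 180-day freshness window, and the `as_of` reference date are all hypothetical choices made for this example, not part of the directory or of any skills tooling.

```python
from dataclasses import dataclass
from datetime import date


@dataclass
class Skill:
    name: str
    trust: int
    last_push: date


# Trust score listed for Speech Recognition Uk.
CURRENT_TRUST = 73

# A sample of the alternatives, with values taken from the listing.
CANDIDATES = [
    Skill("m-bain/whisperX", 95, date(2026, 6, 3)),
    Skill("rany2/edge-tts", 84, date(2026, 3, 22)),
    Skill("jaywalnut310/vits", 85, date(2023, 12, 6)),
    Skill("fatchord/WaveRNN", 81, date(2022, 7, 2)),
]


def shortlist(candidates, current_trust, as_of, max_age_days=180):
    """Keep candidates with higher trust and a push within max_age_days.

    max_age_days is an assumed threshold; tune it to your own policy.
    """
    fresh = [
        c for c in candidates
        if c.trust > current_trust
        and (as_of - c.last_push).days <= max_age_days
    ]
    # Highest trust first; name breaks ties for a stable order.
    return sorted(fresh, key=lambda c: (-c.trust, c.name))


# as_of is an assumed reference date for the example.
picks = shortlist(CANDIDATES, CURRENT_TRUST, as_of=date(2026, 6, 30))
for skill in picks:
    print(f"{skill.name}: trust {skill.trust}, last push {skill.last_push}")
```

A filter like this only narrows the list. Install path and platform fit still need a manual check against your agent stack.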

Next step

Compare top candidates side by side

Open the compare page, test the install commands in a sandbox, and check each repository before using a skill in production.