Alternatives

Speech Recognition alternatives for AI agents.

Compare similar skills by workflow fit, trust score, quality, GitHub adoption, maintenance, and install readiness.

Current skill

Speech Recognition

中文语音识别

51
Quality
72
Trust
851
Stars
#1

SpecAugment

Similarity 131Trust 77Needs review 54

A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain

654 starsApr 5, 2022 pushmedia-automationPythonSpeech
$ npx skills add DemisEom/SpecAugment
#2

Local Talking Llm

Similarity 130Trust 84Strong 81

A talking LLM that runs on your own computer without needing the internet.

869 starsApr 4, 2026 pushmedia-automationPythonSpeech
$ npx skills add vndee/local-talking-llm
#3

GigaAM

Similarity 129Trust 84Strong 79

Foundational Model for Speech Recognition Tasks

619 starsApr 15, 2026 pushmedia-automationPythonSpeech
$ npx skills add salute-developers/GigaAM
#4

Whisper Playground

Similarity 127Trust 83Promising 68

Build real time speech2text web apps using OpenAI's Whisper https://openai.com/blog/whisper/

833 starsSep 12, 2025 pushmedia-automationPythonSpeech
$ npx skills add saharmor/whisper-playground
#5

Parrots

Similarity 126Trust 83Promising 66

Automatic Speech Recognition(ASR), Text-To-Speech(TTS) engine. 中英语音识别、多角色语音合成,支持多语言,准确率高

525 starsNov 11, 2025 pushmedia-automationPythonSpeech
$ npx skills add shibing624/parrots
#6

Speechpy

Similarity 124Trust 78Promising 56

:speech_balloon: SpeechPy - A Library for Speech Processing and Recognition: http://speechpy.readthedocs.io/en/latest/

883 starsDec 15, 2024 pushmedia-automationPythonSpeech
$ npx skills add astorfi/speechpy
#7

Stephanie Va

Similarity 123Trust 77Promising 55

Stephanie is an open-source platform built specifically for voice-controlled applications as well as to automate daily tasks imitating much of an virtual assistant's work.

797 starsJan 20, 2019 pushmedia-automationPythonSpeech
$ npx skills add SlapBot/stephanie-va
#8

Allosaurus

Similarity 123Trust 77Promising 55

Allosaurus is a pretrained universal phone recognizer for more than 2000 languages

731 starsApr 26, 2024 pushmedia-automationPythonSpeech
$ npx skills add xinjli/allosaurus
#9

Adapt

Similarity 123Trust 77Promising 55

Adapt Intent Parser

720 starsJul 21, 2024 pushmedia-automationPythonSpeech
$ npx skills add MycroftAI/adapt
#10

Whisper Mic

Similarity 123Trust 76Promising 55

Project that allows one to use a microphone with OpenAI whisper.

789 starsJul 4, 2024 pushmedia-automationPythonSpeech
$ npx skills add mallorbc/whisper_mic
#11

Chinese Text Normalization

Similarity 123Trust 75Promising 55

Chinese text normalization for speech processing

732 starsMar 18, 2023 pushmedia-automationPythonSpeech
$ npx skills add speechio/chinese_text_normalization
#12

Espresso

Similarity 122Trust 74Needs review 51

Espresso: A Fast End-to-End Neural Speech Recognition Toolkit

939 starsSep 4, 2024 pushmedia-automationPythonSpeech
$ npx skills add freewym/espresso
#13

Audio AI Hub

Similarity 122Trust 84Strong 82

The hub for audio AI research: papers, open models, benchmarks & datasets across audio LLMs, speech recognition, TTS, music & audio generation.

933 starsJun 15, 2026 pushmedia-automationPythonSpeech
$ npx skills add BinWang28/audio-ai-hub
#14

Leaderboard

Similarity 121Trust 76Needs review 49

SpeechIO Leaderboard: a large, robust, comprehensive, benchmarking platform for Automatic Speech Recognition.

545 starsMar 29, 2025 pushmedia-automationPythonSpeech
$ npx skills add SpeechColab/Leaderboard
#15

CleanS2S

Similarity 121Trust 86Strong 78

High-quality and streaming Speech-to-Speech interactive agent in a single file. 只用一个文件实现的流式全双工语音交互原型智能体!

528 starsApr 7, 2026 pushmedia-automationPythonSpeech
$ npx skills add opendilab/CleanS2S
#16

PPASR

Similarity 119Trust 83Promising 69

基于PaddlePaddle实现端到端中文语音识别,从入门到实战,超简单的入门案例,超实用的企业项目。支持当前最流行的DeepSpeech2、Conformer、Squeezeformer模型

873 starsDec 17, 2025 pushmedia-automationPythonSpeech
$ npx skills add yeyupiaoling/PPASR

How to choose

When should you switch?

Use an alternative when it has a clearer install path, higher trust score, fresher maintenance, or better platform fit for your current agent stack. Keep Speech Recognition if it already passes your workflow test and repository review.

Next step

Compare top candidates side by side

Open the compare page, test the install commands in a sandbox, and check each repository before using a skill in production.