Alternatives

MASR alternatives for AI agents.

Compare similar skills by workflow fit, trust score, quality, GitHub adoption, maintenance, and install readiness.

Current skill

MASR

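A PyTorch-based automatic speech recognition framework with streaming and non-streaming support, compatible with both online and offline recognition. It currently supports the Conformer, Squeezeformer, and DeepSpeech2 models, along with multiple data augmentation methods.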

Quality 68 | Trust 83 | Stars 723
#1

PPASR

Similarity 143 | Trust 83 | Promising 69

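End-to-end Chinese speech recognition implemented with PaddlePaddle, from beginner to practical use, with very simple introductory examples and highly practical enterprise projects. Supports the currently most popular models: DeepSpeech2, Conformer, and Squeezeformer.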

873 stars | Dec 17, 2025 push | media-automation | Python | Speech
$ npx skills add yeyupiaoling/PPASR
#2

PaddlePaddle DeepSpeech

Similarity 135 | Trust 82 | Promising 68

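Chinese speech recognition implemented with PaddlePaddle. The project is complete and recognition quality is good. Supports training and inference on Windows and Linux, and inference on Nvidia Jetson development boards.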

762 stars | Dec 17, 2025 push | media-automation | Python | Speech
$ npx skills add yeyupiaoling/PaddlePaddle-DeepSpeech
#3

Cheetah

Similarity 131 | Trust 86 | Excellent 85

On-device streaming speech-to-text engine powered by deep learning

665 stars | Jun 10, 2026 push | media-automation | Python | Speech
$ npx skills add Picovoice/cheetah
#4

Speech To Text Benchmark

Similarity 129 | Trust 84 | Strong 80

Speech-to-text benchmark framework

693 stars | Mar 19, 2026 push | media-automation | Python | Speech
$ npx skills add Picovoice/speech-to-text-benchmark
#5

Willow Inference Server

Similarity 128 | Trust 85 | Strong 74

Open source, local, and self-hosted highly optimized language inference server supporting ASR/STT, TTS, and LLM across WebRTC, REST, and WS

504 stars | Feb 12, 2026 push | media-automation | Python | Speech
$ npx skills add toverainc/willow-inference-server
#6

Sherpa

Similarity 126 | Trust 86 | Excellent 87

Speech-to-text server framework with next-gen Kaldi

940 stars | Jun 16, 2026 push | media-automation | C++ | Speech
$ npx skills add k2-fsa/sherpa
#7

WhisperS2T

Similarity 125 | Trust 80 | Needs review 54

An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engines

574 stars | Aug 27, 2024 push | media-automation | Jupyter Notebook | Speech
$ npx skills add shashikg/WhisperS2T
#8

TheWhisper

Similarity 124 | Trust 86 | Excellent 87

Optimized Whisper models for streaming and on-device use

888 stars | Jun 15, 2026 push | media-automation | Python | Speech
$ npx skills add TheStageAI/TheWhisper
#9

Allosaurus

Similarity 123 | Trust 77 | Promising 55

Allosaurus is a pretrained universal phone recognizer for more than 2000 languages

731 stars | Apr 26, 2024 push | media-automation | Python | Speech
$ npx skills add xinjli/allosaurus
#10

Openspeech

Similarity 123 | Trust 77 | Promising 55

Open-Source Toolkit for End-to-End Speech Recognition leveraging PyTorch-Lightning and Hydra.

717 stars | Oct 23, 2023 push | media-automation | Python | Speech
$ npx skills add openspeech-team/openspeech
#11

SpecAugment

Similarity 123 | Trust 77 | Needs review 54

An Implementation of SpecAugment with TensorFlow & PyTorch, introduced by Google Brain

654 stars | Apr 5, 2022 push | media-automation | Python | Speech
$ npx skills add DemisEom/SpecAugment
#12

Kospeech

Similarity 123 | Trust 77 | Needs review 54

Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.

637 stars | May 27, 2023 push | media-automation | Python | Speech
$ npx skills add sooftware/kospeech
#13

FireRedASR2S

Similarity 122 | Trust 87 | Excellent 85

A SOTA industrial-grade all-in-one ASR system with ASR, VAD, LID, and Punc modules. FireRedASR2 supports Chinese (Mandarin, 20+ dialects/accents), English, code-switching, and both speech and singing ASR. FireRedVAD supports speech/singing/music in 100+ languages. FireRedLID supports 100+ languages and 20+ Chinese dialects. FireRedPunc supports Chinese and English.

551 stars | Jun 2, 2026 push | media-automation | Python | Speech
$ npx skills add FireRedTeam/FireRedASR2S
#14

Espresso

Similarity 122 | Trust 74 | Needs review 51

Espresso: A Fast End-to-End Neural Speech Recognition Toolkit

939 stars | Sep 4, 2024 push | media-automation | Python | Speech
$ npx skills add freewym/espresso
#15

Cn2an

Similarity 122 | Trust 84 | Strong 80

📦 Quickly convert between Chinese numerals and Arabic numerals (latest features: conversion of fractions, dates, temperatures, and more)

762 stars | Apr 23, 2026 push | media-automation | Python | Speech
$ npx skills add Ailln/cn2an
#16

Leaderboard

Similarity 121 | Trust 76 | Needs review 49

SpeechIO Leaderboard: a large, robust, comprehensive benchmarking platform for Automatic Speech Recognition.

545 stars | Mar 29, 2025 push | media-automation | Python | Speech
$ npx skills add SpeechColab/Leaderboard

How to choose

When should you switch?

Use an alternative when it has a clearer install path, higher trust score, fresher maintenance, or better platform fit for your current agent stack. Keep MASR if it already passes your workflow test and repository review.

Next step

Compare top candidates side by side

Open the compare page, test the install commands in a sandbox, and check each repository before using a skill in production.