Alternatives

PaddlePaddle DeepSpeech alternatives for AI agents.

Compare similar skills by workflow fit, trust score, quality, GitHub adoption, maintenance, and install readiness.

Current skill

PaddlePaddle DeepSpeech

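Speech recognition implemented with PaddlePaddle, focused on Chinese speech recognition. The project is mature and recognition accuracy is good. Supports training and inference on Windows and Linux, and inference on Nvidia Jetson development boards.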

Quality: 68
Trust: 82
Stars: 762
#1

PPASR

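Similarity 143 | Trust 83 | Promising 69

End-to-end Chinese speech recognition implemented with PaddlePaddle, from first steps to production use, with very simple introductory examples and practical enterprise projects. Supports the currently popular DeepSpeech2, Conformer, and Squeezeformer models.

873 stars | Dec 17, 2025 push | media-automation | Python | Speech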
$ npx skills add yeyupiaoling/PPASR
#2

MASR

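Similarity 135 | Trust 83 | Promising 68

An automatic speech recognition framework implemented in PyTorch, covering streaming and non-streaming recognition and compatible with both online and offline use. Currently supports the Conformer, Squeezeformer, and DeepSpeech2 models, along with multiple data augmentation methods.

723 stars | Dec 17, 2025 push | media-automation | Python | Speech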
$ npx skills add yeyupiaoling/MASR
#3

Speech To Text Benchmark

Similarity 129 | Trust 84 | Strong 80

Speech-to-text benchmark framework

693 stars | Mar 19, 2026 push | media-automation | Python | Speech
$ npx skills add Picovoice/speech-to-text-benchmark
#4

Chinese Text Normalization

Similarity 123 | Trust 75 | Promising 55

Chinese text normalization for speech processing

732 stars | Mar 18, 2023 push | media-automation | Python | Speech
$ npx skills add speechio/chinese_text_normalization
#5

Cheetah

Similarity 123 | Trust 86 | Excellent 85

On-device streaming speech-to-text engine powered by deep learning

665 stars | Jun 10, 2026 push | media-automation | Python | Speech
$ npx skills add Picovoice/cheetah
#6

FireRedASR2S

Similarity 122 | Trust 87 | Excellent 85

A SOTA Industrial-Grade All-in-One ASR system with ASR, VAD, LID, and Punc modules. FireRedASR2 supports Chinese (Mandarin, 20+ dialects/accents), English, code-switching, and both speech and singing ASR. FireRedVAD supports speech/singing/music in 100+ langs. FireRedLID supports 100+ langs and 20+ zh dialects. FireRedPunc supports zh and en.

551 stars | Jun 2, 2026 push | media-automation | Python | Speech
$ npx skills add FireRedTeam/FireRedASR2S
#7

Cn2an

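Similarity 122 | Trust 84 | Strong 80

📦 Quickly converts between Chinese numerals and Arabic numerals. (Latest features: conversion of fractions, dates, temperatures, and more.)

762 stars | Apr 23, 2026 push | media-automation | Python | Speech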
$ npx skills add Ailln/cn2an
#8

Gpt Home

Similarity 120 | Trust 83 | Strong 75

ChatGPT at home! A better alternative to commercial smart home assistants, built on the Raspberry Pi using LiteLLM and LangGraph.

643 stars | Mar 17, 2026 push | media-automation | Python | Speech
$ npx skills add judahpaul16/gpt-home
#9

Willow Inference Server

Similarity 120 | Trust 85 | Strong 74

Open source, local, and self-hosted highly optimized language inference server supporting ASR/STT, TTS, and LLM across WebRTC, REST, and WS

504 stars | Feb 12, 2026 push | media-automation | Python | Speech
$ npx skills add toverainc/willow-inference-server
#10

Speech Swift

Similarity 118 | Trust 88 | Excellent 87

AI speech toolkit for Apple Silicon — ASR, TTS, speech-to-speech, VAD, and diarization powered by MLX and CoreML

894 stars | Jun 14, 2026 push | media-automation | Swift | Speech
$ npx skills add soniqo/speech-swift
#11

Sherpa

Similarity 118 | Trust 86 | Excellent 87

Speech-to-text server framework with next-gen Kaldi

940 stars | Jun 16, 2026 push | media-automation | C++ | Speech
$ npx skills add k2-fsa/sherpa
#12

WhisperS2T

Similarity 117 | Trust 80 | Needs review 54

An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engines

574 stars | Aug 27, 2024 push | media-automation | Jupyter Notebook | Speech
$ npx skills add shashikg/WhisperS2T
#13

TheWhisper

Similarity 116 | Trust 86 | Excellent 87

Optimized Whisper models for streaming and on-device use

888 stars | Jun 15, 2026 push | media-automation | Python | Speech
$ npx skills add TheStageAI/TheWhisper
#14

Openspeech

Similarity 115 | Trust 77 | Promising 55

Open-Source Toolkit for End-to-End Speech Recognition leveraging PyTorch-Lightning and Hydra.

717 stars | Oct 23, 2023 push | media-automation | Python | Speech
$ npx skills add openspeech-team/openspeech
#15

Kur

Similarity 115 | Trust 74 | Promising 55

Descriptive Deep Learning

822 stars | Feb 5, 2024 push | media-automation | Python | Speech
$ npx skills add deepgram/kur
#16

Rhino

Similarity 115 | Trust 86 | Excellent 86

On-device Speech-to-Intent engine powered by deep learning

703 stars | Jun 16, 2026 push | media-automation | Python | Speech
$ npx skills add Picovoice/rhino

How to choose

When should you switch?

Use an alternative when it has a clearer install path, higher trust score, fresher maintenance, or better platform fit for your current agent stack. Keep PaddlePaddle DeepSpeech if it already passes your workflow test and repository review.
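
As a rough illustration, the trust and maintenance criteria above could be applied to a few entries from this list with a short Python sketch. The `Skill` class, the `worth_switching` helper, the 365-day freshness window, and the fixed reference date are all hypothetical choices made for this example and are not part of the listing or of any tool. Install path and platform fit still need a manual check.

```python
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class Skill:
    """One row from the listing: name, trust score, and last push date."""
    name: str
    trust: int
    last_push: date


def worth_switching(candidate: Skill, current_trust: int, today: date,
                    max_age_days: int = 365) -> bool:
    """True if the candidate has a higher trust score and a recent push.

    max_age_days is an assumed freshness window, not a value from the listing.
    """
    is_fresh = (today - candidate.last_push).days <= max_age_days
    return is_fresh and candidate.trust > current_trust


# Trust scores and push dates copied from four entries in the list above.
candidates = [
    Skill("PPASR", 83, date(2025, 12, 17)),
    Skill("Chinese Text Normalization", 75, date(2023, 3, 18)),
    Skill("Sherpa", 86, date(2026, 6, 16)),
    Skill("WhisperS2T", 80, date(2024, 8, 27)),
]

CURRENT_TRUST = 82           # trust score of PaddlePaddle DeepSpeech
TODAY = date(2026, 6, 30)    # assumed reference date for the freshness check

shortlist = sorted(
    (c for c in candidates if worth_switching(c, CURRENT_TRUST, TODAY)),
    key=lambda c: c.trust,
    reverse=True,
)
names = [c.name for c in shortlist]
print(names)  # ['Sherpa', 'PPASR']
```

With these assumed thresholds, only Sherpa and PPASR pass both checks. A different window or reference date would change the shortlist.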

Next step

Compare top candidates side by side

Open the compare page, test the install commands in a sandbox, and check each repository before using a skill in production.