The hub for audio AI research: papers, open models, benchmarks & datasets across audio LLMs, speech recognition, TTS, music & audio generation.
$ npx skills add BinWang28/audio-ai-hubAlternatives
Compare similar skills by workflow fit, trust score, quality, GitHub adoption, maintenance, and install readiness.
Current skill
Automatic Speech Recognition(ASR), Text-To-Speech(TTS) engine. 中英语音识别、多角色语音合成,支持多语言,准确率高
The hub for audio AI research: papers, open models, benchmarks & datasets across audio LLMs, speech recognition, TTS, music & audio generation.
$ npx skills add BinWang28/audio-ai-hubA talking LLM that runs on your own computer without needing the internet.
$ npx skills add vndee/local-talking-llm中文语音识别
$ npx skills add xxbb1234021/speech_recognitionFoundational Model for Speech Recognition Tasks
$ npx skills add salute-developers/GigaAMHigh-quality and streaming Speech-to-Speech interactive agent in a single file. 只用一个文件实现的流式全双工语音交互原型智能体!
$ npx skills add opendilab/CleanS2S🚀 AI 全自动短视频引擎 | AI Fully Automated Short Video Engine
$ npx skills add AIDC-AI/Pixelle-VideoBuild real time speech2text web apps using OpenAI's Whisper https://openai.com/blog/whisper/
$ npx skills add saharmor/whisper-playgroundA real-time silent speech recognition tool.
$ npx skills add amanvirparhar/chaplin[AutoArk] GPA (General Purpose Audio) can do ASR, TTS and voice conversion with one tiny model!
$ npx skills add AutoArk/GPASpeech-to-text server framework with next-gen Kaldi
$ npx skills add k2-fsa/sherpa:speech_balloon: SpeechPy - A Library for Speech Processing and Recognition: http://speechpy.readthedocs.io/en/latest/
$ npx skills add astorfi/speechpyOptimized Whisper models for streaming and on-device use
$ npx skills add TheStageAI/TheWhisperStephanie is an open-source platform built specifically for voice-controlled applications as well as to automate daily tasks imitating much of an virtual assistant's work.
$ npx skills add SlapBot/stephanie-vaAllosaurus is a pretrained universal phone recognizer for more than 2000 languages
$ npx skills add xinjli/allosaurusAdapt Intent Parser
$ npx skills add MycroftAI/adaptOpen-Source Toolkit for End-to-End Speech Recognition leveraging PyTorch-Lightning and Hydra.
$ npx skills add openspeech-team/openspeechHow to choose
Use an alternative when it has a clearer install path, higher trust score, fresher maintenance, or better platform fit for your current agent stack. Keep Parrots if it already passes your workflow test and repository review.
Next step
Open the compare page, test the install commands in a sandbox, and check each repository before using a skill in production.