A PyTorch-based Speech Toolkit
$ npx skills add speechbrain/speechbrain
Alternatives
Compare similar skills by workflow fit, trust score, quality, GitHub adoption, maintenance, and install readiness.
Current skill
Tools for handling multimodal data in machine learning projects.
A PyTorch-based Speech Toolkit
$ npx skills add speechbrain/speechbrain
Speech recognition module for Python, supporting several engines and APIs, online and offline.
$ npx skills add Uberi/speech_recognition
End-to-End Speech Processing Toolkit
$ npx skills add espnet/espnet
High-quality, streaming, full-duplex speech-to-speech interactive agent prototype implemented in a single file.
$ npx skills add opendilab/CleanS2S
A voice chat app
$ npx skills add modal-labs/quillman
🚀 Clone a voice in 5 seconds to generate arbitrary speech in real-time
$ npx skills add babysor/MockingBird
OpenVINO™ is an open source toolkit for optimizing and deploying AI inference
$ npx skills add openvinotoolkit/openvino
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
$ npx skills add m-bain/whisperX
Faster Whisper transcription with CTranslate2
$ npx skills add SYSTRAN/faster-whisper
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
$ npx skills add huggingface/distil-whisper
WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries
$ npx skills add alphacep/vosk-server
A Deep-Learning-Based Chinese Speech Recognition System
$ npx skills add nl8590687/ASRT_SpeechRecognition
End-to-end Automatic Speech Recognition for Mandarin and English in TensorFlow
$ npx skills add zzw922cn/Automatic_Speech_Recognition
A free audio dataset of spoken digits. An audio version of MNIST.
$ npx skills add Jakobovski/free-spoken-digit-dataset
Voice Recognition to Text Tool: an offline, locally run audio/video-to-subtitle tool that outputs JSON, SRT subtitles, and plain text
$ npx skills add jianchang512/stt
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by PyTorch, while feature extraction, label computation, and decoding are performed with the Kaldi toolkit.
$ npx skills add mravanelli/pytorch-kaldi
How to choose
Use an alternative when it has a clearer install path, higher trust score, fresher maintenance, or better platform fit for your current agent stack. Keep Lhotse if it already passes your workflow test and repository review.
Next step
Open the compare page, test the install commands in a sandbox, and check each repository before using a skill in production.