Cross-platform, customizable ML solutions for live and streaming media.
$ npx skills add google-ai-edge/mediapipe
Alternatives
Compare similar skills by workflow fit, trust score, quality, GitHub adoption, maintenance, and install readiness.
Current skill
[EMNLP2024 Demo], [ICASSP 2025], [ICASSP 2026] A user-friendly library for reproducible video moment retrieval and highlight detection. It also supports audio moment retrieval.
Cross-platform, customizable ML solutions for live and streaming media.
$ npx skills add google-ai-edge/mediapipe
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
$ npx skills add huggingface/transformers
We write your reusable computer vision tools. 💜
$ npx skills add roboflow/supervision
🤗 The largest hub of ready-to-use datasets for AI models with fast, easy-to-use and efficient data manipulation tools
$ npx skills add huggingface/datasets
Ultralytics YOLO 🚀
$ npx skills add ultralytics/ultralytics
Datasets, Transforms and Models specific to Computer Vision
$ npx skills add pytorch/vision
NLTK Source
$ npx skills add nltk/nltk
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
$ npx skills add jacobgil/pytorch-grad-cam
Low-code framework for building custom LLMs, neural networks, and other AI models
$ npx skills add ludwig-ai/ludwig
A PyTorch-based Speech Toolkit
$ npx skills add speechbrain/speechbrain
Fast and Accurate ML in 3 Lines of Code
$ npx skills add autogluon/autogluon
🐍 Geometric Computer Vision Library for Spatial AI
$ npx skills add kornia/kornia
Refine high-quality datasets and visual AI models
$ npx skills add voxel51/fiftyone
Node-based Visual Programming Toolbox
$ npx skills add alicevision/Meshroom
RF-DETR is a real-time object detection and segmentation model architecture developed by Roboflow, SOTA on COCO, designed for fine-tuning. [ICLR 2026]
$ npx skills add roboflow/rf-detr
A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.
$ npx skills add Blaizzy/mlx-audio
How to choose
Use an alternative when it has a clearer install path, higher trust score, fresher maintenance, or better platform fit for your current agent stack. Keep Lighthouse if it already passes your workflow test and repository review.
Next step
Open the compare page, test the install commands in a sandbox, and check each repository before using a skill in production.