OpenAgentSkill Registry

OpenAgentSkill guide

Best multimodal media skills for AI agents

Browse skills for image, video, audio, transcription, metadata extraction, and multimodal content workflows.

When to use this guide

Start from the job, then shortlist the tools.

Transcribe audio

Extract video metadata

Summarize images

Prepare media for search

For each of these jobs, use quality and freshness signals to decide whether a skill belongs in the workflow.

Shortlist

Top skills to evaluate

#1 Transformers · Excellent · 100 · 162K stars

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Best fit: High-confidence pick with strong adoption and healthy maintenance signals.

#2 Diffusers · Excellent · 100 · 34K stars

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

Best fit: High-confidence pick with strong adoption and healthy maintenance signals.

#3 Cs Video Courses · Excellent · 100 · 82K stars

List of Computer Science courses with video lectures.

Best fit: High-confidence pick with strong adoption and healthy maintenance signals.

#4 Ultralytics · Excellent · 100 · 58K stars

Ultralytics YOLO 🚀

Best fit: High-confidence pick with strong adoption and healthy maintenance signals.

#5 Supervision · Excellent · 100 · 44K stars

We write your reusable computer vision tools. 💜

Best fit: High-confidence pick with strong adoption and healthy maintenance signals.

#6 Mediapipe · Excellent · 100 · 36K stars

Cross-platform, customizable ML solutions for live and streaming media.

Best fit: High-confidence pick with strong adoption and healthy maintenance signals.

#7 Pixelle Video · Excellent · 100 · 23K stars

🚀 AI Fully Automated Short Video Engine

Best fit: High-confidence pick with strong adoption and healthy maintenance signals.

#8 Screenpipe · Excellent · 100 · 19K stars

YC (S26) | AI that knows what you've seen, said, or heard. Records everything you do, say, hear 24/7, local, private, secure

Best fit: High-confidence pick with strong adoption and healthy maintenance signals.

#9 LivePortrait · Excellent · 100 · 19K stars

Bring portraits to life!

Best fit: High-confidence pick with strong adoption and healthy maintenance signals.

#10 Waifu2x Extension GUI · Excellent · 100 · 17K stars

Video, image, and GIF upscaling/enlargement (super-resolution) and video frame interpolation. Achieved with Waifu2x, Real-ESRGAN, Real-CUGAN, RTX Video Super Resolution VSR, SRMD, RealSR, Anime4K, RIFE, IFRNet, CAIN, DAIN, and ACNet.

Best fit: High-confidence pick with strong adoption and healthy maintenance signals.