1
[KDD'2026] "VideoRAG: Chat with Your Videos"
$ npx skills add HKUDS/VideoRAG3.0K stars63 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
pythonrag
by HKUDSQuick view
Decision filters
4 skills matching "multi-modal"
Best blend of quality, stars, freshness, and agent usage
[KDD'2026] "VideoRAG: Chat with Your Videos"
$ npx skills add HKUDS/VideoRAGMulti-modal OCR pipeline optimized for ML training (text, figure, math, tables, diagrams)
$ npx skills add raphael-seo/Versatile-OCR-Program💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.
$ npx skills add showlab/Awesome-GUI-AgentWindows Agent Arena (WAA) 🪟 is a scalable OS platform for testing and benchmarking of multi-modal AI agents.
$ npx skills add microsoft/WindowsAgentArena