Official Repo For Pixel-LLM Codebase: Sa2VA (Arxiv-25), SAMTok (CVPR-26), VRT, SaSaSa2VA (1-st solution for LSVOS)
$ npx skills add bytedance/Sa2VAAlternatives
Compare similar skills by workflow fit, trust score, quality, GitHub adoption, maintenance, and install readiness.
Current skill
[CVPR 2024] MovieChat: From Dense Token to Sparse Memory for Long Video Understanding
Official Repo For Pixel-LLM Codebase: Sa2VA (Arxiv-25), SAMTok (CVPR-26), VRT, SaSaSa2VA (1-st solution for LSVOS)
$ npx skills add bytedance/Sa2VAA Python package for fast and robust Image Stitching
$ npx skills add OpenStitching/stitchingRay tracing and hybrid rasterization of Gaussian particles
$ npx skills add nv-tlabs/3dgrut记录cv算法工程师的成长之路,分享计算机视觉和模型压缩部署技术栈笔记。https://harleyszhang.github.io/cv_note/
$ npx skills add harleyszhang/cv_noteMaaNTE. Nevertheless to Everless automatic assistant 异环小助手
$ npx skills add 1bananachicken/MaaNTE[CVPR 2025] MMAudio: Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis
$ npx skills add hkchengrex/MMAudioA list of synthetic dataset and tools for computer vision
$ npx skills add unrealcv/synthetic-computer-visionTraining library for local feature detection and matching
$ npx skills add cvg/glue-factorySenseTime Research platform for single object tracking, implementing algorithms like SiamRPN and SiamMask.
$ npx skills add STVIR/pysot[CVPR2023] The official repo for OC-SORT: Observation-Centric SORT on video Multi-Object Tracking. OC-SORT is simple, online and robust to occlusion/non-linear motion.
$ npx skills add noahcao/OC_SORT[ICCV 2023] DDColor: Towards Photo-Realistic Image Colorization via Dual Decoders
$ npx skills add piddnad/DDColorOpenMMLab Foundational Library for Training Deep Learning Models
$ npx skills add open-mmlab/mmengineOne-step image-to-image with Stable Diffusion turbo: sketch2image, day2night, and more
$ npx skills add GaParmar/img2img-turbo[ICCV'25 Best Paper Finalist] ReCamMaster: Camera-Controlled Generative Rendering from A Single Video
$ npx skills add KlingAIResearch/ReCamMaster[CVPR 2025] MASt3R-SLAM: Real-Time Dense SLAM with 3D Reconstruction Priors
$ npx skills add rmurai0610/MASt3R-SLAM[ECCV 2022] This is the official implementation of BEVFormer, a camera-only framework for autonomous driving perception, e.g., 3D object detection and semantic map segmentation.
$ npx skills add fundamentalvision/BEVFormerHow to choose
Use an alternative when it has a clearer install path, higher trust score, fresher maintenance, or better platform fit for your current agent stack. Keep MovieChat if it already passes your workflow test and repository review.
Next step
Open the compare page, test the install commands in a sandbox, and check each repository before using a skill in production.