Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks
$ npx skills add open-compass/VLMEvalKitAlternatives
Compare similar skills by workflow fit, trust score, quality, GitHub adoption, maintenance, and install readiness.
Current skill
This series will take you on a journey from the fundamentals of NLP and Computer Vision to the cutting edge of Vision-Language Models.
Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks
$ npx skills add open-compass/VLMEvalKitOpen Machine Learning course
$ npx skills add girafe-ai/ml-courseThis repository is a compilation of free resources for learning Data Science.
$ npx skills add sreeharierk/datascienceA python library for self-supervised learning on images.
$ npx skills add lightly-ai/lightlyLightweight, useful implementation of conformal prediction on real data.
$ npx skills add aangelopoulos/conformal-predictionAll-in-one training for vision models (YOLO, ViTs, RT-DETR, DINOv3): pretraining, fine-tuning, distillation.
$ npx skills add lightly-ai/lightly-trainA Unified Semi-Supervised Learning Codebase (NeurIPS'22)
$ npx skills add microsoft/Semi-supervised-learning📚 Jupyter notebook tutorials for OpenVINO™
$ npx skills add openvinotoolkit/openvino_notebooksModel explainability that works seamlessly with 🤗 transformers. Explain your transformers model in just 2 lines of code.
$ npx skills add cdpierse/transformers-interpretOfficial Repository for "Mish: A Self Regularized Non-Monotonic Neural Activation Function" [BMVC 2020]
$ npx skills add digantamisra98/MishTracking Any Point (TAP)
$ npx skills add google-deepmind/tapnetSearch photos on Unsplash using natural language
$ npx skills add haltakov/natural-language-image-searchcomputer vision projects | 计算机视觉相关好玩的AI项目(Python、C++、embedded system)
$ npx skills add enpeizhao/CVprojectsObjectron is a dataset of short, object-centric video clips. In addition, the videos also contain AR session metadata including camera poses, sparse point-clouds and planes. In each video, the camera moves around and above the object and captures it from different views. Each object is annotated with a 3D bounding box. The 3D bounding box describes the object’s position, orientation, and dimensions. The dataset contains about 15K annotated video clips and 4M annotated images in the following categories: bikes, books, bottles, cameras, cereal boxes, chairs, cups, laptops, and shoes
$ npx skills add google-research-datasets/ObjectronTowhee is a framework that is dedicated to making neural data processing pipelines simple and fast.
$ npx skills add towhee-io/towheeA collection of scientific methods, processes, algorithms, and systems to build stories & models.
$ npx skills add hemansnation/AI-Engineer-HeadquartersHow to choose
Use an alternative when it has a clearer install path, higher trust score, fresher maintenance, or better platform fit for your current agent stack. Keep Vlms Zero To Hero if it already passes your workflow test and repository review.
Next step
Open the compare page, test the install commands in a sandbox, and check each repository before using a skill in production.