High-intent entry points

Start from the task, not a keyword list.

These shortcuts use the same trust, supply, and relevance signals as the registry API, so humans and agents land on a useful shortlist faster.

Agent-readable index

FinanceStock analysis QuantTrading research SlidesPPT generation DocsPDF parsing ExtractWeb scraping CreativeDesign workflow SportsFootball analytics

Decision filters

Choose by scenario, quality, and trust signals.

Use casePlatform fitQuality tierTrust profileSafety gateGitHub adoption

Showing 1-16 of 703 ranked candidates matching "llm-eval"

Best blend of quality, stars, freshness, and agent usage

Awesome LLM Eval

PROMISING · 67TRUST · 78SAFE · REVIEWEDRESEARCH

Awesome-LLM-Eval: a curated list of tools, datasets/benchmark, demos, leaderboard, papers, docs and models, mainly for Evaluation on LLMs. 一个由工具、基准/数据、演示、排行榜和大模型等组成的精选列表…

$ npx skills add onejune2018/Awesome-LLM-Eval

642 stars42 quality78 trustReviewed with permission notesClaude Code + OpenAI Agents8mo since pushNeeds review

QualityUseful candidate, but compare it with alternatives before adopting.

TrustGood trust signals with a few areas worth checking before rollout.Review: Quality score needs review

Safety gateUsable candidate, but the agent should surface permission and audit notes before installation.

Scenario RAG and knowledge · I need my agent to build a RAG workflow over documents and retrieve reliable context.

Claude Code + OpenAI Agents · 4 targets

rag

by onejune2018DetailsQuick view

Awesome Llm Apps

VERIFIEDEXCELLENT · 100TRUST · 93SAFE · VERIFIEDCODING

100+ AI Agent & RAG apps you can actually run — clone, customize, ship.

$ npx skills add Shubhamsaboo/awesome-llm-apps

114.5K stars78 quality93 trustVerifiedClaude Code24d since pushSafe to try

QualityHigh-confidence pick with strong adoption and healthy maintenance signals.

TrustStrong OpenAgentSkill Trust Score across adoption, recent maintenance, license clarity, documentation, dependency/…

Safety gateStrong metadata, audit, install, and review signals. Suitable for agent shortlists after normal workspace review.

Scenario Coding agents · I need a coding agent that can understand a repository, edit code, and review pull requests.

Claude Code + CLI · 4 targets

pythonrag

by ShubhamsabooDetailsQuick view

Anything Llm

VERIFIEDEXCELLENT · 100TRUST · 92SAFE · VERIFIEDRESEARCH

Stop renting your intelligence. Own it with AnythingLLM. Everything you need for a powerful local-first agent experience

$ npx skills add Mintplex-Labs/anything-llm

62.5K stars77 quality92 trustVerifiedClaude Code4d since pushSafe to try

QualityHigh-confidence pick with strong adoption and healthy maintenance signals.

TrustStrong OpenAgentSkill Trust Score across adoption, recent maintenance, license clarity, documentation, dependency/…

Safety gateStrong metadata, audit, install, and review signals. Suitable for agent shortlists after normal workspace review.

Scenario RAG and knowledge · I need my agent to build a RAG workflow over documents and retrieve reliable context.

Claude Code + CLI · 4 targets

javascriptvector-search

by Mintplex-LabsDetailsQuick view

Llm App

VERIFIEDEXCELLENT · 100TRUST · 93SAFE · VERIFIEDDATA

Ready-to-run cloud templates for RAG, AI pipelines, and enterprise search with live data. 🐳Docker-friendly.⚡Always in sync with Sharepoint, Google Drive, S3, Kafka, Pos…

$ npx skills add pathwaycom/llm-app

59.3K stars77 quality93 trustVerifiedClaude Code27d since pushSafe to try

QualityHigh-confidence pick with strong adoption and healthy maintenance signals.

TrustStrong OpenAgentSkill Trust Score across adoption, recent maintenance, license clarity, documentation, dependency/…

Safety gateStrong metadata, audit, install, and review signals. Suitable for agent shortlists after normal workspace review.

Scenario Database and SQL · I need my agent to inspect database schemas, write SQL, and explain query results.

Claude Code + CLI · 4 targets

jupyter-notebookrag

by pathwaycomDetailsQuick view

Happy Llm

VERIFIEDEXCELLENT · 100TRUST · 87SAFE · REVIEWEDDATA

📚 从零开始构建大模型

$ npx skills add datawhalechina/happy-llm

31.9K stars72 quality87 trustReviewedClaude Code2mo since pushSafe to try

QualityHigh-confidence pick with strong adoption and healthy maintenance signals.

TrustStrong OpenAgentSkill Trust Score across adoption, recent maintenance, license clarity, documentation, dependency/…Review: License is unclear

Safety gateGood audit and safety signals with no high-risk permission hints in public metadata.

Scenario RAG and knowledge · I need my agent to build a RAG workflow over documents and retrieve reliable context.

Claude Code + CLI · 4 targets

jupyter-notebookrag

by datawhalechinaDetailsQuick view

Langfuse

VERIFIEDEXCELLENT · 100TRUST · 90SAFE · REVIEWEDCODING

🪢 Open source AI engineering platform: LLM evals, observability, metrics, prompt management, playground, datasets. Integrates with OpenTelemetry, LangChain, OpenAI SDK,…

$ npx skills add langfuse/langfuse

29.4K stars74 quality90 trustReviewedClaude Code + OpenAI Agents17d since pushSafe to try

QualityHigh-confidence pick with strong adoption and healthy maintenance signals.

TrustStrong OpenAgentSkill Trust Score across adoption, recent maintenance, license clarity, documentation, dependency/…Review: License is unclear

Safety gateGood audit and safety signals with no high-risk permission hints in public metadata.

Scenario Coding agents · I need a coding agent that can understand a repository, edit code, and review pull requests.

Claude Code + OpenAI Agents · 4 targets

typescriptllmops

by langfuseDetailsQuick view

Mlflow

VERIFIEDEXCELLENT · 100TRUST · 94SAFE · VERIFIEDCODING

The open source AI engineering platform for agents, LLMs, and ML models. MLflow enables teams of all sizes to debug, evaluate, monitor, and optimize production-quality A…

$ npx skills add mlflow/mlflow

26.7K stars74 quality94 trustVerifiedClaude Code + LangChain12d since pushSafe to try

QualityHigh-confidence pick with strong adoption and healthy maintenance signals.

TrustStrong OpenAgentSkill Trust Score across adoption, recent maintenance, license clarity, documentation, dependency/…

Safety gateStrong metadata, audit, install, and review signals. Suitable for agent shortlists after normal workspace review.

Scenario Coding agents · I need a coding agent that can understand a repository, edit code, and review pull requests.

Claude Code + LangChain · 4 targets

pythonllmops

by mlflowDetailsQuick view

Opik

VERIFIEDEXCELLENT · 100TRUST · 93SAFE · VERIFIEDCODING

Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.

$ npx skills add comet-ml/opik

19.8K stars73 quality93 trustVerifiedClaude Code + LangChain12d since pushSafe to try

QualityHigh-confidence pick with strong adoption and healthy maintenance signals.

TrustStrong OpenAgentSkill Trust Score across adoption, recent maintenance, license clarity, documentation, dependency/…

Safety gateStrong metadata, audit, install, and review signals. Suitable for agent shortlists after normal workspace review.

Scenario Coding agents · I need a coding agent that can understand a repository, edit code, and review pull requests.

Claude Code + LangChain · 4 targets

pythonllmops

by comet-mlDetailsQuick view

OpenLLM

VERIFIEDEXCELLENT · 100TRUST · 91SAFE · VERIFIEDCODING

Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.

$ npx skills add bentoml/OpenLLM

12.4K stars72 quality91 trustVerifiedClaude Code + OpenAI Agents15d since pushSafe to try

QualityHigh-confidence pick with strong adoption and healthy maintenance signals.

TrustStrong OpenAgentSkill Trust Score across adoption, recent maintenance, license clarity, documentation, dependency/…

Safety gateStrong metadata, audit, install, and review signals. Suitable for agent shortlists after normal workspace review.

Scenario Coding agents · I need a coding agent that can understand a repository, edit code, and review pull requests.

Claude Code + OpenAI Agents · 4 targets

pythonllmops

by bentomlDetailsQuick view

Promptfoo

VERIFIEDEXCELLENT · 100TRUST · 92SAFE · REVIEWEDCODING

Test your prompts, agents, and RAGs. Red teaming/pentesting/vulnerability scanning for AI. Compare performance of GPT, Claude, Gemini, DeepSeek, and more. Simple declara…

$ npx skills add promptfoo/promptfoo

22.2K stars74 quality92 trustReviewed with permission notesClaude Code + OpenAI Agents22d since pushSafe to try

QualityHigh-confidence pick with strong adoption and healthy maintenance signals.

TrustStrong OpenAgentSkill Trust Score across adoption, recent maintenance, license clarity, documentation, dependency/…

Safety gateUsable candidate, but the agent should surface permission and audit notes before installation.

Scenario Coding agents · I need a coding agent that can understand a repository, edit code, and review pull requests.

Claude Code + OpenAI Agents · 4 targets

typescriptrag

by promptfooDetailsQuick view

LLMs From Scratch

VERIFIEDEXCELLENT · 100TRUST · 87SAFE · REVIEWEDCODING

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

$ npx skills add rasbt/LLMs-from-scratch

97.3K stars78 quality87 trustReviewedClaude Code + OpenAI Agents1mo since pushSafe to try

QualityHigh-confidence pick with strong adoption and healthy maintenance signals.

TrustStrong OpenAgentSkill Trust Score across adoption, recent maintenance, license clarity, documentation, dependency/…Review: License is unclear

Safety gateGood audit and safety signals with no high-risk permission hints in public metadata.

Scenario GitHub automation · I need my agent to triage GitHub issues, review pull requests, and summarize repository changes.

Claude Code + OpenAI Agents · 4 targets

jupyter-notebookmachine-learning

by rasbtDetailsQuick view

Ragflow

VERIFIEDEXCELLENT · 100TRUST · 94SAFE · VERIFIEDCODING

RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for L…

$ npx skills add infiniflow/ragflow

82.6K stars78 quality94 trustVerifiedClaude Code24d since pushSafe to try

QualityHigh-confidence pick with strong adoption and healthy maintenance signals.

TrustStrong OpenAgentSkill Trust Score across adoption, recent maintenance, license clarity, documentation, dependency/…

Safety gateStrong metadata, audit, install, and review signals. Suitable for agent shortlists after normal workspace review.

Scenario Coding agents · I need a coding agent that can understand a repository, edit code, and review pull requests.

Claude Code + CLI · 4 targets

pythonai-agents

by infiniflowDetailsQuick view

MiroFish

VERIFIEDEXCELLENT · 100TRUST · 92SAFE · VERIFIEDRESEARCH

A Simple and Universal Swarm Intelligence Engine, Predicting Anything. 简洁通用的群体智能引擎，预测万物

$ npx skills add 666ghj/MiroFish

66.8K stars77 quality92 trustVerifiedClaude Code1mo since pushSafe to try

QualityHigh-confidence pick with strong adoption and healthy maintenance signals.

TrustStrong OpenAgentSkill Trust Score across adoption, recent maintenance, license clarity, documentation, dependency/…

Safety gateStrong metadata, audit, install, and review signals. Suitable for agent shortlists after normal workspace review.

Scenario RAG and knowledge · I need my agent to build a RAG workflow over documents and retrieve reliable context.

Claude Code + CLI · 4 targets

pythonknowledge-graph

by 666ghjDetailsQuick view

Hello Agents

VERIFIEDEXCELLENT · 100TRUST · 86SAFE · REVIEWEDCODING

📚 《从零开始构建智能体》——从零开始的智能体原理与实践教程

$ npx skills add datawhalechina/hello-agents

59.5K stars77 quality86 trustReviewed with permission notesClaude Code26d since pushNeeds review

QualityHigh-confidence pick with strong adoption and healthy maintenance signals.

TrustStrong OpenAgentSkill Trust Score across adoption, recent maintenance, license clarity, documentation, dependency/…Review: License is unclear

Safety gateUsable candidate, but the agent should surface permission and audit notes before installation.

Scenario Coding agents · I need a coding agent that can understand a repository, edit code, and review pull requests.

Claude Code + CLI · 4 targets

pythonrag

by datawhalechinaDetailsQuick view

Llama Index

VERIFIEDEXCELLENT · 100TRUST · 92SAFE · VERIFIEDDATA

LlamaIndex is the leading document agent and OCR platform

$ npx skills add run-llama/llama_index

50.2K stars76 quality92 trustVerifiedClaude Code + LlamaIndex20d since pushSafe to try

QualityHigh-confidence pick with strong adoption and healthy maintenance signals.

TrustStrong OpenAgentSkill Trust Score across adoption, recent maintenance, license clarity, documentation, dependency/…

Safety gateStrong metadata, audit, install, and review signals. Suitable for agent shortlists after normal workspace review.

Scenario RAG and knowledge · I need my agent to build a RAG workflow over documents and retrieve reliable context.

Claude Code + LlamaIndex · 4 targets

pythonrag

by run-llamaDetailsQuick view

LightRAG

VERIFIEDEXCELLENT · 100TRUST · 92SAFE · VERIFIEDRESEARCH

[EMNLP2025] "LightRAG: Simple and Fast Retrieval-Augmented Generation"

$ npx skills add HKUDS/LightRAG

36.8K stars75 quality92 trustVerifiedClaude Code + OpenAI Agents19d since pushSafe to try

QualityHigh-confidence pick with strong adoption and healthy maintenance signals.

TrustStrong OpenAgentSkill Trust Score across adoption, recent maintenance, license clarity, documentation, dependency/…

Safety gateStrong metadata, audit, install, and review signals. Suitable for agent shortlists after normal workspace review.

Scenario RAG and knowledge · I need my agent to build a RAG workflow over documents and retrieve reliable context.

Claude Code + OpenAI Agents · 4 targets

pythonknowledge-graph

by HKUDSDetailsQuick view

Page 1

Showing the strongest 16 results to keep the registry fast for humans and agents. Refine by use case, platform, stars, or search query for a narrower shortlist.

Try the agent resolve API

AI Agent Skills Directory

Real skills, grouped by the work your agent needs to finish.

Build the registry by domain, not just by count.

Coding and developer agents

Research and knowledge work

Presentation and deck workflows

Finance and quant workflows

Marketing and growth automation

Design and creative production

Data, BI, and analytics

Legal, policy, and compliance

Education and tutoring

Football and World Cup analytics

Start from the task, not a keyword list.

Choose by scenario, quality, and trust signals.

Awesome LLM Eval

Awesome Llm Apps

Anything Llm

Llm App

Happy Llm

Langfuse

Mlflow

Opik

OpenLLM

Promptfoo

LLMs From Scratch

Ragflow

MiroFish

Hello Agents

Llama Index

LightRAG