Skill comparison

Compare agent skills before installing.

Put high-signal skills side by side and inspect quality, adoption, freshness, install readiness, use-case fit, and warnings in one place.

Comparing 4 skills

Use this as a shortlist, then open the skill detail page before adopting.

Add more skills

Decision summary

Mlflow is the strongest overall pick here because it has a 100/100 readiness score and fits Coding agents.

Strongest overall

Mlflow

Use this as a leading candidate, then validate the README and install path in your own agent stack.

Fastest prototype

Mlflow

Best first install candidate based on install readiness and adoption.

Freshest repo

Phoenix

Most recent maintenance signal among this shortlist.

SignalPhoenix

AI Observability & Evaluation

Mlflow

The open source AI engineering platform for agents, LLMs, and ML models. MLflow enables teams of all sizes to debug, evaluate, monitor, and optimize production-quality AI applications while controlling costs and managing access to models and data.

Lmnr

Laminar - open-source observability platform purpose-built for AI agents. YC S24.

Opik

Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.

Quality
100/100
Excellent
100/100
Excellent
100/100
Excellent
100/100
Excellent
Decision verdict
100/100
Production-ready

Use this as a leading candidate, then validate the README and install path in your own agent stack.

100/100
Production-ready

Use this as a leading candidate, then validate the README and install path in your own agent stack.

100/100
Production-ready

Use this as a leading candidate, then validate the README and install path in your own agent stack.

100/100
Production-ready

Use this as a leading candidate, then validate the README and install path in your own agent stack.

Adoption10K stars
0 installs
26K stars
0 installs
3.0K stars
0 installs
19K stars
0 installs
FreshnessJun 9, 2026Jun 5, 2026Jun 6, 2026Jun 5, 2026
Use-case fit
Stack fit
Platform hintsPython, LLMOps, Claude Code, LangChainPython, LLMOps, Claude Code, LangChainTypeScript, LLMOps, Claude CodePython, LLMOps, Claude Code, LangChain
WarningsNo major risk signals from current metadataNo major risk signals from current metadataNo major risk signals from current metadataNo major risk signals from current metadata
Best forCoding agents workflows · Claude Code teams · teams that value GitHub adoption signalsCoding agents workflows · Claude Code teams · teams that value GitHub adoption signalsBrowser automation workflows · Claude Code teams · teams that value GitHub adoption signalsCoding agents workflows · Claude Code teams · teams that value GitHub adoption signals
Not ideal forteams that need a vendor-supported SLA · high-compliance environments without internal security reviewteams that need a vendor-supported SLA · high-compliance environments without internal security reviewteams that need a vendor-supported SLA · high-compliance environments without internal security reviewteams that need a vendor-supported SLA · high-compliance environments without internal security review
OpenAgentSkill engagement2 views
0 install copies
1 views
0 install copies
2 views
0 install copies
3 views
0 install copies
Install
$ npx skills add Arize-ai/phoenix
$ npx skills add mlflow/mlflow
$ npx skills add lmnr-ai/lmnr
$ npx skills add comet-ml/opik