Skill comparison

Compare agent skills before installing.

Put high-signal skills side by side and inspect quality, adoption, freshness, install readiness, use-case fit, and warnings in one place.

Comparing 4 skills

Use this as a shortlist, then open the skill detail page before adopting.


Decision summary

Promptfoo is the strongest overall pick here: it has a 100/100 readiness score, the most GitHub stars on the shortlist (22K), and fits coding-agent workflows.

Strongest overall

Promptfoo

Use this as a leading candidate, then validate the README and install path in your own agent stack.

Fastest prototype

Promptfoo

Best first install candidate based on install readiness and adoption.

Freshest repo

Promptfoo

Most recent maintenance signal among this shortlist.
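
As a rough illustration of how the three picks above could be derived from the comparison data on this page, here is a minimal sketch. The `Skill` class, the function names, and the tie-break order (readiness, then stars, then freshness) are assumptions made for illustration; the page does not document its actual ranking logic. Star counts are the rounded figures from the Adoption row.

```python
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class Skill:
    """Hypothetical record holding the signals shown in the comparison."""
    name: str
    readiness: int       # readiness score out of 100
    stars: int           # rounded GitHub star count
    last_updated: date   # freshness date


# Values copied from the comparison on this page.
SHORTLIST = [
    Skill("Promptfoo", 100, 22_000, date(2026, 6, 9)),
    Skill("Opik", 100, 19_000, date(2026, 6, 5)),
    Skill("Giskard Oss", 100, 5_400, date(2026, 6, 5)),
    Skill("Helicone", 100, 5_800, date(2026, 5, 18)),
]


def strongest_overall(skills):
    # Assumed tie-break order: readiness, then stars, then freshness.
    return max(skills, key=lambda s: (s.readiness, s.stars, s.last_updated))


def fastest_prototype(skills):
    # Assumed proxy for "install readiness and adoption".
    return max(skills, key=lambda s: (s.readiness, s.stars))


def freshest_repo(skills):
    # Most recent maintenance signal wins.
    return max(skills, key=lambda s: s.last_updated)


if __name__ == "__main__":
    for label, pick in [
        ("Strongest overall", strongest_overall),
        ("Fastest prototype", fastest_prototype),
        ("Freshest repo", freshest_repo),
    ]:
        print(f"{label}: {pick(SHORTLIST).name}")
```

Because all four skills share the same readiness score, the result under these assumptions is decided entirely by the tie-breakers, so star count and freshness are worth checking yourself before adopting.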

Signal

Promptfoo

Test your prompts, agents, and RAGs. Red teaming/pentesting/vulnerability scanning for AI. Compare performance of GPT, Claude, Gemini, DeepSeek, and more. Simple declarative configs with command line and CI/CD integration. Used by OpenAI and Anthropic.

Opik

Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.

Giskard Oss

🐢 Open-Source Evaluation & Testing library for LLM Agents

Helicone

🧊 Open source LLM observability platform. One line of code to monitor, evaluate, and experiment. YC W23 🍓

Quality
All four skills: 100/100 (Excellent)

Decision verdict
All four skills: 100/100 (Production-ready). Use each as a leading candidate, then validate the README and install path in your own agent stack.

Adoption
Promptfoo: 22K stars, 0 installs
Opik: 19K stars, 0 installs
Giskard Oss: 5.4K stars, 0 installs
Helicone: 5.8K stars, 0 installs

Freshness
Promptfoo: Jun 9, 2026
Opik: Jun 5, 2026
Giskard Oss: Jun 5, 2026
Helicone: May 18, 2026

Use-case fit
Stack fit

Platform hints
Promptfoo: TypeScript, LLMOps, Claude Code, OpenAI Agents
Opik: Python, LLMOps, Claude Code, LangChain
Giskard Oss: Python, LLMOps, Claude Code
Helicone: TypeScript, LLMOps, Claude Code, OpenAI Agents, LangChain

Warnings
All four skills: No major risk signals from current metadata

Best for
All four skills: Coding-agent workflows · Claude Code teams · teams that value GitHub adoption signals

Not ideal for
All four skills: teams that need a vendor-supported SLA · high-compliance environments without internal security review

OpenAgentSkill engagement
Promptfoo: 3 views, 0 install copies
Opik: 3 views, 0 install copies
Giskard Oss: 1 view, 0 install copies
Helicone: 1 view, 0 install copies

Install
$ npx skills add promptfoo/promptfoo
$ npx skills add comet-ml/opik
$ npx skills add Giskard-AI/giskard-oss
$ npx skills add Helicone/helicone