Skill audit report
AgentEval audit report.
AgentEval is the comprehensive .NET toolkit for AI agent evaluation—tool usage validation, RAG quality metrics, stochastic evaluation, and model comparison—built first for Microsoft Agent Framework (MAF) and Microsoft.Extensions.AI. What RAGAS, PromptFoo and DeepEval do for Python, AgentEval does for .NET
OpenAgentSkill Trust Score
Stars, maintenance, license, docs, install safety, permission surface, and installability.
The Trust Score is OpenAgentSkill's adoption layer. It is designed to help an agent decide whether a skill is safe enough to shortlist before installation.
GitHub adoption
INFO62
124 GitHub stars
Stars/forks activity
WARN57
124 stars, 10 forks; issue activity unavailable in current metadata
Recent maintenance
PASS100
13d since push
License clarity
PASS86
MIT
README/SKILL.md completeness
PASS100
Metadata includes enough usage and workflow context
Dependency/runtime risk
PASS90
no major dependency risk hints in public metadata
Install availability
PASS92
npx skills add AgentEvalHQ/AgentEval
Install command safety
PASS92
standard package or runtime install path
Permission surface
PASS86
filesystem or document access
Repository evidence
PASS86
https://github.com/AgentEvalHQ/AgentEval
Review status
PASS88
AI review data available
Agent Proven outcomes
INFO54
No agent outcome data yet
Checks
Install and adoption review
Install path
92
npx skills add AgentEvalHQ/AgentEval
Repository
88
https://github.com/AgentEvalHQ/AgentEval
License
86
MIT
Maintenance
100
13d since push
AI review
88
Approved with no listed issues
README/SKILL.md completeness
100
Usable description available
Dependency risk
90
no major dependency risk hints in public metadata
Install command safety
92
standard package or runtime install path
Permission surface
86
filesystem or document access
Stars/forks activity
57
124 stars, 10 forks; issue activity unavailable in current metadata
Adoption
68
124 GitHub stars
Warnings
- Quality score needs review
- Stars/forks activity: 124 stars, 10 forks; issue activity unavailable in current metadata
Method
This report combines public metadata, AI review output, repository freshness, install readiness, OpenAgentSkill events, quality scoring, trust checks, and the agent safety gate. It is not a full source-code security review.
Compare nearby options