Skill audit report

Awesome LLM Eval audit report.

Awesome-LLM-Eval: a curated list of tools, datasets/benchmark, demos, leaderboard, papers, docs and models, mainly for Evaluation on LLMs. 一个由工具、基准/数据、演示、排行榜和大模型等组成的精选列表,主要面向基础大模型评测,旨在探求生成式AI的技术边界.

REVIEWED · REVIEWNeeds reviewGenerated Jun 16, 2026Heuristic metadata audit
79
Audit
82
Trust
67
Quality
97
Security
62
Maintain
92
Install

OpenAgentSkill Trust Score

82
Strong shortlist

Stars, maintenance, license, docs, dependency risk, and installability.

The Trust Score is OpenAgentSkill's adoption layer. It is designed to help an agent decide whether a skill is safe enough to shortlist before installation.

GitHub adoption

INFO

76

642 GitHub stars

Recent maintenance

INFO

62

7mo since push

License clarity

PASS

86

MIT

README/SKILL.md completeness

PASS

90

Metadata includes enough usage and workflow context

Dependency risk

PASS

90

no major dependency risk hints in public metadata

Install availability

PASS

92

npx skills add onejune2018/Awesome-LLM-Eval

Repository evidence

PASS

86

https://github.com/onejune2018/Awesome-LLM-Eval

Review status

PASS

88

AI review data available

Checks

Install and adoption review

7 passed · 2 review

Install path

92

PASS

npx skills add onejune2018/Awesome-LLM-Eval

Repository

88

PASS

https://github.com/onejune2018/Awesome-LLM-Eval

License

86

PASS

MIT

Maintenance

62

CHECK

7mo since push

AI review

88

PASS

Approved with no listed issues

README/SKILL.md completeness

90

PASS

Usable description available

Dependency risk

90

PASS

no major dependency risk hints in public metadata

Adoption

88

PASS

642 GitHub stars

Warnings

  • Quality score needs review

Method

This report combines public metadata, AI review output, repository freshness, install readiness, OpenAgentSkill events, quality scoring, trust checks, and the agent safety gate. It is not a full source-code security review.

Compare nearby options

Related skills to audit next