Skill audit report
Chinese Llm Benchmark audit report.
非线智能 NoneLinear - ReLE评测:中文AI大模型能力评测(持续更新):目前已囊括374个大模型,覆盖chatgpt、gpt-5.4、谷歌gemini-3.1-pro、Claude-4.6、文心ERNIE-X1.1、ERNIE-5.0、qwen3.6-max、qwen3.6-plus、百川、讯飞星火、商汤senseChat等商用模型, 以及step3.5-flash、kimi-k2.6、ernie4.5、MiniMax-M2.7、deepseek-v4、Qwen3.6、llama4、智谱GLM-5.1、MiMo-V2、LongCat、gemma4、mistral等开源大模型。不仅提供排行榜,也提供规模超200万的大模型缺陷库!方便广大社区研究分析、改进大模型。
OpenAgentSkill Trust Score
Stars, maintenance, license, docs, dependency risk, and installability.
The Trust Score is OpenAgentSkill's adoption layer. It is designed to help an agent decide whether a skill is safe enough to shortlist before installation.
GitHub adoption
PASS94
6.2K GitHub stars
Recent maintenance
PASS100
9d since push
License clarity
WARN42
Unknown
README/SKILL.md completeness
PASS90
Metadata includes enough usage and workflow context
Dependency risk
PASS90
no major dependency risk hints in public metadata
Install availability
PASS92
npx skills add jeinlee1991/chinese-llm-benchmark
Repository evidence
PASS86
https://github.com/jeinlee1991/chinese-llm-benchmark
Review status
PASS88
AI review data available
Checks
Install and adoption review
Install path
92
npx skills add jeinlee1991/chinese-llm-benchmark
Repository
88
https://github.com/jeinlee1991/chinese-llm-benchmark
License
45
Unknown
Maintenance
100
9d since push
AI review
88
Approved with no listed issues
README/SKILL.md completeness
90
Usable description available
Dependency risk
90
no major dependency risk hints in public metadata
Adoption
88
6.2K GitHub stars
Warnings
- License is unclear
- License clarity: Unknown
Method
This report combines public metadata, AI review output, repository freshness, install readiness, OpenAgentSkill events, quality scoring, trust checks, and the agent safety gate. It is not a full source-code security review.
Compare nearby options