Skill audit report

Chinese Llm Benchmark audit report.

非线智能 NoneLinear - ReLE评测:中文AI大模型能力评测(持续更新):目前已囊括374个大模型,覆盖chatgpt、gpt-5.4、谷歌gemini-3.1-pro、Claude-4.6、文心ERNIE-X1.1、ERNIE-5.0、qwen3.6-max、qwen3.6-plus、百川、讯飞星火、商汤senseChat等商用模型, 以及step3.5-flash、kimi-k2.6、ernie4.5、MiniMax-M2.7、deepseek-v4、Qwen3.6、llama4、智谱GLM-5.1、MiMo-V2、LongCat、gemma4、mistral等开源大模型。不仅提供排行榜,也提供规模超200万的大模型缺陷库!方便广大社区研究分析、改进大模型。

REVIEWED · REVIEWSafe to tryGenerated Jun 16, 2026Heuristic metadata audit
95
Audit
93
Trust
100
Quality
89
Security
100
Maintain
92
Install

OpenAgentSkill Trust Score

93
Production candidate

Stars, maintenance, license, docs, dependency risk, and installability.

The Trust Score is OpenAgentSkill's adoption layer. It is designed to help an agent decide whether a skill is safe enough to shortlist before installation.

GitHub adoption

PASS

94

6.2K GitHub stars

Recent maintenance

PASS

100

9d since push

License clarity

WARN

42

Unknown

README/SKILL.md completeness

PASS

90

Metadata includes enough usage and workflow context

Dependency risk

PASS

90

no major dependency risk hints in public metadata

Install availability

PASS

92

npx skills add jeinlee1991/chinese-llm-benchmark

Repository evidence

PASS

86

https://github.com/jeinlee1991/chinese-llm-benchmark

Review status

PASS

88

AI review data available

Checks

Install and adoption review

7 passed · 3 review

Install path

92

PASS

npx skills add jeinlee1991/chinese-llm-benchmark

Repository

88

PASS

https://github.com/jeinlee1991/chinese-llm-benchmark

License

45

CHECK

Unknown

Maintenance

100

PASS

9d since push

AI review

88

PASS

Approved with no listed issues

README/SKILL.md completeness

90

PASS

Usable description available

Dependency risk

90

PASS

no major dependency risk hints in public metadata

Adoption

88

PASS

6.2K GitHub stars

Warnings

  • License is unclear
  • License clarity: Unknown

Method

This report combines public metadata, AI review output, repository freshness, install readiness, OpenAgentSkill events, quality scoring, trust checks, and the agent safety gate. It is not a full source-code security review.

Compare nearby options

Related skills to audit next