Skill audit report

Chinese LLM Benchmark audit report.

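NoneLinear (非线智能) - ReLE evaluation: a continuously updated capability benchmark for Chinese-language AI large models. It currently covers 374 large models, including commercial models such as chatgpt, gpt-5.4, Google gemini-3.1-pro, Claude-4.6, Wenxin ERNIE-X1.1, ERNIE-5.0, qwen3.6-max, qwen3.6-plus, Baichuan, iFlytek Spark, and SenseTime senseChat, as well as open-source models such as step3.5-flash, kimi-k2.6, ernie4.5, MiniMax-M2.7, deepseek-v4, Qwen3.6, llama4, Zhipu GLM-5.1, MiMo-V2, LongCat, gemma4, and mistral. Beyond leaderboards, it also provides a large-model defect library of more than 2 million entries, so the community can study, analyze, and improve large models.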

REVIEWED · REVIEW · Safe to try · Generated Jun 16, 2026 · Heuristic metadata audit

Audit

Trust

100

Quality

Security

100

Maintain

Install

OpenAgentSkill Trust Score

Production candidate

Stars, maintenance, license, docs, dependency risk, and installability.

The Trust Score is OpenAgentSkill's adoption layer. It is designed to help an agent decide whether a skill is safe enough to shortlist before installation.

GitHub adoption

PASS

6.2K GitHub stars

Recent maintenance

PASS

100

9d since push

License clarity

WARN

Unknown

README/SKILL.md completeness

PASS

Metadata includes enough usage and workflow context

Dependency risk

PASS

No major dependency risk hints in public metadata

Install availability

PASS

npx skills add jeinlee1991/chinese-llm-benchmark

Repository evidence

PASS

https://github.com/jeinlee1991/chinese-llm-benchmark

Review status

PASS

AI review data available

Checks

Install and adoption review

7 passed · 3 review

Install path

PASS

npx skills add jeinlee1991/chinese-llm-benchmark

Repository

PASS

https://github.com/jeinlee1991/chinese-llm-benchmark

License

CHECK

Unknown

Maintenance

100

PASS

9d since push

AI review

PASS

Approved with no listed issues

README/SKILL.md completeness

PASS

Usable description available

Dependency risk

PASS

No major dependency risk hints in public metadata

Adoption

PASS

6.2K GitHub stars
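
The eight checks itemized above can be tallied with a short script. This is a minimal sketch, assuming the statuses exactly as transcribed in this report; the `CHECKS` mapping and the `summarize` helper are hypothetical names for illustration and are not part of OpenAgentSkill or its scoring method.

```python
from collections import Counter

# Statuses transcribed from the checks itemized in this report.
# The dictionary layout is an assumption made for illustration only.
CHECKS = {
    "Install path": "PASS",
    "Repository": "PASS",
    "License": "CHECK",
    "Maintenance": "PASS",
    "AI review": "PASS",
    "README/SKILL.md completeness": "PASS",
    "Dependency risk": "PASS",
    "Adoption": "PASS",
}


def summarize(checks):
    """Count each status and list the checks that did not pass (hypothetical helper)."""
    counts = Counter(checks.values())
    flagged = [name for name, status in checks.items() if status != "PASS"]
    return counts, flagged


counts, flagged = summarize(CHECKS)
print(f"{counts['PASS']} passed; needs review: {', '.join(flagged)}")
```

A tally like this only surfaces which items to look at before installing, here the license; it does not reproduce the Trust Score itself.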

Warnings

License is unclear
License clarity: Unknown

Method

This report combines public metadata, AI review output, repository freshness, install readiness, OpenAgentSkill events, quality scoring, trust checks, and the agent safety gate. It is not a full source-code security review.
