Skill comparison

Compare agent skills before installing.

Put high-signal skills side by side and inspect quality, adoption, freshness, install readiness, use-case fit, and warnings in one place.

Comparing 1 skill

Use this as a shortlist, then open the skill detail page before adopting.

Add more skills
SignalChinese Llm Benchmark

非线智能 NoneLinear - ReLE评测:中文AI大模型能力评测(持续更新):目前已囊括374个大模型,覆盖chatgpt、gpt-5.4、谷歌gemini-3.1-pro、Claude-4.6、文心ERNIE-X1.1、ERNIE-5.0、qwen3.6-max、qwen3.6-plus、百川、讯飞星火、商汤senseChat等商用模型, 以及step3.5-flash、kimi-k2.6、ernie4.5、MiniMax-M2.7、deepseek-v4、Qwen3.6、llama4、智谱GLM-5.1、MiMo-V2、LongCat、gemma4、mistral等开源大模型。不仅提供排行榜,也提供规模超200万的大模型缺陷库!方便广大社区研究分析、改进大模型。

Quality
100/100
Excellent
Adoption6.0K stars
0 installs
FreshnessMay 23, 2026
Use-case fit
Stack fit
Platform hintsLLM, Claude Code, OpenAI Agents
WarningsNo major warning signals
Install
$ npx skills add jeinlee1991/chinese-llm-benchmark