OpenAgentSkill Registry Manifest Skill: Awesome LLM Eval Slug: onejune2018-awesome-llm-eval Category: rag-knowledge Description: Awesome-LLM-Eval: a curated list of tools, datasets/benchmark, demos, leaderboard, papers, docs and models, mainly for Evaluation on LLMs. 一个由工具、基准/数据、演示、排行榜和大模型等组成的精选列表,主要面向基础大模型评测,旨在探求生成式AI的技术边界. Agent fit: - Decision: 69/100 Prototype first - Primary fit: RAG and knowledge - Role: Fallback candidate Supply profile: - Track: Research and knowledge work - Scenario: RAG and knowledge - Applicable agents: Claude Code, OpenAI Agents, CLI, Codex, Cursor - Maintenance: 7mo since push - Risk: Needs review Trust: - Trust score: 80/100 Strong shortlist - Audit: 76/100 Needs review Attribution: - Status: Community indexed - Source: GitHub star discovery - Creator: onejune2018 - Claim URL: https://www.openagentskill.com/skills/onejune2018-awesome-llm-eval#claim-this-skill Install: npx skills add onejune2018/Awesome-LLM-Eval URLs: - Web: https://www.openagentskill.com/skills/onejune2018-awesome-llm-eval - API: https://www.openagentskill.com/api/agent/skills/onejune2018-awesome-llm-eval - Install API: https://www.openagentskill.com/api/skills/onejune2018-awesome-llm-eval/install - Repository: https://github.com/onejune2018/Awesome-LLM-Eval