A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
$ npx skills add THUDM/AgentBenchAlternatives
Compare similar skills by workflow fit, trust score, quality, GitHub adoption, maintenance, and install readiness.
Current skill
The LLM Anti-Framework
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
$ npx skills add THUDM/AgentBench"DeepCode: Open Agentic Coding (Paper2Code & Text2Web & Text2Backend)"
$ npx skills add HKUDS/DeepCodeAIlice is a fully autonomous, general-purpose AI agent.
$ npx skills add myshell-ai/AIliceBuild production-ready AI agents in both Python and Typescript.
$ npx skills add i-am-bee/beeai-frameworkMaiSaka, an LLM-based intelligent agent, is a digital lifeform devoted to understanding you and interacting in the style of a real human. She does not pursue perfection, nor does she seek efficiency; instead, she values warmth, authenticity, and genuine connection.
$ npx skills add Mai-with-u/MaiBotThe RL Bridge for LLM-based Agent Applications. Made Simple & Flexible.
$ npx skills add areal-project/AReaLOfficial Repo for ICML 2024 paper "Executable Code Actions Elicit Better LLM Agents" by Xingyao Wang, Yangyi Chen, Lifan Yuan, Yizhe Zhang, Yunzhu Li, Hao Peng, Heng Ji.
$ npx skills add xingyaoww/code-act🌐 Make websites accessible for AI agents. Automate tasks online with ease.
$ npx skills add browser-use/browser-useBuild, run, and manage agent platforms.
$ npx skills add agno-agi/agno[ICLR'25] OpenRCA: Can Large Language Models Locate the Root Cause of Software Failures?
$ npx skills add microsoft/OpenRCAThe llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). Allowing users to chat with LLM models, execute structured function calls and get structured output. Works also with models not fine-tuned to JSON output and function calls.
$ npx skills add Maximilian-Winter/llama-cpp-agentA curated list of awesome LLM agents frameworks.
$ npx skills add kaushikb11/awesome-llm-agentsAutomated Penetration Testing Agentic Framework Powered by Large Language Models
$ npx skills add GreyDGL/PentestGPT非线智能 NoneLinear - ReLE评测:中文AI大模型能力评测(持续更新):目前已囊括374个大模型,覆盖chatgpt、gpt-5.4、谷歌gemini-3.1-pro、Claude-4.6、文心ERNIE-X1.1、ERNIE-5.0、qwen3.6-max、qwen3.6-plus、百川、讯飞星火、商汤senseChat等商用模型, 以及step3.5-flash、kimi-k2.6、ernie4.5、MiniMax-M2.7、deepseek-v4、Qwen3.6、llama4、智谱GLM-5.1、MiMo-V2、LongCat、gemma4、mistral等开源大模型。不仅提供排行榜,也提供规模超200万的大模型缺陷库!方便广大社区研究分析、改进大模型。
$ npx skills add jeinlee1991/chinese-llm-benchmarkPocket Flow: Codebase to Tutorial
$ npx skills add The-Pocket/PocketFlow-Tutorial-Codebase-KnowledgeA repo lists papers related to LLM based agent
$ npx skills add AGI-Edgerunners/LLM-Agents-PapersHow to choose
Use an alternative when it has a clearer install path, higher trust score, fresher maintenance, or better platform fit for your current agent stack. Keep Mirascope if it already passes your workflow test and repository review.
Next step
Open the compare page, test the install commands in a sandbox, and check each repository before using a skill in production.