A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
$ npx skills add THUDM/AgentBench
Alternatives
Compare similar skills by workflow fit, trust score, quality, GitHub adoption, maintenance, and install readiness.
Current skill
[ACL 2024 Findings] MedAgents: Large Language Models as Collaborators for Zero-shot Medical Reasoning https://arxiv.org/abs/2311.10537
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
$ npx skills add THUDM/AgentBench
The LLM Anti-Framework
$ npx skills add Mirascope/mirascope
"DeepCode: Open Agentic Coding (Paper2Code & Text2Web & Text2Backend)"
$ npx skills add HKUDS/DeepCode
AIlice is a fully autonomous, general-purpose AI agent.
$ npx skills add myshell-ai/AIlice
Pocket Flow: Codebase to Tutorial
$ npx skills add The-Pocket/PocketFlow-Tutorial-Codebase-Knowledge
MaiSaka, an LLM-based intelligent agent, is a digital lifeform devoted to understanding you and interacting in the style of a real human. She does not pursue perfection, nor does she seek efficiency; instead, she values warmth, authenticity, and genuine connection.
$ npx skills add Mai-with-u/MaiBot
The RL Bridge for LLM-based Agent Applications. Made Simple & Flexible.
$ npx skills add areal-project/AReaL
Official Repo for ICML 2024 paper "Executable Code Actions Elicit Better LLM Agents" by Xingyao Wang, Yangyi Chen, Lifan Yuan, Yizhe Zhang, Yunzhu Li, Hao Peng, Heng Ji.
$ npx skills add xingyaoww/code-act
An open-source, LLM-based workflow showcase that automatically collects daily news, powered by the Agently AI application development framework.
$ npx skills add AgentEra/Agently-Daily-News-Collector
🌐 Make websites accessible for AI agents. Automate tasks online with ease.
$ npx skills add browser-use/browser-use
[ICLR'25] OpenRCA: Can Large Language Models Locate the Root Cause of Software Failures?
$ npx skills add microsoft/OpenRCA
The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). It allows users to chat with LLMs, execute structured function calls, and get structured output. It also works with models not fine-tuned for JSON output and function calls.
$ npx skills add Maximilian-Winter/llama-cpp-agent
A curated list of awesome LLM agent frameworks.
$ npx skills add kaushikb11/awesome-llm-agents
Automated Penetration Testing Agentic Framework Powered by Large Language Models
$ npx skills add GreyDGL/PentestGPT
A repo that lists papers related to LLM-based agents
$ npx skills add AGI-Edgerunners/LLM-Agents-Papers
Build production-ready AI agents in both Python and TypeScript.
$ npx skills add i-am-bee/beeai-framework
How to choose
Use an alternative when it has a clearer install path, higher trust score, fresher maintenance, or better platform fit for your current agent stack. Keep MedAgents if it already passes your workflow test and repository review.
Next step
Open the compare page, test the install commands in a sandbox, and check each repository before using a skill in production.