Alternatives

ChatDev alternatives for AI agents.

Compare similar skills by workflow fit, trust score, quality, GitHub adoption, maintenance, and install readiness.

Current skill

ChatDev

Use multi-agent collaboration for software development

100
Quality
100
Trust
33K
Stars
#1

Mlflow

Similarity 96Trust 100Excellent 100

The open source AI engineering platform for agents, LLMs, and ML models. MLflow enables teams of all sizes to debug, evaluate, monitor, and optimize production-quality AI applications while controlling costs and managing access to models and data.

26K starsJun 5, 2026 pushdevelopmentPythonLLMOps
$ npx skills add mlflow/mlflow
#2

Opik

Similarity 96Trust 100Excellent 100

Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.

19K starsJun 5, 2026 pushdevelopmentPythonLLMOps
$ npx skills add comet-ml/opik
#3

RagaAI Catalyst

Similarity 96Trust 100Excellent 100

Python SDK for Agent AI Observability, Monitoring and Evaluation Framework. Includes features like agent, llm and tools tracing, debugging multi-agentic system, self-hosted dashboard and advanced analytics with timeline and execution graph view

16K starsFeb 11, 2026 pushdevelopmentPythonLLMOps
$ npx skills add raga-ai-hub/RagaAI-Catalyst
#4

Mastg

Similarity 95Trust 100Excellent 100

The OWASP Mobile Application Security Testing Guide (MASTG) is a comprehensive manual for mobile app security testing and reverse engineering. It describes technical processes for verifying the OWASP Mobile Security Weakness Enumeration (MASWE) weaknesses, which are in alignment with the OWASP MASVS.

13K starsJun 5, 2026 pushdevelopmentPythonStatic Analysis
$ npx skills add OWASP/mastg
#5

E2B

Similarity 95Trust 100Excellent 100

Run agent code safely in cloud sandboxes

13K starsJun 10, 2026 pushdevelopmentPythonSandbox
$ npx skills add e2b-dev/E2B
#6

Pr Agent

Similarity 95Trust 100Excellent 100

๐Ÿš€ PR Agent: The Original Open-Source PR Reviewer. This project It is not the Qodo free tier.

12K starsJun 6, 2026 pushdevelopmentPythonCode Review
$ npx skills add The-PR-Agent/pr-agent
#7

Metaflow

Similarity 95Trust 100Excellent 100

Build, Manage and Deploy AI/ML Systems

10K starsJun 5, 2026 pushdevelopmentPythonLLMOps
$ npx skills add Netflix/metaflow
#8

Phoenix

Similarity 95Trust 100Excellent 100

AI Observability & Evaluation

10K starsJun 9, 2026 pushdevelopmentPythonLLMOps
$ npx skills add Arize-ai/phoenix
#9

Checkov

Similarity 94Trust 100Excellent 100

Prevent cloud misconfigurations and find vulnerabilities during build-time in infrastructure as code, container images and open source packages with Checkov by Bridgecrew.

8.8K starsJun 7, 2026 pushdevelopmentPythonStatic Analysis
$ npx skills add bridgecrewio/checkov
#10

DeepAudit

Similarity 94Trust 100Excellent 100

DeepAudit๏ผšไบบไบบๆ‹ฅๆœ‰็š„ AI ้ป‘ๅฎขๆˆ˜้˜Ÿ๏ผŒ่ฎฉๆผๆดžๆŒ–ๆŽ˜่งฆๆ‰‹ๅฏๅŠใ€‚ๅ›ฝๅ†…้ฆ–ไธชๅผ€ๆบ็š„ไปฃ็ ๆผๆดžๆŒ–ๆŽ˜ๅคšๆ™บ่ƒฝไฝ“็ณป็ปŸใ€‚ๅฐ็™ฝไธ€้”ฎ้ƒจ็ฝฒ่ฟ่กŒ๏ผŒ่‡ชไธปๅไฝœๅฎก่ฎก + ่‡ชๅŠจๅŒ–ๆฒ™็ฎฑ PoC ้ชŒ่ฏใ€‚ๆ”ฏๆŒ Ollama ็งๆœ‰้ƒจ็ฝฒ ๏ผŒไธ€้”ฎ็”ŸๆˆๆŠฅๅ‘Šใ€‚ๆ”ฏๆŒไธญ่ฝฌ็ซ™ใ€‚โ€‹่ฎฉๅฎ‰ๅ…จไธๅ†ๆ˜‚่ดต๏ผŒ่ฎฉๅฎก่ฎกไธๅ†ๅคๆ‚ใ€‚

6.3K starsApr 1, 2026 pushdevelopmentPythonCode Review
$ npx skills add lintsinghua/DeepAudit
#11

Slither

Similarity 94Trust 100Excellent 100

Static Analyzer for Solidity and Vyper

6.3K starsJun 5, 2026 pushdevelopmentPythonStatic Analysis
$ npx skills add crytic/slither
#12

Pylint

Similarity 94Trust 100Excellent 100

It's not just a linter that annoys you!

5.7K starsJun 6, 2026 pushdevelopmentPythonStatic Analysis
$ npx skills add pylint-dev/pylint
#13

Zenml

Similarity 94Trust 100Excellent 100

ZenML ๐Ÿ™: One AI Platform from Pipelines to Agents. https://zenml.io.

5.4K starsJun 7, 2026 pushdevelopmentPythonLLMOps
$ npx skills add zenml-io/zenml
#14

Giskard Oss

Similarity 94Trust 100Excellent 100

๐Ÿข Open-Source Evaluation & Testing library for LLM Agents

5.4K starsJun 5, 2026 pushdevelopmentPythonLLMOps
$ npx skills add Giskard-AI/giskard-oss
#15

Llm Twin Course

Similarity 93Trust 100Excellent 100

๐Ÿค– ๐—Ÿ๐—ฒ๐—ฎ๐—ฟ๐—ป for ๐—ณ๐—ฟ๐—ฒ๐—ฒ how to ๐—ฏ๐˜‚๐—ถ๐—น๐—ฑ an end-to-end ๐—ฝ๐—ฟ๐—ผ๐—ฑ๐˜‚๐—ฐ๐˜๐—ถ๐—ผ๐—ป-๐—ฟ๐—ฒ๐—ฎ๐—ฑ๐˜† ๐—Ÿ๐—Ÿ๐—  & ๐—ฅ๐—”๐—š ๐˜€๐˜†๐˜€๐˜๐—ฒ๐—บ using ๐—Ÿ๐—Ÿ๐— ๐—ข๐—ฝ๐˜€ best practices: ~ ๐˜ด๐˜ฐ๐˜ถ๐˜ณ๐˜ค๐˜ฆ ๐˜ค๐˜ฐ๐˜ฅ๐˜ฆ + 12 ๐˜ฉ๐˜ข๐˜ฏ๐˜ฅ๐˜ด-๐˜ฐ๐˜ฏ ๐˜ญ๐˜ฆ๐˜ด๐˜ด๐˜ฐ๐˜ฏ๐˜ด

4.4K starsApr 20, 2026 pushdevelopmentPythonLLMOps
$ npx skills add decodingai-magazine/llm-twin-course
#16

Flake8

Similarity 93Trust 100Excellent 100

flake8 is a python tool that glues together pycodestyle, pyflakes, mccabe, and third-party plugins to check the style and quality of some python code.

3.8K starsMay 19, 2026 pushdevelopmentPythonStatic Analysis
$ npx skills add PyCQA/flake8

How to choose

When should you switch?

Use an alternative when it has a clearer install path, higher trust score, fresher maintenance, or better platform fit for your current agent stack. Keep ChatDev if it already passes your workflow test and repository review.

Next step

Compare top candidates side by side

Open the compare page, test the install commands in a sandbox, and check each repository before using a skill in production.