Alternatives

Evidently alternatives for AI agents.

Compare similar skills by workflow fit, trust score, quality, GitHub adoption, maintenance, and install readiness.

Current skill

Evidently

Evidently is ​​an open-source ML and LLM observability framework. Evaluate, test, and monitor any AI-powered system or data pipeline. From tabular data to Gen AI. 100+ metrics.

100
Quality
95
Trust
7.6K
Stars
#1

Burr

Similarity 124Trust 96Excellent 100

Build applications that make decisions (chatbots, agents, simulations, etc...). Monitor, trace, persist, and execute on your own infrastructure.

2.4K starsJun 13, 2026 pushdevelopmentPythonLLMOps
$ npx skills add apache/burr
#2

Opik

Similarity 120Trust 98Excellent 100

Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.

20K starsJun 13, 2026 pushdevelopmentPythonLLMOps
$ npx skills add comet-ml/opik
#3

BentoML

Similarity 118Trust 97Excellent 100

The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!

8.7K starsJun 3, 2026 pushdevelopmentPythonLLMOps
$ npx skills add bentoml/BentoML
#4

Openllmetry

Similarity 118Trust 97Excellent 100

Open-source observability for your GenAI or LLM application, based on OpenTelemetry

7.2K starsJun 12, 2026 pushdevelopmentPythonLLMOps
$ npx skills add traceloop/openllmetry
#5

Plano

Similarity 117Trust 97Excellent 100

Plano is an AI-native proxy and data plane for agentic apps — with built-in orchestration, safety, observability, and smart LLM routing so you stay focused on your agents core logic.

6.6K starsJun 12, 2026 pushdevelopmentRustLLMOps
$ npx skills add katanemo/plano
#6

Agent Starter Pack

Similarity 117Trust 97Excellent 100

Ship AI Agents to Google Cloud in minutes, not months. Production-ready templates with built-in CI/CD, evaluation, and observability.

6.5K starsJun 12, 2026 pushdevelopmentPythonLLMOps
$ npx skills add GoogleCloudPlatform/agent-starter-pack
#7

Rig

Similarity 117Trust 94Excellent 100

⚙️🦀 Build modular and scalable LLM Applications in Rust

7.6K starsJun 12, 2026 pushdevelopmentRustLLMOps
$ npx skills add 0xPlaygrounds/rig
#8

Zenml

Similarity 117Trust 94Excellent 100

ZenML 🙏: One AI Platform from Pipelines to Agents. https://zenml.io.

5.4K starsJun 14, 2026 pushdevelopmentPythonLLMOps
$ npx skills add zenml-io/zenml
#9

Envd

Similarity 115Trust 92Excellent 100

🏕️ Reproducible development environment for humans and agents

2.2K starsMay 21, 2026 pushdevelopmentGoLLMOps
$ npx skills add tensorchord/envd
#10

Mlflow

Similarity 112Trust 98Excellent 100

The open source AI engineering platform for agents, LLMs, and ML models. MLflow enables teams of all sizes to debug, evaluate, monitor, and optimize production-quality AI applications while controlling costs and managing access to models and data.

27K starsJun 14, 2026 pushdevelopmentPythonLLMOps
$ npx skills add mlflow/mlflow
#11

Langfuse

Similarity 112Trust 94Excellent 100

🪢 Open source AI engineering platform: LLM evals, observability, metrics, prompt management, playground, datasets. Integrates with OpenTelemetry, LangChain, OpenAI SDK, LiteLLM, and more. 🍊YC W23

29K starsJun 13, 2026 pushdevelopmentTypeScriptLLMOps
$ npx skills add langfuse/langfuse
#12

OpenLLM

Similarity 111Trust 97Excellent 100

Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.

12K starsJun 8, 2026 pushdevelopmentPythonLLMOps
$ npx skills add bentoml/OpenLLM
#13

RagaAI Catalyst

Similarity 111Trust 94Excellent 100

Python SDK for Agent AI Observability, Monitoring and Evaluation Framework. Includes features like agent, llm and tools tracing, debugging multi-agentic system, self-hosted dashboard and advanced analytics with timeline and execution graph view

16K starsFeb 11, 2026 pushdevelopmentPythonLLMOps
$ npx skills add raga-ai-hub/RagaAI-Catalyst
#14

Metaflow

Similarity 110Trust 95Excellent 100

Build, Manage and Deploy AI/ML Systems

10K starsJun 13, 2026 pushdevelopmentPythonLLMOps
$ npx skills add Netflix/metaflow
#15

Helicone

Similarity 109Trust 97Excellent 100

🧊 Open source LLM observability platform. One line of code to monitor, evaluate, and experiment. YC W23 🍓

5.8K starsJun 11, 2026 pushdevelopmentTypeScriptLLMOps
$ npx skills add Helicone/helicone
#16

Phoenix

Similarity 109Trust 91Excellent 100

AI Observability & Evaluation

10K starsJun 14, 2026 pushdevelopmentPythonLLMOps
$ npx skills add Arize-ai/phoenix

How to choose

When should you switch?

Use an alternative when it has a clearer install path, higher trust score, fresher maintenance, or better platform fit for your current agent stack. Keep Evidently if it already passes your workflow test and repository review.

Next step

Compare top candidates side by side

Open the compare page, test the install commands in a sandbox, and check each repository before using a skill in production.