Alternatives

Pezzo alternatives for AI agents.

Compare similar skills by workflow fit, trust score, quality, GitHub adoption, maintenance, and install readiness.

Current skill

Pezzo

🕹️ Open-source, developer-first LLMOps platform designed to streamline prompt design, version management, instant delivery, collaboration, troubleshooting, observability and more.

Quality 100 | Trust 93 | Stars 3.2K

#1

Opik

Similarity 136 | Trust 98 | Excellent 100

Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.

20K stars | Jun 13, 2026 push | development | Python | LLMOps
$ npx skills add comet-ml/opik
#2

Langfuse

Similarity 134 | Trust 94 | Excellent 100

🪢 Open source AI engineering platform: LLM evals, observability, metrics, prompt management, playground, datasets. Integrates with OpenTelemetry, LangChain, OpenAI SDK, LiteLLM, and more. 🍊YC W23

29K stars | Jun 13, 2026 push | development | TypeScript | LLMOps
$ npx skills add langfuse/langfuse
#3

Langtrace

Similarity 132 | Trust 89 | Strong 84

Langtrace is an open-source, OpenTelemetry-based end-to-end observability tool for LLM applications, providing real-time tracing, evaluations and metrics for popular LLMs, LLM frameworks, vectorDBs and more. Integrate using TypeScript or Python. 🚀💻📊

1.2K stars | Nov 17, 2025 push | development | TypeScript | LLMOps
$ npx skills add Scale3-Labs/langtrace
#4

Helicone

Similarity 131 | Trust 96 | Excellent 100

🧊 Open source LLM observability platform. One line of code to monitor, evaluate, and experiment. YC W23

5.8K stars | Jun 11, 2026 push | development | TypeScript | LLMOps
$ npx skills add Helicone/helicone
#5

Langwatch

Similarity 130 | Trust 93 | Excellent 100

The platform for LLM evaluations and AI agent testing

3.3K stars | Jun 14, 2026 push | development | TypeScript | LLMOps
$ npx skills add langwatch/langwatch
#6

Mlflow

Similarity 128 | Trust 98 | Excellent 100

The open source AI engineering platform for agents, LLMs, and ML models. MLflow enables teams of all sizes to debug, evaluate, monitor, and optimize production-quality AI applications while controlling costs and managing access to models and data.

27K stars | Jun 14, 2026 push | development | Python | LLMOps
$ npx skills add mlflow/mlflow
#7

Rig

Similarity 125 | Trust 95 | Excellent 100

⚙️🦀 Build modular and scalable LLM Applications in Rust

7.6K stars | Jun 12, 2026 push | development | Rust | LLMOps
$ npx skills add 0xPlaygrounds/rig
#8

Burr

Similarity 123 | Trust 94 | Excellent 100

Build applications that make decisions (chatbots, agents, simulations, etc...). Monitor, trace, persist, and execute on your own infrastructure.

2.4K stars | Jun 13, 2026 push | development | Python | LLMOps
$ npx skills add apache/burr
#9

Lmnr

Similarity 122 | Trust 94 | Excellent 100

Laminar - open-source observability platform purpose-built for AI agents. YC S24.

3.0K stars | Jun 13, 2026 push | development | TypeScript | LLMOps
$ npx skills add lmnr-ai/lmnr
#10

OpenLLM

Similarity 118 | Trust 95 | Excellent 100

Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.

12K stars | Jun 8, 2026 push | development | Python | LLMOps
$ npx skills add bentoml/OpenLLM
#11

Metaflow

Similarity 118 | Trust 95 | Excellent 100

Build, Manage and Deploy AI/ML Systems

10K stars | Jun 13, 2026 push | development | Python | LLMOps
$ npx skills add Netflix/metaflow
#12

BentoML

Similarity 118 | Trust 96 | Excellent 100

The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!

8.7K stars | Jun 3, 2026 push | development | Python | LLMOps
$ npx skills add bentoml/BentoML
#13

Openllmetry

Similarity 117 | Trust 96 | Excellent 100

Open-source observability for your GenAI or LLM application, based on OpenTelemetry

7.2K stars | Jun 12, 2026 push | development | Python | LLMOps
$ npx skills add traceloop/openllmetry
#14

Evidently

Similarity 117 | Trust 95 | Excellent 100

Evidently is an open-source ML and LLM observability framework. Evaluate, test, and monitor any AI-powered system or data pipeline. From tabular data to Gen AI. 100+ metrics.

7.6K stars | May 2, 2026 push | development | Jupyter Notebook | LLMOps
$ npx skills add evidentlyai/evidently
#15

Phoenix

Similarity 117 | Trust 91 | Excellent 100

AI Observability & Evaluation

10K stars | Jun 14, 2026 push | development | Python | LLMOps
$ npx skills add Arize-ai/phoenix
#16

Coze Loop

Similarity 117 | Trust 96 | Excellent 100

Next-generation AI Agent Optimization Platform: Cozeloop addresses challenges in AI agent development by providing full-lifecycle management capabilities from development, debugging, and evaluation to monitoring.

5.5K stars | Jun 14, 2026 push | development | Go | LLMOps
$ npx skills add coze-dev/coze-loop

How to choose

When should you switch?

Use an alternative when it has a clearer install path, higher trust score, fresher maintenance, or better platform fit for your current agent stack. Keep Pezzo if it already passes your workflow test and repository review.
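
The trust and maintenance criteria above can be applied mechanically as a first pass. The following is a minimal sketch in Python, not part of the listing itself: the `Skill` record, the `shortlist` helper, the 90-day freshness window, and the reference date are all assumptions made for illustration. The scores and push dates are copied from the first four entries on this page.

```python
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class Skill:
    """Hypothetical record holding the fields shown in each listing entry."""
    name: str
    trust: int
    last_push: date


# Pezzo's trust score as shown under "Current skill".
CURRENT_TRUST = 93

# Values copied from entries #1 to #4 of the listing.
CANDIDATES = [
    Skill("Opik", 98, date(2026, 6, 13)),
    Skill("Langfuse", 94, date(2026, 6, 13)),
    Skill("Langtrace", 89, date(2025, 11, 17)),
    Skill("Helicone", 96, date(2026, 6, 11)),
]


def shortlist(candidates, current_trust, as_of, max_age_days=90):
    """Keep candidates with a higher trust score and a recent push.

    max_age_days is an assumed threshold; tune it to your own policy.
    """
    fresh = [
        c for c in candidates
        if c.trust > current_trust
        and (as_of - c.last_push).days <= max_age_days
    ]
    # Highest trust first.
    return sorted(fresh, key=lambda c: c.trust, reverse=True)


if __name__ == "__main__":
    for skill in shortlist(CANDIDATES, CURRENT_TRUST, as_of=date(2026, 6, 14)):
        print(skill.name, skill.trust, skill.last_push.isoformat())
```

A filter like this only narrows the list. Install path and platform fit still need a manual check against each repository.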

Next step

Compare top candidates side by side

Open the compare page, test the install commands in a sandbox, and check each repository before using a skill in production.