Alternatives

Giskard Oss alternatives for AI agents.

Compare similar skills by workflow fit, trust score, quality, GitHub adoption, maintenance, and install readiness.

Current skill

Giskard Oss

🐢 Open-Source Evaluation & Testing library for LLM Agents

100
Quality
94
Trust
5.4K
Stars
#1

Opik

Similarity 134Trust 98Excellent 100

Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.

20K starsJun 13, 2026 pushdevelopmentPythonLLMOps
$ npx skills add comet-ml/opik
#2

Trulens

Similarity 130Trust 93Excellent 100

Evaluation and Tracking for LLM Experiments and AI Agents

3.4K starsJun 12, 2026 pushdevelopmentPythonLLMOps
$ npx skills add truera/trulens
#3

Langfuse

Similarity 128Trust 94Excellent 100

🪢 Open source AI engineering platform: LLM evals, observability, metrics, prompt management, playground, datasets. Integrates with OpenTelemetry, LangChain, OpenAI SDK, LiteLLM, and more. 🍊YC W23

29K starsJun 13, 2026 pushdevelopmentTypeScriptLLMOps
$ npx skills add langfuse/langfuse
#4

Mlflow

Similarity 126Trust 98Excellent 100

The open source AI engineering platform for agents, LLMs, and ML models. MLflow enables teams of all sizes to debug, evaluate, monitor, and optimize production-quality AI applications while controlling costs and managing access to models and data.

27K starsJun 14, 2026 pushdevelopmentPythonLLMOps
$ npx skills add mlflow/mlflow
#5

OpenLLM

Similarity 125Trust 97Excellent 100

Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.

12K starsJun 8, 2026 pushdevelopmentPythonLLMOps
$ npx skills add bentoml/OpenLLM
#6

BentoML

Similarity 124Trust 97Excellent 100

The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!

8.7K starsJun 3, 2026 pushdevelopmentPythonLLMOps
$ npx skills add bentoml/BentoML
#7

Openllmetry

Similarity 124Trust 97Excellent 100

Open-source observability for your GenAI or LLM application, based on OpenTelemetry

7.2K starsJun 12, 2026 pushdevelopmentPythonLLMOps
$ npx skills add traceloop/openllmetry
#8

Helicone

Similarity 117Trust 97Excellent 100

🧊 Open source LLM observability platform. One line of code to monitor, evaluate, and experiment. YC W23 🍓

5.8K starsJun 11, 2026 pushdevelopmentTypeScriptLLMOps
$ npx skills add Helicone/helicone
#9

Rig

Similarity 117Trust 94Excellent 100

⚙️🦀 Build modular and scalable LLM Applications in Rust

7.6K starsJun 12, 2026 pushdevelopmentRustLLMOps
$ npx skills add 0xPlaygrounds/rig
#10

Coze Loop

Similarity 117Trust 97Excellent 100

Next-generation AI Agent Optimization Platform: Cozeloop addresses challenges in AI agent development by providing full-lifecycle management capabilities from development, debugging, and evaluation to monitoring.

5.5K starsJun 14, 2026 pushdevelopmentGoLLMOps
$ npx skills add coze-dev/coze-loop
#11

RagaAI Catalyst

Similarity 117Trust 94Excellent 100

Python SDK for Agent AI Observability, Monitoring and Evaluation Framework. Includes features like agent, llm and tools tracing, debugging multi-agentic system, self-hosted dashboard and advanced analytics with timeline and execution graph view

16K starsFeb 11, 2026 pushdevelopmentPythonLLMOps
$ npx skills add raga-ai-hub/RagaAI-Catalyst
#12

Metaflow

Similarity 116Trust 95Excellent 100

Build, Manage and Deploy AI/ML Systems

10K starsJun 13, 2026 pushdevelopmentPythonLLMOps
$ npx skills add Netflix/metaflow
#13

Agenta

Similarity 116Trust 91Excellent 100

The open-source LLMOps platform: prompt playground, prompt management, LLM evaluation, and LLM observability all in one place.

4.2K starsJun 14, 2026 pushdevelopmentTypeScriptLLMOps
$ npx skills add Agenta-AI/agenta
#14

Langwatch

Similarity 116Trust 93Excellent 100

The platform for LLM evaluations and AI agent testing

3.3K starsJun 14, 2026 pushdevelopmentTypeScriptLLMOps
$ npx skills add langwatch/langwatch
#15

Agent Starter Pack

Similarity 115Trust 97Excellent 100

Ship AI Agents to Google Cloud in minutes, not months. Production-ready templates with built-in CI/CD, evaluation, and observability.

6.5K starsJun 12, 2026 pushdevelopmentPythonLLMOps
$ npx skills add GoogleCloudPlatform/agent-starter-pack
#16

Phoenix

Similarity 115Trust 91Excellent 100

AI Observability & Evaluation

10K starsJun 14, 2026 pushdevelopmentPythonLLMOps
$ npx skills add Arize-ai/phoenix

How to choose

When should you switch?

Use an alternative when it has a clearer install path, higher trust score, fresher maintenance, or better platform fit for your current agent stack. Keep Giskard Oss if it already passes your workflow test and repository review.

Next step

Compare top candidates side by side

Open the compare page, test the install commands in a sandbox, and check each repository before using a skill in production.