The platform for LLM evaluations and AI agent testing
$ npx skills add langwatch/langwatch
Alternatives
Compare similar skills by workflow fit, trust score, quality, GitHub adoption, maintenance, and install readiness.
Current skill
Laminar - open-source observability platform purpose-built for AI agents. YC S24.
The platform for LLM evaluations and AI agent testing
$ npx skills add langwatch/langwatch
The open source AI engineering platform for agents, LLMs, and ML models. MLflow enables teams of all sizes to debug, evaluate, monitor, and optimize production-quality AI applications while controlling costs and managing access to models and data.
$ npx skills add mlflow/mlflow
🪢 Open source AI engineering platform: LLM evals, observability, metrics, prompt management, playground, datasets. Integrates with OpenTelemetry, LangChain, OpenAI SDK, LiteLLM, and more. 🍊YC W23
$ npx skills add langfuse/langfuse
AI Observability & Evaluation
$ npx skills add Arize-ai/phoenix
Next-generation AI Agent Optimization Platform: Cozeloop addresses challenges in AI agent development by providing full-lifecycle management capabilities from development, debugging, and evaluation to monitoring.
$ npx skills add coze-dev/coze-loop
🧊 Open source LLM observability platform. One line of code to monitor, evaluate, and experiment. YC W23 🍓
$ npx skills add Helicone/helicone
The open-source LLMOps platform: prompt playground, prompt management, LLM evaluation, and LLM observability all in one place.
$ npx skills add Agenta-AI/agenta
Build, Manage and Deploy AI/ML Systems
$ npx skills add Netflix/metaflow
ZenML 🙏: One AI Platform from Pipelines to Agents. https://zenml.io.
$ npx skills add zenml-io/zenml
Evaluation and Tracking for LLM Experiments and AI Agents
$ npx skills add truera/trulens
Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.
$ npx skills add comet-ml/opik
Python SDK for Agent AI Observability, Monitoring and Evaluation Framework. Includes features like agent, llm and tools tracing, debugging multi-agentic system, self-hosted dashboard and advanced analytics with timeline and execution graph view
$ npx skills add raga-ai-hub/RagaAI-Catalyst
Ship AI Agents to Google Cloud in minutes, not months. Production-ready templates with built-in CI/CD, evaluation, and observability.
$ npx skills add GoogleCloudPlatform/agent-starter-pack
⚙️🦀 Build modular and scalable LLM Applications in Rust
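$ npx skills add 0xPlaygrounds/rig
Cube Studio: an open-source, cloud-native, one-stop AI platform for machine learning, deep learning, and large models, covering the full MLOps pipeline. Features include a compute rental platform, online notebook development, drag-and-drop pipeline orchestration, multi-node multi-GPU distributed training, hyperparameter search, inference serving with vGPU virtualization, edge computing, a labeling platform with automated annotation, SFT fine-tuning, reward model, and reinforcement learning training for large models such as DeepSeek, multi-node large-model inference with vLLM/Ollama/MindIE, a private knowledge base, and an AI model marketplace. Supports Chinese domestic CPUs/GPUs/NPUs and the Ascend ecosystem, RDMA, and distributed frameworks such as PyTorch, TF, MXNet, DeepSpeed, Paddle, Colossal-AI, Horovod, Ray, and Volcano.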
$ npx skills add tencentmusic/cube-studio
AGiXT is a dynamic AI Agent Automation Platform that seamlessly orchestrates instruction management and complex task execution across diverse AI providers. Combining adaptive memory, smart features, and a versatile plugin system, AGiXT delivers efficient and comprehensive AI solutions.
$ npx skills add Josh-XT/AGiXT
How to choose
Use an alternative when it has a clearer install path, higher trust score, fresher maintenance, or better platform fit for your current agent stack. Keep Lmnr if it already passes your workflow test and repository review.
Next step
Open the compare page, test the install commands in a sandbox, and check each repository before using a skill in production.