Decision filters

Choose skills by scenario, quality, and trust signals.

35 skills matching "llmops"

Best blend of quality, stars, freshness, and agent usage

1

Mlflow

VERIFIEDEXCELLENT · 100

The open source AI engineering platform for agents, LLMs, and ML models. MLflow enables teams of all sizes to debug, evaluate, monitor, and optimize production-quality AI applications while controlling costs and managing access to models and data.

$ npx skills add mlflow/mlflow
26.1K stars74 qualityClaude Code + LangChain
High-confidence pick with strong adoption and healthy maintenance signals.
pythonllmops
by mlflowQuick view
2

Opik

VERIFIEDEXCELLENT · 100

Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.

$ npx skills add comet-ml/opik
19.4K stars73 qualityClaude Code + LangChain
High-confidence pick with strong adoption and healthy maintenance signals.
pythonllmops
by comet-mlQuick view
3

Metaflow

VERIFIEDEXCELLENT · 100

Build, Manage and Deploy AI/ML Systems

$ npx skills add Netflix/metaflow
10.1K stars71 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
pythonllmops
by NetflixQuick view
4

Phoenix

VERIFIEDEXCELLENT · 100

AI Observability & Evaluation

$ npx skills add Arize-ai/phoenix
9.8K stars70 qualityClaude Code + LangChain
High-confidence pick with strong adoption and healthy maintenance signals.
pythonllmops
by Arize-aiQuick view
5

Rig

VERIFIEDEXCELLENT · 100

⚙️🦀 Build modular and scalable LLM Applications in Rust

$ npx skills add 0xPlaygrounds/rig
7.4K stars69 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
rustllmops
by 0xPlaygroundsQuick view
6

Plano

VERIFIEDEXCELLENT · 100

Plano is an AI-native proxy and data plane for agentic apps — with built-in orchestration, safety, observability, and smart LLM routing so you stay focused on your agents core logic.

$ npx skills add katanemo/plano
6.5K stars68 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
rustllmops
by katanemoQuick view
7

Helicone

VERIFIEDEXCELLENT · 100

🧊 Open source LLM observability platform. One line of code to monitor, evaluate, and experiment. YC W23 🍓

$ npx skills add Helicone/helicone
5.7K stars68 qualityClaude Code + OpenAI Agents
High-confidence pick with strong adoption and healthy maintenance signals.
typescriptllmops
by HeliconeQuick view
8

Coze Loop

VERIFIEDEXCELLENT · 100

Next-generation AI Agent Optimization Platform: Cozeloop addresses challenges in AI agent development by providing full-lifecycle management capabilities from development, debugging, and evaluation to monitoring.

$ npx skills add coze-dev/coze-loop
5.5K stars68 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
gollmops
by coze-devQuick view
9

Zenml

VERIFIEDEXCELLENT · 100

ZenML 🙏: One AI Platform from Pipelines to Agents. https://zenml.io.

$ npx skills add zenml-io/zenml
5.4K stars68 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
pythonllmops
by zenml-ioQuick view
10

Giskard Oss

VERIFIEDEXCELLENT · 100

🐢 Open-Source Evaluation & Testing library for LLM Agents

$ npx skills add Giskard-AI/giskard-oss
5.4K stars68 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
pythonllmops
by Giskard-AIQuick view
11

Agenta

VERIFIEDEXCELLENT · 100

The open-source LLMOps platform: prompt playground, prompt management, LLM evaluation, and LLM observability all in one place.

$ npx skills add Agenta-AI/agenta
4.1K stars67 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
typescriptllmops
by Agenta-AIQuick view
12

Trulens

VERIFIEDEXCELLENT · 100

Evaluation and Tracking for LLM Experiments and AI Agents

$ npx skills add truera/trulens
3.3K stars66 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
pythonllmops
by trueraQuick view
13

Langwatch

VERIFIEDEXCELLENT · 100

The platform for LLM evaluations and AI agent testing

$ npx skills add langwatch/langwatch
3.3K stars66 qualityClaude Code + OpenAI Agents
High-confidence pick with strong adoption and healthy maintenance signals.
typescriptllmops
by langwatchQuick view
14

AGiXT

VERIFIEDEXCELLENT · 100

AGiXT is a dynamic AI Agent Automation Platform that seamlessly orchestrates instruction management and complex task execution across diverse AI providers. Combining adaptive memory, smart features, and a versatile plugin system, AGiXT delivers efficient and comprehensive AI solutions.

$ npx skills add Josh-XT/AGiXT
3.2K stars66 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
pythonllmops
by Josh-XTQuick view
15

Lmnr

VERIFIEDEXCELLENT · 100

Laminar - open-source observability platform purpose-built for AI agents. YC S24.

$ npx skills add lmnr-ai/lmnr
2.9K stars66 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
typescriptllmops
by lmnr-aiQuick view
16

RagaAI Catalyst

VERIFIEDEXCELLENT · 100

Python SDK for Agent AI Observability, Monitoring and Evaluation Framework. Includes features like agent, llm and tools tracing, debugging multi-agentic system, self-hosted dashboard and advanced analytics with timeline and execution graph view

$ npx skills add raga-ai-hub/RagaAI-Catalyst
16.2K stars66 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
pythonllmops
by raga-ai-hubQuick view
17

Hamilton

VERIFIEDEXCELLENT · 100

Apache Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage/tracing and metadata. Runs and scales everywhere python does.

$ npx skills add apache/hamilton
2.5K stars65 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
jupyter-notebookllmops
by apacheQuick view
18

Envd

VERIFIEDEXCELLENT · 100

🏕️ Reproducible development environment for humans and agents

$ npx skills add tensorchord/envd
2.2K stars65 qualityClaude Code + OpenAI Agents
High-confidence pick with strong adoption and healthy maintenance signals.
gollmops
by tensorchordQuick view
19

Burr

VERIFIEDEXCELLENT · 100

Build applications that make decisions (chatbots, agents, simulations, etc...). Monitor, trace, persist, and execute on your own infrastructure.

$ npx skills add apache/burr
2.0K stars65 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
pythonllmops
by apacheQuick view
20

LLM Engineers Handbook

VERIFIEDEXCELLENT · 100

The LLM's practical guide: From the fundamentals to deploying advanced LLM and RAG apps to AWS using LLMOps best practices

$ npx skills add PacktPublishing/LLM-Engineers-Handbook
5.1K stars65 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
pythonllmops
by PacktPublishingQuick view
21

Llm Twin Course

VERIFIEDEXCELLENT · 100

🤖 𝗟𝗲𝗮𝗿𝗻 for 𝗳𝗿𝗲𝗲 how to 𝗯𝘂𝗶𝗹𝗱 an end-to-end 𝗽𝗿𝗼𝗱𝘂𝗰𝘁𝗶𝗼𝗻-𝗿𝗲𝗮𝗱𝘆 𝗟𝗟𝗠 & 𝗥𝗔𝗚 𝘀𝘆𝘀𝘁𝗲𝗺 using 𝗟𝗟𝗠𝗢𝗽𝘀 best practices: ~ 𝘴𝘰𝘶𝘳𝘤𝘦 𝘤𝘰𝘥𝘦 + 12 𝘩𝘢𝘯𝘥𝘴-𝘰𝘯 𝘭𝘦𝘴𝘴𝘰𝘯𝘴

$ npx skills add decodingai-magazine/llm-twin-course
4.3K stars64 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
pythonllmops
by decodingai-magazineQuick view
22

Chidori

VERIFIEDEXCELLENT · 100

A reactive runtime for building durable AI agents

$ npx skills add ThousandBirdsInc/chidori
1.3K stars64 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
rustllmops
by ThousandBirdsIncQuick view
23

Observal

VERIFIEDEXCELLENT · 98

Observal is an Observability and Evaluation platform for human-in-the-loop agents

$ npx skills add BlazeUp-AI/Observal
1.3K stars64 qualityClaude Code + Cursor
High-confidence pick with strong adoption and healthy maintenance signals.
pythonllmops
by BlazeUp-AIQuick view
24

Intellagent

VERIFIEDEXCELLENT · 100

A framework for comprehensive diagnosis and optimization of agents using simulated, realistic synthetic interactions

$ npx skills add plurai-ai/intellagent
1.2K stars63 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
pythonllmops
by plurai-aiQuick view
25

Dynamiq

VERIFIEDEXCELLENT · 100

Dynamiq is an orchestration framework for agentic AI and LLM applications

$ npx skills add dynamiq-ai/dynamiq
1.1K stars63 qualityClaude Code + OpenAI Agents
High-confidence pick with strong adoption and healthy maintenance signals.
pythonllmops
by dynamiq-aiQuick view
26

Second Brain AI Assistant Course

VERIFIEDEXCELLENT · 100

Learn to build your Second Brain AI assistant with LLMs, agents, RAG, fine-tuning, LLMOps and AI systems techniques.

$ npx skills add decodingai-magazine/second-brain-ai-assistant-course
2.7K stars63 qualityClaude Code + OpenAI Agents
High-confidence pick with strong adoption and healthy maintenance signals.
jupyter-notebookllmops
by decodingai-magazineQuick view
27

Openagent

STRONG · 81

AI Agent Development Platform - Supports multiple models (OpenAI/DeepSeek/Wenxin/Tongyi), knowledge base management, workflow automation, and enterprise-grade security. Built with Flask + Vue3 + LangChain, featuring one-click Docker deployment.

$ npx skills add Haohao-end/openagent
789 stars54 qualityClaude Code + OpenAI Agents
Solid option that is likely worth shortlisting for production workflows.
pythonllmops
by Haohao-endQuick view
28

Swiftide

EXCELLENT · 86

Fast, streaming indexing, query, and agentic LLM applications in Rust

$ npx skills add bosun-ai/swiftide
700 stars54 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
rustllmops
by bosun-aiQuick view
29

Gateway

EXCELLENT · 85

The only fully local production-grade Super SDK that provides a simple, unified, and powerful interface for calling more than 200+ LLMs.

$ npx skills add adaline/gateway
594 stars53 qualityClaude Code + OpenAI Agents
High-confidence pick with strong adoption and healthy maintenance signals.
typescriptai-agents
by adalineQuick view
30

LLMStack

VERIFIEDPROMISING · 69

No-code multi-agent framework to build LLM Agents, workflows and applications with your data

$ npx skills add trypromptly/LLMStack
2.3K stars50 qualityClaude Code
Useful candidate, but compare it with alternatives before adopting.Check: Repository looks stale
pythonllm
by trypromptlyQuick view
31

Contoso Chat

PROMISING · 68

This sample has the full End2End process of creating RAG application with Prompty and Azure AI Foundry. It includes GPT-4 LLM application code, evaluations, deployment automation with AZD CLI, GitHub actions for evaluation and deployment and intent mapping for multiple LLM task mapping.

$ npx skills add Azure-Samples/contoso-chat
761 stars43 qualityClaude Code + OpenAI Agents
Useful candidate, but compare it with alternatives before adopting.
bicepllmops
by Azure-SamplesQuick view
32

Langcorn

PROMISING · 56

⛓️ Serving LangChain LLM apps and agents automagically with FastApi. LLMops

$ npx skills add msoedov/langcorn
939 stars40 qualityClaude Code + OpenAI Agents
Useful candidate, but compare it with alternatives before adopting.Check: Repository looks stale
pythonllmops
by msoedovQuick view
33

NeumAI

PROMISING · 56

Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.

$ npx skills add NeumTry/NeumAI
866 stars39 qualityClaude Code + OpenAI Agents
Useful candidate, but compare it with alternatives before adopting.Check: Repository looks stale
pythonllmops
by NeumTryQuick view
34

Continuous Eval

NEEDS REVIEW · 53

Data-Driven Evaluation for LLM-Powered Applications

$ npx skills add relari-ai/continuous-eval
516 stars38 qualityClaude Code
Inspect the repository carefully before adding it to an agent workflow.Check: Repository looks stale
pythonllmops
by relari-aiQuick view
35

Agency

NEEDS REVIEW · 53

🕵️‍♂️ Library designed for developers eager to explore the potential of Large Language Models (LLMs) and other generative AI through a clean, effective, and Go-idiomatic approach.

$ npx skills add neurocult/agency
508 stars38 qualityClaude Code + OpenAI Agents
Inspect the repository carefully before adding it to an agent workflow.Check: Repository looks stale
gollmops
by neurocultQuick view