OpenAgentSkill guide

Best RAG and knowledge skills for AI agents

Use these skills to ingest documents, index knowledge, retrieve relevant context, and make agents better at answering with grounded sources.

When to use this guide

Start from the job, then shortlist the tools.

Index documents

Search a knowledge base

Summarize source material

Ground answers in retrieved context

For each of these jobs, use quality and freshness signals to decide whether a skill belongs in the workflow.
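The jobs above can be sketched as a toy pipeline. This is only an illustration under stated assumptions: the function names (build_index, search, grounded_prompt), the sample documents, and the term-overlap scoring are all hypothetical and are not taken from any skill listed in this guide. Real skills typically use embeddings or a dedicated index instead of word counts, and summarization is left to the model that receives the prompt.

```python
import re
from collections import Counter

def tokenize(text):
    """Lowercase the text and split it into alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())

def build_index(docs):
    """Index documents: map each document id to its term counts."""
    return {doc_id: Counter(tokenize(text)) for doc_id, text in docs.items()}

def search(index, query, k=2):
    """Search: rank document ids by how often they contain the query terms."""
    terms = set(tokenize(query))
    scores = {
        doc_id: sum(counts[t] for t in terms)
        for doc_id, counts in index.items()
    }
    # Highest score first; ties broken by document id for stable output.
    ranked = sorted(scores, key=lambda d: (-scores[d], d))
    return [d for d in ranked if scores[d] > 0][:k]

def grounded_prompt(docs, hits, question):
    """Ground: build a prompt that quotes the retrieved passages with ids."""
    context = "\n".join(f"[{d}] {docs[d]}" for d in hits)
    return f"Answer using only these sources:\n{context}\n\nQuestion: {question}"

# Hypothetical sample documents, for illustration only.
docs = {
    "doc-a": "Vector databases store embeddings for similarity search.",
    "doc-b": "OCR converts scanned pages into text.",
    "doc-c": "Retrieval grounds an answer in source passages.",
}
question = "How does OCR turn scanned pages into text?"

index = build_index(docs)
hits = search(index, question)
print(grounded_prompt(docs, hits, question))
```

Keeping the document id next to each passage in the prompt is what lets an agent cite its sources in the final answer.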

Shortlist

Top skills to evaluate

#1 Milvus · Excellent · 100 · 44K stars

Milvus is a high-performance, cloud-native vector database built for scalable approximate nearest neighbor (ANN) vector search.

Best fit: High-confidence pick with strong adoption and healthy maintenance signals.

#2 WeKnora · Excellent · 100 · 15K stars

Open-source LLM knowledge platform: turn raw documents into a queryable RAG, an autonomous reasoning agent, and a self-maintaining Wiki.

Best fit: High-confidence pick with strong adoption and healthy maintenance signals.

#3 txtai · Excellent · 100 · 13K stars

💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows

Best fit: High-confidence pick with strong adoption and healthy maintenance signals.

#4 PaddleOCR · Excellent · 100 · 78K stars

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

Best fit: High-confidence pick with strong adoption and healthy maintenance signals.

#5 PageIndex · Excellent · 100 · 32K stars

📑 PageIndex: Document Index for Vectorless, Reasoning-based RAG

Best fit: High-confidence pick with strong adoption and healthy maintenance signals.

#6 Haystack · Excellent · 100 · 25K stars

Open-source AI orchestration framework for building context-engineered, production-ready LLM applications. Design modular pipelines and agent workflows with explicit control over retrieval, routing, memory, and generation. Built for scalable agents, RAG, multimodal applications, semantic search, and conversational systems.

Best fit: High-confidence pick with strong adoption and healthy maintenance signals.

#7 Langchain-Chatchat · Excellent · 100 · 38K stars

Langchain-Chatchat (formerly Langchain-ChatGLM): a local-knowledge-based RAG and Agent application built with Langchain and language models such as ChatGLM, Qwen, and Llama.

Best fit: High-confidence pick with strong adoption and healthy maintenance signals.

#8 OpenDataLoader PDF · Excellent · 100 · 22K stars

PDF Parser for AI-ready data. Automate PDF accessibility. Open-source.

Best fit: High-confidence pick with strong adoption and healthy maintenance signals.

#9 DocsGPT · Excellent · 100 · 18K stars

Private AI platform for agents, assistants and enterprise search. Built-in Agent Builder, Deep research, Document analysis, Multi-model support, and API connectivity for agents.

Best fit: High-confidence pick with strong adoption and healthy maintenance signals.

#10 Paper QA · Excellent · 100 · 8.5K stars

High accuracy RAG for answering questions from scientific documents with citations

Best fit: High-confidence pick with strong adoption and healthy maintenance signals.
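Several entries in this shortlist describe themselves in terms of vector or semantic search. As a rough illustration of the underlying idea, the sketch below ranks stored vectors by cosine similarity to a query vector. It is an exact brute-force search in plain Python, not an ANN index and not the API of Milvus, txtai, Haystack, or any other listed project. ANN indexes approximate this ranking to stay fast on large collections. The function names and the three-dimensional vectors are hypothetical stand-ins for real embeddings.

```python
import math

def cosine(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0

def nearest(vectors, query, k=2):
    """Return the keys of the k stored vectors most similar to the query."""
    scored = [(cosine(vec, query), key) for key, vec in vectors.items()]
    # Most similar first; ties broken by key for stable output.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [key for _, key in scored[:k]]

# Hypothetical embeddings for three text chunks, for illustration only.
vectors = {
    "chunk-1": [1.0, 0.0, 0.0],
    "chunk-2": [0.9, 0.1, 0.0],
    "chunk-3": [0.0, 1.0, 0.0],
}

# Cosine similarity ignores vector length, so scaling the query does not
# change the ranking.
top = nearest(vectors, [2.0, 0.0, 0.0])
print(top)
```

The brute-force loop compares the query against every stored vector, which is fine for a few thousand chunks and is the cost that a dedicated vector database is designed to avoid at scale.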

Related stack

Use these skills as part of a workflow.