Search private knowledge

RAG and knowledge workflow skills

Use these skills to ingest documents, index knowledge, retrieve relevant context, and help agents ground their answers in cited sources.

Try this task

I need my agent to build a RAG workflow over documents and retrieve reliable context.

The agent should be able to

  • Chunk documents
  • Create embeddings
  • Retrieve and cite relevant passages
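
The three capabilities above can be sketched end to end in a few lines. This is a minimal illustration and not the method of any skill listed on this page: the function names (`chunk_text`, `embed`, `build_index`, `retrieve`), the chunk sizes, and the sample documents are all hypothetical, and the "embedding" is a toy bag-of-words count vector standing in for a real embedding model.

```python
import math
import re
from collections import Counter


def chunk_text(text, size=40, overlap=10):
    """Split text into overlapping word windows (sizes are illustrative)."""
    words = text.split()
    step = size - overlap
    chunks = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + size]))
        if start + size >= len(words):
            break  # the last window already reached the end of the text
    return chunks


def embed(text):
    """Toy embedding: term counts. A real workflow would call an embedding model."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def build_index(docs):
    """docs maps a source name to its text; returns (source, chunk_no, text, vector) entries."""
    index = []
    for source, text in docs.items():
        for i, chunk in enumerate(chunk_text(text)):
            index.append((source, i, chunk, embed(chunk)))
    return index


def retrieve(index, query, k=2):
    """Return the k most similar chunks, each with a citation label."""
    q = embed(query)
    ranked = sorted(index, key=lambda e: cosine(q, e[3]), reverse=True)
    return [{"cite": f"{s}#chunk{i}", "text": t, "score": cosine(q, v)} for s, i, t, v in ranked[:k]]


# Hypothetical sample documents, used only to exercise the sketch.
docs = {
    "handbook.txt": "Employees accrue vacation days monthly. Unused vacation days roll over to the next year.",
    "security.txt": "Rotate API keys every ninety days. Store secrets in the vault, never in source code.",
}
index = build_index(docs)
for hit in retrieve(index, "How often should API keys be rotated?", k=1):
    print(hit["cite"], "->", hit["text"])
```

Keeping the source name and chunk number next to each vector is what makes citation possible later: the retriever returns a label such as `security.txt#chunk0` together with the passage text.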

Recommended stack

Turn this use case into a workflow

Workflow map

What to build with these skills

01

Index documents

02

Search a knowledge base

03

Summarize source material

04

Ground answers in retrieved context
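
Step 04 is often the least obvious one, so here is one possible shape for it. This is a sketch under stated assumptions: `build_grounded_prompt` and `invalid_citations` are hypothetical helper names, the prompt wording is only an example, and the passages are assumed to come from whatever retrieval step runs earlier in the workflow.

```python
import re


def build_grounded_prompt(question, passages):
    """Assemble a prompt that restricts the model to numbered sources.

    passages: list of (source_label, text) tuples already returned by retrieval.
    """
    lines = [
        "Answer using only the numbered sources below.",
        "Cite sources as [n]. If the sources do not contain the answer, say so.",
        "",
    ]
    for n, (label, text) in enumerate(passages, start=1):
        lines.append(f"[{n}] ({label}) {text}")
    lines += ["", f"Question: {question}", "Answer:"]
    return "\n".join(lines)


def invalid_citations(answer, passages):
    """Return cited numbers that do not correspond to any supplied passage."""
    cited = {int(n) for n in re.findall(r"\[(\d+)\]", answer)}
    return sorted(n for n in cited if not 1 <= n <= len(passages))


# Hypothetical retrieved passages and model answer, for illustration only.
passages = [
    ("security.txt", "Rotate API keys every ninety days."),
    ("handbook.txt", "Unused vacation days roll over to the next year."),
]
prompt = build_grounded_prompt("How often should API keys be rotated?", passages)
print(prompt)
print(invalid_citations("Keys rotate every ninety days [1]. See also [3].", passages))
```

Checking citations after generation is a cheap guard: a citation number outside the supplied range signals that the answer refers to a source the model was never given.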

Best first installs

Start with high-signal skills

18 matched skills

Milvus

VERIFIED

Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search

44K stars · 76 quality · May 22, 2026 push
$ npx skills add milvus-io/milvus

WeKnora

VERIFIED

Open-source LLM knowledge platform: turn raw documents into a queryable RAG, an autonomous reasoning agent, and a self-maintaining Wiki.

15K stars · 73 quality · May 22, 2026 push
$ npx skills add Tencent/WeKnora

Txtai

VERIFIED

💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows

13K stars · 72 quality · May 22, 2026 push
$ npx skills add neuml/txtai

Skill shortlist

More options for this use case

PaddleOCR

data

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

78K stars · 77 quality

PageIndex

agent-frameworks

📑 PageIndex: Document Index for Vectorless, Reasoning-based RAG

32K stars · 75 quality

Haystack

data

Open-source AI orchestration framework for building context-engineered, production-ready LLM applications. Design modular pipelines and agent workflows with explicit control over retrieval, routing, memory, and generation. Built for scalable agents, RAG, multimodal applications, semantic search, and conversational systems.

25K stars · 74 quality

Langchain Chatchat

data

Langchain-Chatchat (formerly Langchain-ChatGLM): a local-knowledge RAG and Agent app built on Langchain and language models such as ChatGLM, Qwen, and Llama

38K stars · 64 quality

Opendataloader Pdf

data

PDF Parser for AI-ready data. Automate PDF accessibility. Open-source.

22K stars · 74 quality

DocsGPT

data

Private AI platform for agents, assistants and enterprise search. Built-in Agent Builder, Deep research, Document analysis, Multi-model support, and API connectivity for agents.

18K stars · 73 quality

Paper Qa

data

High accuracy RAG for answering questions from scientific documents with citations

8.5K stars · 66 quality

Claude Obsidian

data

Claude + Obsidian knowledge companion. Persistent, compounding wiki vault based on Karpathy's LLM Wiki pattern. /wiki /save /autoresearch

5.4K stars · 68 quality

RAGFlow

data

Build document intelligence and RAG workflows for agents

81K stars · 78 quality

Postgresml

data

Postgres with GPUs for ML/AI apps.

6.8K stars · 58 quality

TencentDB Agent Memory

agent-frameworks

TencentDB Agent Memory delivers fully local long-term memory for AI Agents via a 4-tier progressive pipeline, with zero external API dependencies.

3.9K stars · 67 quality

Graphify

data

AI coding assistant skill (Claude Code, Codex, OpenCode, Cursor, Gemini CLI, and more). Turn any folder of code, SQL schemas, R scripts, shell scripts, docs, papers, images, or videos into a queryable knowledge graph. App code + database schema + infrastructure in one graph.

52K stars · 76 quality

LlamaIndex

data

Connect agents to private data and retrieval workflows

50K stars · 76 quality

Career Ops

agent-frameworks

AI-powered job search system built on Claude Code. 14 skill modes, Go dashboard, PDF generation, batch processing.

47K stars · 76 quality

LightRAG

data

[EMNLP2025] "LightRAG: Simple and Fast Retrieval-Augmented Generation"

36K stars · 75 quality