PaddleOCR
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
Install with one command
$ npx skills add PaddlePaddle/PaddleOCR
Best for
RAG and knowledge
Use these skills to ingest documents, index knowledge, retrieve relevant context, and make agents better at answering with grounded sources.
Choose it when
- You want a GitHub-backed skill with 78.4K stars.
- You need a reusable install command for agents.
- You want to compare it with related marketplace skills.
Check before install
- Pushed 3d ago
- License: Apache-2.0
- Review the repository README and examples.
Quality profile
Excellent candidate for agent workflows
High-confidence pick with strong adoption and healthy maintenance signals.
Workflow fit
Use this skill in these scenarios
Search private knowledge
RAG and knowledge
I need my agent to build a RAG workflow over documents and retrieve reliable context.
Parse messy files
Document processing
I need my agent to read PDFs, extract tables, and turn documents into structured data.
Manage repositories
GitHub automation
I need my agent to triage GitHub issues, review pull requests, and summarize repository changes.
Stack fit
Add it to a complete workflow
Ingest, retrieve, and cite
RAG knowledge base
A stack for document-heavy agents that ingest files, create searchable knowledge, retrieve relevant context, and answer with grounded sources.
Inspect, patch, and verify code
Coding review agent
A stack for software agents that inspect repositories, review pull requests, generate tests, and turn findings into shippable patches.
Turn skills into distribution
Content growth agent
A stack for turning newly indexed skills into SEO briefs, social drafts, comparison pages, and reusable publishing workflows.
Overview
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
Imported by the skill-only GitHub discovery pipeline because it matches agent skill, automation, RAG, or developer-tool signals. Protocol-server projects are excluded from automated imports.
Technical Details
- Version: 1.0.0
- License: Apache-2.0
- Last Updated: 5/23/2026
- Published: 5/23/2026
Author
PaddlePaddle✓
@paddlepaddle
Health Signals
- GitHub stars: 78.4K
- Quality score: 77/100
- Last GitHub push: May 19, 2026
- Framework hints: 2
Trust & Safety
- Open source (public GitHub repo)
- AI static analysis passed
- License: Apache-2.0
- Manually verified by team
Related Skills
MarkItDown
Convert documents into Markdown for agent-readable context
124.7K stars · 0 installs
Awesome LLM Apps
100+ AI Agent & RAG apps you can actually run — clone, customize, ship.
111.5K stars · 0 installs
RAGFlow
Build document intelligence and RAG workflows for agents
81.1K stars · 0 installs
LlamaIndex
Connect agents to private data and retrieval workflows
49.6K stars · 0 installs