OpenOCR
OpenOCR: An Open-Source Toolkit for General-OCR Research and Applications, integrates a unified training and evaluation benchmark, commercial-grade OCR and Document Parsing systems, and faithful reproductions of the core implementations from a wide range of academic papers.
Install with one command
$ npx skills add Topdu/OpenOCRBest for
Research agents
Browse skills for market research, web research, summarization, source comparison, report writing, and evidence-backed analysis.
Choose it when
- You want a GitHub-backed skill with 1.4K stars.
- You need a reusable install command for agents.
- You want to compare it with related marketplace skills.
Check before install
- Pushed 3d ago
- License: Apache-2.0
- Review the repository README and examples.
Quality profile
Excellent candidate for agent workflows
High-confidence pick with strong adoption and healthy maintenance signals.
Workflow fit
Use this skill in these scenarios
Investigate faster
Research agents
I need my agent to research a topic, compare sources, and produce a concise report.
Parse messy files
Document processing
I need my agent to read PDFs, extract tables, and turn documents into structured data.
Search private knowledge
RAG and knowledge
I need my agent to build a RAG workflow over documents and retrieve reliable context.
Stack fit
Add it to a complete workflow
Find, compare, and synthesize
Research report agent
A stack for agents that gather sources, compare claims, summarize long material, and draft useful research briefs.
Ingest, retrieve, and cite
RAG knowledge base
A stack for document-heavy agents that ingest files, create searchable knowledge, retrieve relevant context, and answer with grounded sources.
Inspect, patch, and verify code
Coding review agent
A stack for software agents that inspect repositories, review pull requests, generate tests, and turn findings into shippable patches.
Overview
OpenOCR: An Open-Source Toolkit for General-OCR Research and Applications, integrates a unified training and evaluation benchmark, commercial-grade OCR and Document Parsing systems, and faithful reproductions of the core implementations from a wide range of academic papers.
Imported by the skill-only GitHub discovery pipeline because it matches agent skill, automation, RAG, or developer-tool signals. Protocol-server projects are excluded from automated imports.
Platform Compatibility
Technical Details
- Version
- 1.0.0
- License
- Apache-2.0
- Last Updated
- 5/24/2026
- Published
- 5/24/2026
Frameworks & Tools
Author
Topdu✓
@topdu
Tags
Platform Fit
Health Signals
- GitHub stars
- 1.4K
- Quality score
- 64/100
- Last GitHub push
- May 20, 2026
- Framework hints
- 2
Community Signal
Share whether this skill looks useful for your agent workflow. Aggregated feedback improves rankings over time.
Trust & Safety
- —Open source (public GitHub repo)
- —AI static analysis passed
- —License: Apache-2.0
- —Manually verified by team
Related Skills
Pikepdf
A Python library for reading and writing PDF, powered by QPDF
2.7K stars · 0 installsMaroto
A maroto way to create PDFs. Maroto is inspired in Bootstrap and uses gofpdf. Fast and simple.
2.7K stars · 0 installsPdfPig
Read and extract text and other content from PDFs in C# (port of PDFBox)
2.4K stars · 0 installsDecktape
PDF exporter for HTML presentations
2.4K stars · 0 installs