PyMuPDF
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
Install with one command
$ npx skills add pymupdf/PyMuPDF
Decision summary
Production-ready for document processing
Use this as a leading candidate, then validate the README and install path in your own agent stack.
Best for
- Document processing workflows
- Claude Code teams
- teams that value GitHub adoption signals
Not ideal for
- teams that need a vendor-supported SLA
- high-compliance environments without internal security review
Risk notes
- No major risk signals from current metadata
Quality profile
Excellent candidate for agent workflows
High-confidence pick with strong adoption and healthy maintenance signals.
Workflow fit
Use this skill in these scenarios
Parse messy files
Document processing
I need my agent to read PDFs, extract tables, and turn documents into structured data.
Search private knowledge
RAG and knowledge
I need my agent to build a RAG workflow over documents and retrieve reliable context.
Manage repositories
GitHub automation
I need my agent to triage GitHub issues, review pull requests, and summarize repository changes.
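The "Parse messy files" scenario above (read PDFs, extract tables, produce structured data) might look roughly like the sketch below. It is an illustration, not the skill's own implementation: `rows_to_records` and `extract_document` are hypothetical helper names, and the code assumes a PyMuPDF release recent enough to provide `import pymupdf` and `Page.find_tables()`. It also assumes the first row of each detected table is a header, which will not hold for every document.

```python
from typing import Any


def rows_to_records(rows: list[list[Any]]) -> list[dict[str, Any]]:
    """Treat the first row as the header and map each later row to a dict."""
    if not rows:
        return []
    header = [str(cell or "").strip() for cell in rows[0]]
    return [dict(zip(header, row)) for row in rows[1:]]


def extract_document(path: str) -> list[dict[str, Any]]:
    """Return one entry per page with its plain text and any detected tables."""
    # Third-party dependency (pip install pymupdf); older releases use
    # `import fitz`. Imported here so rows_to_records stays usable without it.
    import pymupdf

    pages = []
    with pymupdf.open(path) as doc:
        for page in doc:
            # find_tables() returns a finder whose .tables list holds the hits;
            # extract() gives each table as a list of rows of cell values.
            tables = [rows_to_records(t.extract()) for t in page.find_tables().tables]
            pages.append({
                "page": page.number + 1,  # page.number is 0-based
                "text": page.get_text(),
                "tables": tables,
            })
    return pages
```

Calling `extract_document("report.pdf")` (a placeholder path) would return a list of per-page dictionaries that can be serialized to JSON or passed to a downstream step. Table detection is heuristic, so the output should be spot-checked on representative files.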
Stack fit
Add it to a complete workflow
Ingest, retrieve, and cite
RAG knowledge base
A stack for document-heavy agents that ingest files, create searchable knowledge, retrieve relevant context, and answer with grounded sources.
Inspect, patch, and verify code
Coding review agent
A stack for software agents that inspect repositories, review pull requests, generate tests, and turn findings into shippable patches.
Scrape, clean, and reuse web data
Web data pipeline
A practical stack for agents that crawl public pages, extract clean content, normalize data, and hand it to downstream research or RAG workflows.
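For the "Ingest, retrieve, and cite" stack above, one way to keep answers grounded is to carry the page number along with every chunk of text. The sketch below is a minimal, dependency-free illustration under stated assumptions: `Chunk` and `chunk_pages` are hypothetical names, the 800-character limit is arbitrary, and `page_texts` is assumed to hold one string per page (for example, collected with PyMuPDF via `[page.get_text() for page in doc]`).

```python
from dataclasses import dataclass


@dataclass
class Chunk:
    source: str  # file name or other document identifier
    page: int    # 1-based page number, kept for citations
    text: str


def chunk_pages(source: str, page_texts: list[str], max_chars: int = 800) -> list[Chunk]:
    """Pack paragraphs into chunks of at most max_chars, never crossing a page."""
    chunks: list[Chunk] = []
    for page_no, text in enumerate(page_texts, start=1):
        current = ""
        for para in (p.strip() for p in text.split("\n\n")):
            if not para:
                continue
            candidate = f"{current}\n\n{para}" if current else para
            if len(candidate) <= max_chars:
                current = candidate
                continue
            # The paragraph does not fit: flush what has been collected so far.
            if current:
                chunks.append(Chunk(source, page_no, current))
            # Hard-split paragraphs that are longer than the limit on their own.
            while len(para) > max_chars:
                chunks.append(Chunk(source, page_no, para[:max_chars]))
                para = para[max_chars:]
            current = para
        if current:
            chunks.append(Chunk(source, page_no, current))
    return chunks
```

Each `Chunk` can then be embedded and indexed by whatever retrieval layer the stack uses, and the `source` and `page` fields let the agent cite where a retrieved passage came from. Splitting on blank lines is a simplification; real PDFs often need layout-aware segmentation.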
Overview
Imported by the skill-only GitHub discovery pipeline because it matches agent skill, automation, RAG, or developer-tool signals. Protocol-server projects are excluded from automated imports.
Technical Details
- Version: 1.0.0
- License: AGPL-3.0
- Last Updated: 6/7/2026
- Published: 6/5/2026
Author
pymupdf✓
@pymupdf
Health Signals
- GitHub stars: 9.9K
- Quality score: 70/100
- Last GitHub push: Jun 5, 2026
- Framework hints: 2
- OpenAgentSkill views: 3
- Install copies: 0
- Outbound clicks: 0
Trust & Safety
- Open source (public GitHub repo)
- AI static analysis passed
- License: AGPL-3.0
- Manually verified by team
Related Skills
Stirling PDF
#1 PDF Application on GitHub that lets you edit PDFs on any device anywhere
80.3K stars · 0 installs
Tesseract
Tesseract Open Source OCR Engine (main repository)
74.6K stars · 0 installs
MinerU
Transforms complex documents like PDFs and Office docs into LLM-ready markdown/JSON for your Agentic workflows.
66.8K stars · 0 installs
Docling
Get your documents ready for gen AI
61.1K stars · 0 installs