Pdf Inspector
Fast Rust library for PDF inspection, classification, and text extraction. Intelligently detects scanned vs text-based PDFs to enable smart routing decisions.
Install with one command
$ npx skills add firecrawl/pdf-inspectorDecision summary
Production-ready for Document processing
Use this as a leading candidate, then validate the README and install path in your own agent stack.
Best for
- Document processing workflows
- Claude Code teams
- teams that value GitHub adoption signals
Not ideal for
- teams that need a vendor-supported SLA
- high-compliance environments without internal security review
Risk notes
- No OpenAgentSkill engagement data yet
Quality profile
Excellent candidate for agent workflows
High-confidence pick with strong adoption and healthy maintenance signals.
Workflow fit
Use this skill in these scenarios
Parse messy files
Document processing
I need my agent to read PDFs, extract tables, and turn documents into structured data.
Collect structured data
Web scraping
I need my agent to scrape websites and extract structured data from pages.
Search private knowledge
RAG and knowledge
I need my agent to build a RAG workflow over documents and retrieve reliable context.
Stack fit
Add it to a complete workflow
Scrape, clean, and reuse web data
Web data pipeline
A practical stack for agents that crawl public pages, extract clean content, normalize data, and hand it to downstream research or RAG workflows.
Ingest, retrieve, and cite
RAG knowledge base
A stack for document-heavy agents that ingest files, create searchable knowledge, retrieve relevant context, and answer with grounded sources.
Inspect, patch, and verify code
Coding review agent
A stack for software agents that inspect repositories, review pull requests, generate tests, and turn findings into shippable patches.
Overview
Fast Rust library for PDF inspection, classification, and text extraction. Intelligently detects scanned vs text-based PDFs to enable smart routing decisions.
Imported by the skill-only GitHub discovery pipeline because it matches agent skill, automation, RAG, or developer-tool signals. Protocol-server projects are excluded from automated imports.
Platform Compatibility
Technical Details
- Version
- 1.0.0
- License
- Unknown
- Last Updated
- 5/31/2026
- Published
- 5/24/2026
Frameworks & Tools
Claim this skill
Project owners can request ownership review. Approved claims unlock a stronger trust signal.
Author
firecrawl✓
@firecrawl
Platform Fit
Health Signals
- GitHub stars
- 1.4K
- Quality score
- 64/100
- Last GitHub push
- May 30, 2026
- Framework hints
- 2
- OpenAgentSkill views
- 0
- Install copies
- 0
- Outbound clicks
- 0
Community Signal
Share whether this skill looks useful for your agent workflow. Aggregated feedback improves rankings over time.
Trust & Safety
- —Open source (public GitHub repo)
- —AI static analysis passed
- —License: Unknown
- —Manually verified by team
Related Skills
Pikepdf
A Python library for reading and writing PDF, powered by QPDF
2.7K stars · 0 installsMaroto
A maroto way to create PDFs. Maroto is inspired in Bootstrap and uses gofpdf. Fast and simple.
2.7K stars · 0 installsPdfPig
Read and extract text and other content from PDFs in C# (port of PDFBox)
2.4K stars · 0 installsDecktape
PDF exporter for HTML presentations
2.4K stars · 0 installs