pdftabextract

VERIFIED

A set of tools for extracting tables from PDF files, intended to support data mining on (OCR-processed) scanned documents.

Downloads: 0
Stars: 2.3K
Version: 1.0.0
Quality: 74/100 · Strong

Install with one command

$ npx skills add WZBSocialScienceCenter/pdftabextract

Best for

Document processing

Find skills for parsing PDFs, extracting tables, running OCR, converting documents, and preparing file content for agent workflows.

Choose it when

  • You want a GitHub-backed skill with 2.3K stars.
  • You need a reusable install command for agents.
  • You want to compare it with related marketplace skills.

Check before install

  • Pushed 4y ago
  • License: Apache-2.0
  • Review the repository README and examples.

Quality profile

Strong candidate for agent workflows

Solid option that is likely worth shortlisting for production workflows.

Quality score: 74
GitHub stars: 2.3K
Freshness: 4y ago
Install ready: Yes
License: Apache-2.0
Check before install: Repository looks stale

Overview

A set of tools for extracting tables from PDF files, intended to support data mining on (OCR-processed) scanned documents.
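
To make the description above more concrete, here is a minimal sketch of one common way to recover table structure from OCR text boxes: cluster the box positions into columns and rows, then place each text into the nearest cell. This is a generic illustration and does not use pdftabextract's own API; the names `cluster_1d`, `nearest_index`, `boxes_to_table`, the `(left, top, text)` input shape, and the `break_dist` default are all assumptions made for this example.

```python
from typing import List, Tuple

# Hypothetical input shape: (left, top, text) for each OCR text box, in pixels.
TextBox = Tuple[float, float, str]


def cluster_1d(values: List[float], break_dist: float) -> List[List[float]]:
    """Group sorted values; start a new group when the gap exceeds break_dist."""
    clusters: List[List[float]] = []
    for v in sorted(values):
        if clusters and v - clusters[-1][-1] <= break_dist:
            clusters[-1].append(v)
        else:
            clusters.append([v])
    return clusters


def nearest_index(centers: List[float], value: float) -> int:
    """Return the index of the center closest to value."""
    return min(range(len(centers)), key=lambda i: abs(centers[i] - value))


def boxes_to_table(boxes: List[TextBox], break_dist: float = 15.0) -> List[List[str]]:
    """Assign each text box to a (row, column) cell based on clustered positions."""
    if not boxes:
        return []
    # Column centers come from clustered left edges, row centers from top edges.
    col_centers = [sum(c) / len(c) for c in cluster_1d([b[0] for b in boxes], break_dist)]
    row_centers = [sum(c) / len(c) for c in cluster_1d([b[1] for b in boxes], break_dist)]
    table = [["" for _ in col_centers] for _ in row_centers]
    for left, top, text in boxes:
        r = nearest_index(row_centers, top)
        c = nearest_index(col_centers, left)
        # Join multiple boxes that land in the same cell with a space.
        table[r][c] = (table[r][c] + " " + text).strip()
    return table


# Made-up sample data with slight OCR jitter in the positions.
sample_boxes = [
    (52.0, 100.0, "Name"), (201.0, 101.0, "Year"),
    (50.0, 140.0, "Berlin"), (199.0, 139.0, "1990"),
    (51.0, 181.0, "Munich"), (200.0, 180.0, "1985"),
]
table = boxes_to_table(sample_boxes)
for row in table:
    print(row)
```

The break distance is the main tuning knob in a sketch like this: too small and jittered positions split one column into several, too large and neighboring columns merge. Real scanned pages usually also need deskewing before positions can be clustered reliably.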

Imported by the skill-only GitHub discovery pipeline because it matches agent skill, automation, RAG, or developer-tool signals. Protocol-server projects are excluded from automated imports.

Platform Compatibility

python: FULL
pdf: FULL

Technical Details

Version: 1.0.0
License: Apache-2.0
Last Updated: 5/24/2026
Published: 5/23/2026

Frameworks & Tools

Python, PDF

Author

WZBSocialScienceCenter

@wzbsocialsciencecenter

Health Signals

GitHub stars: 2.3K
Quality score: 50/100
Last GitHub push: Jun 24, 2022
Framework hints: 2

Trust & Safety

  • Open source (public GitHub repo)
  • AI static analysis passed
  • License: Apache-2.0
  • Manually verified by team