Pdftabextract

REVIEW · 74

Community indexed

A set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents.

Downloads0

Stars2.3K

Version1.0.0

Quality74/100 · Strong

Trust74/100 · Sandbox only

Audit73/100 · Needs review

Supply asset profile

Research and knowledge work

Deep research, source comparison, literature review, RAG, knowledge search, and reports.

Browse track

Scenario

Document processing

I need my agent to read PDFs, extract tables, and turn documents into structured data.

Agent fit

Claude Code + CLI + Codex

Codex, Claude Code, Cursor, CLI, or custom agents.

Install

Ready

npx skills add WZBSocialScienceCenter/pdftabextract

Maintenance

stale

4y since push

Risk

Needs review

Repository appears stale

GitHub quality

2.3K

74/100 quality · 82/100 trust

Coverage tags

ResearchDocument processingdocument-processingocrdocuments

Review notes

Repository appears stale · Repository looks stale

Agent adoption scorecard

Trust, audit, and install readiness at a glance

These scores combine public repository metadata, OpenAgentSkill review signals, maintenance freshness, and install readiness. They are a shortlist signal, not a replacement for human review.

Quality

Strong

Solid option that is likely worth shortlisting for production workflows.

Trust

Sandbox only

Useful candidate with missing or mixed trust signals. Keep it in an isolated workspace until the outcome loop proves task fit.

Audit

Needs review

Install readiness, security metadata, maintenance, and adoption risk.

Trust Score v5

Human review before install

Run only in a sandbox and compare close alternatives before using it for real work.

PythonOCRCodexClaude CodeCursor

Stars

2.3K GitHub stars

Repo activity

2.3K stars, 369 forks

Maintenance

4y since push

License

Apache-2.0

Install

npx skills add WZBSocialScienceCenter/pdftabextract

Install safety

standard package or runtime install path

Permission surface

filesystem or document access

Agent outcomes

No agent outcome data yet

Docs

Strong README/SKILL.md context

Risk summary

Review before production

Repository looks stale
Quality score needs review
Recent maintenance: 4y since push

Install readiness

Install path available

Install path is available
Repository evidence is available
License is declared
No Agent Proven outcome evidence yet

Agent-readable metadata

Machine-readable decision data for this skill.

Use this block or the embedded JSON to decide whether an agent should install this skill, choose an alternative, or ask for human review first.

Open JSON

Suited tasks

Document processing workflows
Claude Code teams
teams that value GitHub adoption signals
Read uploaded files

Suited agents

PythonOCRCodexClaude CodeCursorOpenAgentSkill CLICLI

Install decision

Command: npx skills add WZBSocialScienceCenter/pdftabextract
Policy: review
Human review: yes

Trust and risk

Trust: 74/100
Audit: 73/100
Risk level: Needs review

Outcome loop

Endpoint: /api/agent/outcome
Event ID: resolve
Outcomes: 5

Install command

npx skills add WZBSocialScienceCenter/pdftabextract

Public audit Eval report Resolve API Install handoff

Do not use when

teams that require actively maintained dependencies
production agents without a repository review
Repository looks stale
Repository appears stale
Quality score needs review

Alternative

Markitdown

156.1K stars

npx skills add microsoft/markitdown

Alternative

PaddleOCR

83.1K stars

npx skills add PaddlePaddle/PaddleOCR

Alternative

Stirling PDF

81.2K stars

npx skills add Stirling-Tools/Stirling-PDF

Alternative

Tesseract

74.7K stars

npx skills add tesseract-ocr/tesseract

Agent safety v2

57/100 · Review before install

Experimentalreview

Sparse or mixed signals. Useful for discovery, but not for autonomous installation.

Test manually in an isolated workspace and compare against safer alternatives.

Resolve via API

medium

Network access

Skill likely fetches remote pages, APIs, repositories, or external services.

medium

Filesystem access

Skill may read or write project files, documents, generated artifacts, or local workspace state.

Repository appears stale

Install targets

Install this skill in your agent workflow

Copy the registry command or an agent-specific install prompt for Codex, Claude Code, and Cursor.

skill install

OpenAgentSkill CLI

Use the registry command when your workflow supports the OpenAgentSkill installer.

$ npx skills add WZBSocialScienceCenter/pdftabextract

Agent resolve plan

Let an agent verify fit before installing.

The Resolve API returns the selected skill, alternatives, safety policy, audit notes, install target, and copy-paste prompt an agent can follow without scraping this page.

Open text plan

Resolve JSON

/api/agent/resolve?task=Use%20Pdftabextract%20for%20an%20agent%20workflow&agent=codex&max_risk=medium

Resolve text

/api/agent/resolve?task=Use%20Pdftabextract%20for%20an%20agent%20workflow&agent=codex&max_risk=medium&format=text

Install handoff

/api/skills/wzbsocialsciencecenter-pdftabextract/install

Agent should check

Task fit and alternatives from Resolve API.
Audit score, trust score, and safety policy warnings.
Install target compatibility for Codex, Claude Code, Cursor, or CLI.

Copy prompt

Task: Use Pdftabextract in this workspace.
Resolve first: https://www.openagentskill.com/api/agent/resolve?task=Use%20Pdftabextract%20for%20an%20agent%20workflow&agent=codex&max_risk=medium
Review install handoff: https://www.openagentskill.com/api/skills/wzbsocialsciencecenter-pdftabextract/install
Install command: npx skills add WZBSocialScienceCenter/pdftabextract
Before running it, summarize audit warnings, required permissions, and the fallback skill if install is risky.

Agent handoff

Give an agent the install path, not another directory page.

Use the public install endpoint to fetch the command, safety checklist, target prompts, and canonical links for this skill.

Open install API

Install handoff

/api/skills/wzbsocialsciencecenter-pdftabextract/install

LLM text format

/api/skills/wzbsocialsciencecenter-pdftabextract/install?format=text

Find alternatives

/api/skills/search?q=Pdftabextract&limit=3

Agent prompt

Use Pdftabextract for this task. Review https://www.openagentskill.com/api/skills/wzbsocialsciencecenter-pdftabextract/install, then install with: npx skills add WZBSocialScienceCenter/pdftabextract

Registry metadata

Agent-readable profile for automatic skill selection.

This page exposes the same decision, trust, audit, use-case, and install signals through the Registry API, so agents can rank this skill without scraping the UI.

Open manifest

Manifest

/api/registry/manifest/wzbsocialsciencecenter-pdftabextract

LLM text

/api/registry/manifest/wzbsocialsciencecenter-pdftabextract?format=text

Install alias

/api/registry/install/wzbsocialsciencecenter-pdftabextract

Recommend

/api/registry/recommend?task=Use%20Pdftabextract%20in%20an%20agent%20workflow&limit=3

Agent fit

78/100

Document processing

Use-case tags

Document processing RAG and knowledge Workflow automation

Platforms

Python, OCR, Claude Code

Audit report

Needs review · 73/100

Review install readiness, maintenance, trust, quality, and metadata warnings before adding this skill to an agent workflow.

View audit report View eval report

Agent decision cockpit

Companion skill for Document processing

Shortlist this skill and compare it with close alternatives before production adoption.

Readiness

Shortlist

Stage

Role in stack

Companion skill

Primary fit

Document processing

Trust label

Strong shortlist

Install path

Command ready

Use when

Document processing workflows
Claude Code teams
teams that value GitHub adoption signals

Evidence

2,256 GitHub stars
install command or GitHub repo available
74/100 quality profile
6 OpenAgentSkill engagement events

Review first

Repository looks stale

Implementation path

1Install it in a sandbox agent and run one Document processing task end to end.
2Compare output quality, latency, and failure behavior against at least one alternative.
3Promote it into production only after reviewing repository permissions, license, and maintenance signals.

Trust profile

Sandbox only

Useful candidate with missing or mixed trust signals. Keep it in an isolated workspace until the outcome loop proves task fit.

Trust score

GitHub adoption

PASS

2.3K GitHub stars

Stars/forks activity

PASS

2.3K stars, 369 forks; issue activity unavailable in current metadata

Recent maintenance

FIX

4y since push

License clarity

PASS

Apache-2.0

Good signals

Manually verified listing
AI review approved
Install path is available
Repository evidence is available
Meaningful GitHub adoption signal
Install command has no obvious high-risk pattern
Outcome loop is ready but needs first real agent run

Review before install

Repository looks stale
Quality score needs review
Recent maintenance: 4y since push
No real agent outcome reports yet
Human review required before unattended installation

Recommended action

Run only in a sandbox and compare close alternatives before using it for real work.

Quality profile

Strong candidate for agent workflows

Solid option that is likely worth shortlisting for production workflows.

GitHub stars

2.3K

Freshness

4y ago

Install ready

Yes

License

Apache-2.0

Check before install: Repository looks stale

Workflow fit

Use this skill in these scenarios

Parse messy files

Document processing

I need my agent to read PDFs, extract tables, and turn documents into structured data.

Search private knowledge

RAG and knowledge

I need my agent to build a RAG workflow over documents and retrieve reliable context.

Automate repeated work

Workflow automation

I need my agent to automate a repeated workflow across tools and files.

Stack fit

Add it to a complete workflow

Turn skills into distribution

Content growth agent

A stack for turning newly indexed skills into SEO briefs, social drafts, comparison pages, and reusable publishing workflows.

Ingest, retrieve, and cite

RAG knowledge base

A stack for document-heavy agents that ingest files, create searchable knowledge, retrieve relevant context, and answer with grounded sources.

Inspect, patch, and verify code

Coding review agent

A stack for software agents that inspect repositories, review pull requests, generate tests, and turn findings into shippable patches.

Alternative shortlist

Compare before you install

Similar skills in this category, ranked with the same readiness and quality signals.

Compare all

Markitdown

Python tool for converting files and office documents to Markdown.

PaddleOCR

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

Stirling PDF

#1 PDF Application on GitHub that lets you edit PDFs on any device anywhere

Tesseract

Tesseract Open Source OCR Engine (main repository)

Overview

A set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents.

Imported by the skill-only GitHub discovery pipeline because it matches agent skill, automation, domain workflow, RAG, document-processing, data, finance, security, or developer-tool signals. Protocol-server projects are excluded from automated imports.

Platform Compatibility

pythonFULL

ocrFULL

Technical Details

Version: 1.0.0
License: Apache-2.0
Last Updated: 6/20/2026
Published: 5/23/2026

Frameworks & Tools

PythonOCR

Decision snapshot

Companion skill

Ready

Shortlist

Stage

2,256 GitHub stars

Audit snapshot

Install review

Install and adoption review

Needs review

Security: 89/100
Maintenance: 20/100
Install: 92/100

Open full audit Open eval report

Agent-proven evidence

Agent Proven evidence

Outcome reports after resolve, review, install, and one narrow run.

Proven

Needs first agent runAuto-install: review firstLast: Unknown

Success rate: —
Recent failure: —
Outcomes: 0
Output quality: —
Failed: 0
Not relevant: 0
Installs: 0
Risk blocked: 0
Setup needed: 0
Production: 0

No agent outcome data yet. The first agent run can report success, setup needs, risk blocks, failure, or not-relevant through /api/agent/outcome.

Agent-Proven ranking Outcome contract

Install

Add to agent workflow

Free and open source. Review the audit before production use.

Compare Alternatives Auto-resolve Plan View on GitHub Documentation

Growth loop

Share kit

Scenario-led draft for Pdftabextract, ready for a manual X post.

Curator note

A lot of agents don't need a bigger prompt. They need fresher context.

Pdftabextract feels like a recent-web radar for Codex, Claude Code, Cursor, and research agents.

2.3K stars

https://www.openagentskill.com/skills/wzbsocialsciencecenter-pdftabextract?ref=x
#AIAgents

Open X draft

Optional reply with install command

Listing + install path for Pdftabextract:
https://www.openagentskill.com/skills/wzbsocialsciencecenter-pdftabextract?ref=x

Install: npx skills add WZBSocialScienceCenter/pdftabextract

Open reply draft

Listing source

Community indexed

Claimable

This listing was indexed from public sources and is not marked official until a maintainer claim is approved.

Creator: WZBSocialScienceCenter
Source: WZBSocialScienceCenter/pdftabextract
Indexed by: OpenAgentSkill community index

Attribution links to the public repository or creator profile. Creators can claim the listing to update ownership signals.

Claim this skill

Owner claim

Claim this skill listing

This community indexed listing is attributed to WZBSocialScienceCenter but is not marked official yet. Claim it to add a verified owner signal and make future launch, install, and audit updates easier to trust.

Creator backlink kit

Add the evidence badges to your README

Show the canonical listing, current trust and audit signals, and real Agent Proven evidence where developers evaluate the repository.

[![Listed on OpenAgentSkill](https://www.openagentskill.com/api/badge/wzbsocialsciencecenter-pdftabextract?metric=listed&label=Listed)](https://www.openagentskill.com/skills/wzbsocialsciencecenter-pdftabextract)
[![OpenAgentSkill Trust](https://www.openagentskill.com/api/badge/wzbsocialsciencecenter-pdftabextract?metric=trust&label=Trust)](https://www.openagentskill.com/skills/wzbsocialsciencecenter-pdftabextract)
[![OpenAgentSkill Audit](https://www.openagentskill.com/api/badge/wzbsocialsciencecenter-pdftabextract?metric=audit&label=Audit)](https://www.openagentskill.com/skills/wzbsocialsciencecenter-pdftabextract/audit)
[![Agent Proven](https://www.openagentskill.com/api/badge/wzbsocialsciencecenter-pdftabextract?metric=proven&label=Agent%20Proven)](https://www.openagentskill.com/skills/wzbsocialsciencecenter-pdftabextract)

Preview badge Open audit Creator Kit

Author

WZBSocialScienceCenter✓

@wzbsocialsciencecenter

Platform Fit

Claude Code

Health Signals

GitHub stars: 2.3K
Quality score: 50/100
Last GitHub push: Jun 24, 2022
Framework hints: 2
OpenAgentSkill views: 2
Install copies: 0
Outbound clicks: 0

Community Signal

Share whether this skill looks useful for your agent workflow. Aggregated feedback improves rankings over time.

Trust & Safety

Sandbox only

GitHub adoption2.3K GitHub starsPASS
Stars/forks activity2.3K stars, 369 forks; issue activity unavailable in current metadataPASS
Recent maintenance4y since pushFIX
License clarityApache-2.0PASS
README/SKILL.md completenessMetadata includes enough usage and workflow contextPASS
Dependency/runtime riskno major dependency risk hints in public metadataPASS

Related Skills

Markitdown

Python tool for converting files and office documents to Markdown.

156.1K stars · 0 installs

PaddleOCR

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

83.1K stars · 0 installs

Stirling PDF

#1 PDF Application on GitHub that lets you edit PDFs on any device anywhere

81.2K stars · 0 installs

Tesseract

Tesseract Open Source OCR Engine (main repository)

74.7K stars · 0 installs

Task: Use Pdftabextract in this workspace. Resolve first: https://www.openagentskill.com/api/agent/resolve?task=Use%20Pdftabextract%20for%20an%20agent%20workflow&agent=codex&max_risk=medium Review install handoff: https://www.openagentskill.com/api/skills/wzbsocialsciencecenter-pdftabextract/install Install command: npx skills add WZBSocialScienceCenter/pdftabextract Before running it, summarize audit warnings, required permissions, and the fallback skill if install is risky.

Overview

A set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents.

A lot of agents don't need a bigger prompt. They need fresher context. Pdftabextract feels like a recent-web radar for Codex, Claude Code, Cursor, and research agents. 2.3K stars https://www.openagentskill.com/skills/wzbsocialsciencecenter-pdftabextract?ref=x #AIAgents

[![Listed on OpenAgentSkill](https://www.openagentskill.com/api/badge/wzbsocialsciencecenter-pdftabextract?metric=listed&label=Listed)](https://www.openagentskill.com/skills/wzbsocialsciencecenter-pdftabextract) [![OpenAgentSkill Trust](https://www.openagentskill.com/api/badge/wzbsocialsciencecenter-pdftabextract?metric=trust&label=Trust)](https://www.openagentskill.com/skills/wzbsocialsciencecenter-pdftabextract) [![OpenAgentSkill Audit](https://www.openagentskill.com/api/badge/wzbsocialsciencecenter-pdftabextract?metric=audit&label=Audit)](https://www.openagentskill.com/skills/wzbsocialsciencecenter-pdftabextract/audit) [![Agent Proven](https://www.openagentskill.com/api/badge/wzbsocialsciencecenter-pdftabextract?metric=proven&label=Agent%20Proven)](https://www.openagentskill.com/skills/wzbsocialsciencecenter-pdftabextract)

Pdftabextract

Research and knowledge work

Trust, audit, and install readiness at a glance

Human review before install

Review before production

Install path available

Machine-readable decision data for this skill.

Markitdown

PaddleOCR

Stirling PDF

Tesseract

57/100 · Review before install

Network access

Filesystem access

Install this skill in your agent workflow

OpenAgentSkill CLI

Let an agent verify fit before installing.

Give an agent the install path, not another directory page.

Agent-readable profile for automatic skill selection.

Needs review · 73/100

Companion skill for Document processing

Sandbox only

Strong candidate for agent workflows

Use this skill in these scenarios

Document processing

RAG and knowledge

Workflow automation

Add it to a complete workflow

Content growth agent

RAG knowledge base

Coding review agent

Compare before you install

Markitdown

PaddleOCR

Stirling PDF

Tesseract

Overview

Platform Compatibility

Technical Details

Companion skill

Install review

Agent Proven evidence

Add to agent workflow

Share kit

Community indexed

Claim this skill listing

Add the evidence badges to your README

Author

Tags

Platform Fit

Health Signals

Community Signal

Trust & Safety

Related Skills

Pdftabextract

Research and knowledge work

Trust, audit, and install readiness at a glance

Human review before install

Review before production

Install path available

Machine-readable decision data for this skill.

Markitdown

PaddleOCR

Stirling PDF

Tesseract

57/100 · Review before install

Network access

Filesystem access

Install this skill in your agent workflow

OpenAgentSkill CLI

Let an agent verify fit before installing.

Give an agent the install path, not another directory page.

Agent-readable profile for automatic skill selection.

Needs review · 73/100

Companion skill for Document processing

Sandbox only

Strong candidate for agent workflows

Use this skill in these scenarios

Document processing

RAG and knowledge