Claude Code document workflows

Best Claude Code skills for PDF parsing

Ranked OpenAgentSkill shortlist for Claude Code users parsing PDFs, extracting tables, converting files, and building document intelligence workflows.

Claude Code users building document parsing, research, reporting, and RAG workflows. Ranked from the OpenAgentSkill index using quality, trust, freshness, adoption, and install readiness.

best Claude Code skills for PDF parsingClaude Code
30
Ranked
1.0M
Stars
94
Top trust

Search intent

Find Claude Code-ready skills for PDF parsing, OCR, table extraction, markdown conversion, and document cleanup.

These pages are generated from real registry records. The list below is not a generic article; every row links to a skill profile with install, trust, audit, and risk fields.

#1

MinerU

43 fitTrust 94Excellent 100Audit 94 · Safe to try

Transforms complex documents like PDFs and Office docs into LLM-ready markdown/JSON for your Agentic workflows.

Excellent quality, 68K stars, and a 43 use-case fit score.

Best suited scenario

Read uploaded files

68K starsJun 17, 2026 pushProduction candidatePythonPDF
$ npx skills add opendatalab/MinerU
#2

Docling

39 fitTrust 92Excellent 100Audit 95 · Safe to try

Get your documents ready for gen AI

Excellent quality, 62K stars, and a 39 use-case fit score.

Best suited scenario

Read uploaded files

62K starsJun 18, 2026 pushProduction candidatePythonPDF
$ npx skills add docling-project/docling
#3

Unstructured

35 fitTrust 98Excellent 100Audit 96 · Safe to try

Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website to learn more about our enterprise grade Platform product for production grade workflows, partitioning, enrichments, chunking and embedding.

Excellent quality, 15K stars, and a 35 use-case fit score.

Best suited scenario

Read uploaded files

15K starsJun 18, 2026 pushProduction candidateHTMLPDF
$ npx skills add Unstructured-IO/unstructured
#4

Liteparse

35 fitTrust 95Excellent 100Audit 95 · Safe to try

A fast, helpful, and open-source document parser

Excellent quality, 10K stars, and a 35 use-case fit score.

Best suited scenario

Read uploaded files

10K starsJun 18, 2026 pushProduction candidateRustPDF
$ npx skills add run-llama/liteparse
#5

Pypdf

32 fitTrust 94Excellent 100Audit 94 · Safe to try

A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files

Excellent quality, 10K stars, and a 32 use-case fit score.

Best suited scenario

Read uploaded files

10K starsJun 11, 2026 pushProduction candidatePythonPDF
$ npx skills add py-pdf/pypdf
#6

PaddleX

31 fitTrust 94Excellent 100Audit 95 · Safe to try

All-in-One Development Tool based on PaddlePaddle

Excellent quality, 6.2K stars, and a 31 use-case fit score.

Best suited scenario

Read uploaded files

6.2K starsJun 12, 2026 pushProduction candidatePythonOCR
$ npx skills add PaddlePaddle/PaddleX
#7

Markitdown

31 fitTrust 96Excellent 100Audit 96 · Safe to try

Python tool for converting files and office documents to Markdown.

Excellent quality, 156K stars, and a 31 use-case fit score.

Best suited scenario

Read uploaded files

156K starsMay 26, 2026 pushProduction candidatePythonPDF
$ npx skills add microsoft/markitdown
#8

PaddleOCR

31 fitTrust 98Excellent 100Audit 96 · Safe to try

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

Excellent quality, 83K stars, and a 31 use-case fit score.

Best suited scenario

Read uploaded files

83K starsJun 16, 2026 pushProduction candidatePythonOCR
$ npx skills add PaddlePaddle/PaddleOCR
#9

Marked

30 fitTrust 92Excellent 100Audit 93 · Safe to try

A markdown parser and compiler. Built for speed.

Excellent quality, 37K stars, and a 30 use-case fit score.

Best suited scenario

Read uploaded files

37K starsJun 16, 2026 pushProduction candidateJavaScriptMarkdown
$ npx skills add markedjs/marked
#10

OCRmyPDF

30 fitTrust 98Excellent 100Audit 96 · Safe to try

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

Excellent quality, 34K stars, and a 30 use-case fit score.

Best suited scenario

Read uploaded files

34K starsJun 12, 2026 pushProduction candidatePythonPDF
$ npx skills add ocrmypdf/OCRmyPDF
#11

Koodo Reader

30 fitTrust 98Excellent 100Audit 96 · Safe to try

A modern ebook manager and reader with sync and backup capacities for Windows, macOS, Linux, Android, iOS and Web

Excellent quality, 27K stars, and a 30 use-case fit score.

Best suited scenario

Read uploaded files

27K starsJun 17, 2026 pushProduction candidateJavaScriptPDF
$ npx skills add koodo-reader/koodo-reader
#12

Markdown It

29 fitTrust 98Excellent 100Audit 96 · Safe to try

Markdown parser, done right. 100% CommonMark support, extensions, syntax plugins & high speed

Excellent quality, 22K stars, and a 29 use-case fit score.

Best suited scenario

Read uploaded files

22K starsMay 26, 2026 pushProduction candidateJavaScriptMarkdown
$ npx skills add markdown-it/markdown-it
#13

Quarkdown

29 fitTrust 97Excellent 100Audit 96 · Safe to try

🪐 Markdown with superpowers: from ideas to papers, presentations, websites, books, and knowledge bases.

Excellent quality, 16K stars, and a 29 use-case fit score.

Best suited scenario

Read uploaded files

16K starsJun 17, 2026 pushProduction candidateKotlinPDF
$ npx skills add iamgio/quarkdown
#14

Xournalpp

29 fitTrust 98Excellent 100Audit 96 · Safe to try

Xournal++ is a handwriting notetaking software with PDF annotation support. Written in C++ with GTK3, supporting Linux (e.g. Ubuntu, Debian, Arch, SUSE), macOS and Windows 10. Supports pen input from devices such as Wacom Tablets.

Excellent quality, 15K stars, and a 29 use-case fit score.

Best suited scenario

Read uploaded files

15K starsJun 15, 2026 pushProduction candidateC++PDF
$ npx skills add xournalpp/xournalpp
#15

KkFileView

29 fitTrust 92Excellent 100Audit 93 · Safe to try

Universal File Online Preview Project based on Spring-Boot

Excellent quality, 14K stars, and a 29 use-case fit score.

Best suited scenario

Read uploaded files

14K starsJun 11, 2026 pushProduction candidateJavaPDF
$ npx skills add kekingcn/kkFileView
#16

ImageToolbox

29 fitTrust 97Excellent 100Audit 96 · Safe to try

🖼️ Image Toolbox is a powerful app for advanced image manipulation. It offers dozens of features, from basic tools like crop and draw to filters, OCR, and a wide range of image processing options

Excellent quality, 13K stars, and a 29 use-case fit score.

Best suited scenario

Read uploaded files

13K starsJun 18, 2026 pushProduction candidateKotlinPDF
$ npx skills add T8RIN/ImageToolbox
#17

Zettlr

29 fitTrust 95Excellent 100Audit 95 · Safe to try

Your One-Stop Publication Workbench

Excellent quality, 13K stars, and a 29 use-case fit score.

Best suited scenario

Read uploaded files

13K starsJun 14, 2026 pushProduction candidateTypeScriptPDF
$ npx skills add Zettlr/Zettlr
#18

Gotenberg

29 fitTrust 94Excellent 100Audit 95 · Safe to try

A developer-friendly API for converting many document formats into PDF files, and more!

Excellent quality, 12K stars, and a 29 use-case fit score.

Best suited scenario

Read uploaded files

12K starsJun 19, 2026 pushProduction candidateGoPDF
$ npx skills add gotenberg/gotenberg
#19

Chandra

29 fitTrust 94Excellent 100Audit 93 · Safe to try

OCR model that handles complex tables, forms, handwriting with full layout.

Excellent quality, 11K stars, and a 29 use-case fit score.

Best suited scenario

Read uploaded files

11K starsApr 22, 2026 pushProduction candidatePythonOCR
$ npx skills add datalab-to/chandra
#20

PyMuPDF

28 fitTrust 97Excellent 100Audit 96 · Safe to try

PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.

Excellent quality, 10K stars, and a 28 use-case fit score.

Best suited scenario

Read uploaded files

10K starsJun 18, 2026 pushProduction candidatePythonPDF
$ npx skills add pymupdf/PyMuPDF
#21

Pdf Craft

28 fitTrust 96Excellent 100Audit 96 · Safe to try

PDF craft can convert PDF files into various other formats. This project will focus on processing PDF files of scanned books.

Excellent quality, 5.8K stars, and a 28 use-case fit score.

Best suited scenario

Read uploaded files

5.8K starsJun 6, 2026 pushProduction candidatePythonPDF
$ npx skills add oomol-lab/pdf-craft
#22

Stirling PDF

28 fitTrust 91Excellent 100Audit 93 · Safe to try

#1 PDF Application on GitHub that lets you edit PDFs on any device anywhere

Excellent quality, 81K stars, and a 28 use-case fit score.

Best suited scenario

Read uploaded files

81K starsJun 18, 2026 pushProduction candidateTypeScriptPDF
$ npx skills add Stirling-Tools/Stirling-PDF
#23

Tesseract

28 fitTrust 96Excellent 100Audit 96 · Safe to try

Tesseract Open Source OCR Engine (main repository)

Excellent quality, 75K stars, and a 28 use-case fit score.

Best suited scenario

Read uploaded files

75K starsJun 13, 2026 pushProduction candidateC++OCR
$ npx skills add tesseract-ocr/tesseract
#24

Paperless Ngx

27 fitTrust 98Excellent 100Audit 96 · Safe to try

A community-supported supercharged document management system: scan, index and archive all your documents

Excellent quality, 42K stars, and a 27 use-case fit score.

Best suited scenario

Read uploaded files

42K starsJun 19, 2026 pushProduction candidatePythonPDF
$ npx skills add paperless-ngx/paperless-ngx
#25

ShareX

27 fitTrust 98Excellent 100Audit 96 · Safe to try

ShareX is a free and open-source application that enables users to capture or record any area of their screen with a single keystroke. It also supports uploading images, text, and various file types to a wide range of destinations.

Excellent quality, 38K stars, and a 27 use-case fit score.

Best suited scenario

Read uploaded files

38K starsJun 18, 2026 pushProduction candidateC#OCR
$ npx skills add ShareX/ShareX
#26

Tesseract.Js

27 fitTrust 95Excellent 100Audit 94 · Safe to try

Pure Javascript OCR for more than 100 Languages 📖🎉🖥

Excellent quality, 38K stars, and a 27 use-case fit score.

Best suited scenario

Read uploaded files

38K starsMay 17, 2026 pushProduction candidateJavaScriptOCR
$ npx skills add naptha/tesseract.js
#27

Koreader

27 fitTrust 98Excellent 100Audit 96 · Safe to try

An ebook reader application supporting PDF, DjVu, EPUB, FB2 and many more formats, running on Cervantes, Kindle, Kobo, PocketBook and Android devices

Excellent quality, 27K stars, and a 27 use-case fit score.

Best suited scenario

Read uploaded files

27K starsJun 18, 2026 pushProduction candidateLuaPDF
$ npx skills add koreader/koreader
#28

Opendataloader Pdf

27 fitTrust 95Excellent 100Audit 95 · Safe to try

PDF Parser for AI-ready data. Automate PDF accessibility. Open-source.

Excellent quality, 25K stars, and a 27 use-case fit score.

Best suited scenario

Read uploaded files

25K starsJun 18, 2026 pushProduction candidateJavaRAG
$ npx skills add opendataloader-project/opendataloader-pdf
#29

Readest

26 fitTrust 98Excellent 100Audit 96 · Safe to try

Readest is a modern, feature-rich ebook reader designed for avid readers offering seamless cross-platform access, powerful tools, and an intuitive interface to elevate your reading experience.

Excellent quality, 22K stars, and a 26 use-case fit score.

Best suited scenario

Read uploaded files

22K starsJun 19, 2026 pushProduction candidateTypeScriptPDF
$ npx skills add readest/readest
#30

Pot Desktop

26 fitTrust 97Excellent 100Audit 96 · Safe to try

🌈一个跨平台的划词翻译和OCR软件 | A cross-platform software for text translation and recognition.

Excellent quality, 19K stars, and a 26 use-case fit score.

Best suited scenario

Read uploaded files

19K starsJun 16, 2026 pushProduction candidateJavaScriptOCR
$ npx skills add pot-app/pot-desktop

Selection method

How this list is ranked

OpenAgentSkill scores each candidate against the workflow keywords, then balances fit with GitHub stars, quality signals, trust profile, maintenance freshness, and whether there is a clear install path.

How does OpenAgentSkill rank claude code pdf parsing?

The ranking combines workflow fit, quality score, trust profile, GitHub adoption, maintenance freshness, and whether a clear install path exists.

Should I install the top skill immediately?

No. Treat the list as a shortlist, open the skill detail page, inspect the repository and license, then test the install command in a sandbox workflow.

Can my agent consume this ranking through an API?

Yes. Use /api/skills/search with the related task or /api/agent/rankings?slug=best-claude-code-pdf-parsing-skills to fetch ranked skill data.