Transforms complex documents like PDFs and Office docs into LLM-ready markdown/JSON for your Agentic workflows.
Excellent quality, 68K stars, and a 43 use-case fit score.
Best suited scenario
Read uploaded files
$ npx skills add opendatalab/MinerUClaude Code document workflows
Ranked OpenAgentSkill shortlist for Claude Code users parsing PDFs, extracting tables, converting files, and building document intelligence workflows.
Claude Code users building document parsing, research, reporting, and RAG workflows. Ranked from the OpenAgentSkill index using quality, trust, freshness, adoption, and install readiness.
Search intent
These pages are generated from real registry records. The list below is not a generic article; every row links to a skill profile with install, trust, audit, and risk fields.
Transforms complex documents like PDFs and Office docs into LLM-ready markdown/JSON for your Agentic workflows.
Excellent quality, 68K stars, and a 43 use-case fit score.
Best suited scenario
Read uploaded files
$ npx skills add opendatalab/MinerUGet your documents ready for gen AI
Excellent quality, 62K stars, and a 39 use-case fit score.
Best suited scenario
Read uploaded files
$ npx skills add docling-project/doclingConvert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website to learn more about our enterprise grade Platform product for production grade workflows, partitioning, enrichments, chunking and embedding.
Excellent quality, 15K stars, and a 35 use-case fit score.
Best suited scenario
Read uploaded files
$ npx skills add Unstructured-IO/unstructuredA fast, helpful, and open-source document parser
Excellent quality, 10K stars, and a 35 use-case fit score.
Best suited scenario
Read uploaded files
$ npx skills add run-llama/liteparseA pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files
Excellent quality, 10K stars, and a 32 use-case fit score.
Best suited scenario
Read uploaded files
$ npx skills add py-pdf/pypdfAll-in-One Development Tool based on PaddlePaddle
Excellent quality, 6.2K stars, and a 31 use-case fit score.
Best suited scenario
Read uploaded files
$ npx skills add PaddlePaddle/PaddleXPython tool for converting files and office documents to Markdown.
Excellent quality, 156K stars, and a 31 use-case fit score.
Best suited scenario
Read uploaded files
$ npx skills add microsoft/markitdownTurn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
Excellent quality, 83K stars, and a 31 use-case fit score.
Best suited scenario
Read uploaded files
$ npx skills add PaddlePaddle/PaddleOCRA markdown parser and compiler. Built for speed.
Excellent quality, 37K stars, and a 30 use-case fit score.
Best suited scenario
Read uploaded files
$ npx skills add markedjs/markedOCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
Excellent quality, 34K stars, and a 30 use-case fit score.
Best suited scenario
Read uploaded files
$ npx skills add ocrmypdf/OCRmyPDFA modern ebook manager and reader with sync and backup capacities for Windows, macOS, Linux, Android, iOS and Web
Excellent quality, 27K stars, and a 30 use-case fit score.
Best suited scenario
Read uploaded files
$ npx skills add koodo-reader/koodo-readerMarkdown parser, done right. 100% CommonMark support, extensions, syntax plugins & high speed
Excellent quality, 22K stars, and a 29 use-case fit score.
Best suited scenario
Read uploaded files
$ npx skills add markdown-it/markdown-it🪐 Markdown with superpowers: from ideas to papers, presentations, websites, books, and knowledge bases.
Excellent quality, 16K stars, and a 29 use-case fit score.
Best suited scenario
Read uploaded files
$ npx skills add iamgio/quarkdownXournal++ is a handwriting notetaking software with PDF annotation support. Written in C++ with GTK3, supporting Linux (e.g. Ubuntu, Debian, Arch, SUSE), macOS and Windows 10. Supports pen input from devices such as Wacom Tablets.
Excellent quality, 15K stars, and a 29 use-case fit score.
Best suited scenario
Read uploaded files
$ npx skills add xournalpp/xournalppUniversal File Online Preview Project based on Spring-Boot
Excellent quality, 14K stars, and a 29 use-case fit score.
Best suited scenario
Read uploaded files
$ npx skills add kekingcn/kkFileView🖼️ Image Toolbox is a powerful app for advanced image manipulation. It offers dozens of features, from basic tools like crop and draw to filters, OCR, and a wide range of image processing options
Excellent quality, 13K stars, and a 29 use-case fit score.
Best suited scenario
Read uploaded files
$ npx skills add T8RIN/ImageToolboxYour One-Stop Publication Workbench
Excellent quality, 13K stars, and a 29 use-case fit score.
Best suited scenario
Read uploaded files
$ npx skills add Zettlr/ZettlrA developer-friendly API for converting many document formats into PDF files, and more!
Excellent quality, 12K stars, and a 29 use-case fit score.
Best suited scenario
Read uploaded files
$ npx skills add gotenberg/gotenbergOCR model that handles complex tables, forms, handwriting with full layout.
Excellent quality, 11K stars, and a 29 use-case fit score.
Best suited scenario
Read uploaded files
$ npx skills add datalab-to/chandraPyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
Excellent quality, 10K stars, and a 28 use-case fit score.
Best suited scenario
Read uploaded files
$ npx skills add pymupdf/PyMuPDFPDF craft can convert PDF files into various other formats. This project will focus on processing PDF files of scanned books.
Excellent quality, 5.8K stars, and a 28 use-case fit score.
Best suited scenario
Read uploaded files
$ npx skills add oomol-lab/pdf-craft#1 PDF Application on GitHub that lets you edit PDFs on any device anywhere
Excellent quality, 81K stars, and a 28 use-case fit score.
Best suited scenario
Read uploaded files
$ npx skills add Stirling-Tools/Stirling-PDFTesseract Open Source OCR Engine (main repository)
Excellent quality, 75K stars, and a 28 use-case fit score.
Best suited scenario
Read uploaded files
$ npx skills add tesseract-ocr/tesseractA community-supported supercharged document management system: scan, index and archive all your documents
Excellent quality, 42K stars, and a 27 use-case fit score.
Best suited scenario
Read uploaded files
$ npx skills add paperless-ngx/paperless-ngxShareX is a free and open-source application that enables users to capture or record any area of their screen with a single keystroke. It also supports uploading images, text, and various file types to a wide range of destinations.
Excellent quality, 38K stars, and a 27 use-case fit score.
Best suited scenario
Read uploaded files
$ npx skills add ShareX/ShareXPure Javascript OCR for more than 100 Languages 📖🎉🖥
Excellent quality, 38K stars, and a 27 use-case fit score.
Best suited scenario
Read uploaded files
$ npx skills add naptha/tesseract.jsAn ebook reader application supporting PDF, DjVu, EPUB, FB2 and many more formats, running on Cervantes, Kindle, Kobo, PocketBook and Android devices
Excellent quality, 27K stars, and a 27 use-case fit score.
Best suited scenario
Read uploaded files
$ npx skills add koreader/koreaderPDF Parser for AI-ready data. Automate PDF accessibility. Open-source.
Excellent quality, 25K stars, and a 27 use-case fit score.
Best suited scenario
Read uploaded files
$ npx skills add opendataloader-project/opendataloader-pdfReadest is a modern, feature-rich ebook reader designed for avid readers offering seamless cross-platform access, powerful tools, and an intuitive interface to elevate your reading experience.
Excellent quality, 22K stars, and a 26 use-case fit score.
Best suited scenario
Read uploaded files
$ npx skills add readest/readest🌈一个跨平台的划词翻译和OCR软件 | A cross-platform software for text translation and recognition.
Excellent quality, 19K stars, and a 26 use-case fit score.
Best suited scenario
Read uploaded files
$ npx skills add pot-app/pot-desktopSelection method
OpenAgentSkill scores each candidate against the workflow keywords, then balances fit with GitHub stars, quality signals, trust profile, maintenance freshness, and whether there is a clear install path.
The ranking combines workflow fit, quality score, trust profile, GitHub adoption, maintenance freshness, and whether a clear install path exists.
No. Treat the list as a shortlist, open the skill detail page, inspect the repository and license, then test the install command in a sandbox workflow.
Yes. Use /api/skills/search with the related task or /api/agent/rankings?slug=best-claude-code-pdf-parsing-skills to fetch ranked skill data.