Transforms complex documents like PDFs and Office docs into LLM-ready markdown/JSON for your Agentic workflows.
$ npx skills add opendatalab/MinerUAlternatives
Compare similar skills by workflow fit, trust score, quality, GitHub adoption, maintenance, and install readiness.
Current skill
Zero-copy PDF text extraction library written in Zig. High-performance, memory-mapped parsing with SIMD acceleration.
Transforms complex documents like PDFs and Office docs into LLM-ready markdown/JSON for your Agentic workflows.
$ npx skills add opendatalab/MinerUA fast, helpful, and open-source document parser
$ npx skills add run-llama/liteparseThe official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.
$ npx skills add bytedance/DolphinCommunity maintained fork of pdfminer - we fathom PDF
$ npx skills add pdfminer/pdfminer.sixA modern PDF library for TypeScript. Parse, modify, and generate PDFs with a clean, intuitive API.
$ npx skills add LibPDF-js/coreA community-supported supercharged document management system: scan, index and archive all your documents
$ npx skills add paperless-ngx/paperless-ngxOCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
$ npx skills add ocrmypdf/OCRmyPDFAn ebook reader application supporting PDF, DjVu, EPUB, FB2 and many more formats, running on Cervantes, Kindle, Kobo, PocketBook and Android devices
$ npx skills add koreader/koreaderSVG file parsing / rendering library
$ npx skills add dompdf/php-svg-libA modern ebook manager and reader with sync and backup capacities for Windows, macOS, Linux, Android, iOS and Web
$ npx skills add koodo-reader/koodo-readerPython tool for converting files and office documents to Markdown.
$ npx skills add microsoft/markitdownFile Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.
$ npx skills add QuivrHQ/MegaParseReadest is a modern, feature-rich ebook reader designed for avid readers offering seamless cross-platform access, powerful tools, and an intuitive interface to elevate your reading experience.
$ npx skills add readest/readestGet your documents ready for gen AI
$ npx skills add docling-project/docling#1 PDF Application on GitHub that lets you edit PDFs on any device anywhere
$ npx skills add Stirling-Tools/Stirling-PDFConvert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website to learn more about our enterprise grade Platform product for production grade workflows, partitioning, enrichments, chunking and embedding.
$ npx skills add Unstructured-IO/unstructuredHow to choose
Use an alternative when it has a clearer install path, higher trust score, fresher maintenance, or better platform fit for your current agent stack. Keep Zpdf if it already passes your workflow test and repository review.
Next step
Open the compare page, test the install commands in a sandbox, and check each repository before using a skill in production.