The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.
$ npx skills add bytedance/DolphinAlternatives
Compare similar skills by workflow fit, trust score, quality, GitHub adoption, maintenance, and install readiness.
Current skill
Transforms complex documents like PDFs and Office docs into LLM-ready markdown/JSON for your Agentic workflows.
The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.
$ npx skills add bytedance/DolphinPyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
$ npx skills add pymupdf/PyMuPDFGet your documents ready for gen AI
$ npx skills add docling-project/doclingRead and extract text and other content from PDFs in C# (port of PDFBox)
$ npx skills add UglyToad/PdfPigCommunity maintained fork of pdfminer - we fathom PDF
$ npx skills add pdfminer/pdfminer.sixPDF craft can convert PDF files into various other formats. This project will focus on processing PDF files of scanned books.
$ npx skills add oomol-lab/pdf-craft在保留版面、公式与结构的前提下进行 PDF 翻译,适用于科研与技术文档
$ npx skills add wxyhgk/retain-pdfOCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
$ npx skills add ocrmypdf/OCRmyPDFA modern ebook manager and reader with sync and backup capacities for Windows, macOS, Linux, Android, iOS and Web
$ npx skills add koodo-reader/koodo-readerUniversal File Online Preview Project based on Spring-Boot
$ npx skills add kekingcn/kkFileViewYour One-Stop Publication Workbench
$ npx skills add Zettlr/ZettlrA community-supported supercharged document management system: scan, index and archive all your documents
$ npx skills add paperless-ngx/paperless-ngxA pure PHP library for reading and writing word processing documents
$ npx skills add PHPOffice/PHPWordOpen-source office suite pack that comprises all the tools you need to work with documents, spreadsheets, presentations, PDFs, and PDF forms on Windows, Linux, and macOS
$ npx skills add ONLYOFFICE/DesktopEditorsPlumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.
$ npx skills add jsvine/pdfplumberA pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files
$ npx skills add py-pdf/pypdfHow to choose
Use an alternative when it has a clearer install path, higher trust score, fresher maintenance, or better platform fit for your current agent stack. Keep MinerU if it already passes your workflow test and repository review.
Next step
Open the compare page, test the install commands in a sandbox, and check each repository before using a skill in production.