Alternatives

MinerU Diffusion alternatives for AI agents.

Compare similar skills by workflow fit, trust score, quality, GitHub adoption, maintenance, and install readiness.

Current skill

MinerU Diffusion

A diffusion-based framework for document OCR that replaces autoregressive decoding with block-level parallel diffusion decoding.

Quality 79 · Trust 86 · Stars 598
#1

MinerU

Similarity 158 · Trust 94 · Excellent 100

Transforms complex documents like PDFs and Office docs into LLM-ready markdown/JSON for your Agentic workflows.

68K stars · Jun 17, 2026 push · document-processing · Python · PDF
$ npx skills add opendatalab/MinerU
#2

Dolphin

Similarity 139 · Trust 91 · Excellent 100

The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.

9.0K stars · Mar 25, 2026 push · document-processing · Python · PDF
$ npx skills add bytedance/Dolphin
#3

PyMuPDF

Similarity 132 · Trust 97 · Excellent 100

PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.

10K stars · Jun 18, 2026 push · document-processing · Python · PDF
$ npx skills add pymupdf/PyMuPDF
#4

PdfPig

Similarity 131 · Trust 93 · Excellent 100

Read and extract text and other content from PDFs in C# (port of PDFBox)

2.5K stars · Jun 12, 2026 push · document-processing · C# · PDF
$ npx skills add UglyToad/PdfPig
#5

Paperless Ngx

Similarity 126 · Trust 98 · Excellent 100

A community-supported supercharged document management system: scan, index and archive all your documents

42K stars · Jun 19, 2026 push · document-processing · Python · PDF
$ npx skills add paperless-ngx/paperless-ngx
#6

OCRmyPDF

Similarity 126 · Trust 98 · Excellent 100

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

34K stars · Jun 12, 2026 push · document-processing · Python · PDF
$ npx skills add ocrmypdf/OCRmyPDF
#7

Markitdown

Similarity 126 · Trust 96 · Excellent 100

Python tool for converting files and office documents to Markdown.

156K stars · May 26, 2026 push · document-processing · Python · PDF
$ npx skills add microsoft/markitdown
#8

Docling

Similarity 125 · Trust 92 · Excellent 100

Get your documents ready for gen AI

62K stars · Jun 18, 2026 push · document-processing · Python · PDF
$ npx skills add docling-project/docling
#9

Pdfplumber

Similarity 124 · Trust 94 · Excellent 100

Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.

10K stars · Jan 28, 2026 push · document-processing · Python · PDF
$ npx skills add jsvine/pdfplumber
#10

Pypdf

Similarity 124 · Trust 94 · Excellent 100

A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files

10K stars · Jun 11, 2026 push · document-processing · Python · PDF
$ npx skills add py-pdf/pypdf
#11

Pdf Craft

Similarity 123 · Trust 96 · Excellent 100

PDF craft can convert PDF files into various other formats. This project will focus on processing PDF files of scanned books.

5.8K stars · Jun 6, 2026 push · document-processing · Python · PDF
$ npx skills add oomol-lab/pdf-craft
#12

Pdfarranger

Similarity 123 · Trust 96 · Excellent 100

Small python-gtk application, which helps the user to merge or split PDF documents and rotate, crop and rearrange their pages using an interactive and intuitive graphical interface.

5.6K stars · Jun 3, 2026 push · document-processing · Python · PDF
$ npx skills add pdfarranger/pdfarranger
#13

Kcc

Similarity 123 · Trust 96 · Excellent 100

KCC (a.k.a. Kindle Comic Converter) is a comic and manga converter for ebook readers.

5.2K stars · Jun 1, 2026 push · document-processing · Python · PDF
$ npx skills add ciromattia/kcc
#14

Pdfminer.Six

Similarity 123 · Trust 91 · Excellent 100

Community maintained fork of pdfminer - we fathom PDF

7.0K stars · Mar 13, 2026 push · document-processing · Python · PDF
$ npx skills add pdfminer/pdfminer.six
#15

Malicious Pdf

Similarity 122 · Trust 91 · Excellent 100

💀 Generate malicious PDF test files for testing phone-home callbacks, SSRF, XSS, NTLM credential theft, and data exfiltration in PDF viewers, converters, and web applications. Can be used with Burp Collaborator or Interact.sh

4.1K stars · Jun 4, 2026 push · document-processing · Python · PDF
$ npx skills add jonaslejon/malicious-pdf
#16

Pikepdf

Similarity 121 · Trust 92 · Excellent 100

A Python library for reading and writing PDF, powered by QPDF

2.7K stars · Jun 9, 2026 push · document-processing · Python · PDF
$ npx skills add pikepdf/pikepdf

How to choose

When should you switch?

Use an alternative when it has a clearer install path, higher trust score, fresher maintenance, or better platform fit for your current agent stack. Keep MinerU Diffusion if it already passes your workflow test and repository review.
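
As a rough illustration of two of these criteria (higher trust score and fresher maintenance), the sketch below shortlists a few entries from the listing. The trust scores and push dates are copied from the cards above; the `shortlist` helper and the June 1, 2026 freshness cutoff are hypothetical choices made for this example, not part of any skills tooling. Install path and platform fit still need a manual check.

```python
from datetime import date

# (name, trust score, last push date), copied from the listing above.
CANDIDATES = [
    ("MinerU", 94, date(2026, 6, 17)),
    ("Dolphin", 91, date(2026, 3, 25)),
    ("PyMuPDF", 97, date(2026, 6, 18)),
    ("Paperless Ngx", 98, date(2026, 6, 19)),
    ("Pdfplumber", 94, date(2026, 1, 28)),
]

CURRENT_TRUST = 86  # MinerU Diffusion's trust score from the listing


def shortlist(candidates, min_trust, pushed_since):
    """Hypothetical helper: keep entries whose trust exceeds min_trust
    and whose last push is on or after pushed_since."""
    kept = [c for c in candidates if c[1] > min_trust and c[2] >= pushed_since]
    # Highest trust first; ties broken by the most recent push.
    return sorted(kept, key=lambda c: (c[1], c[2]), reverse=True)


if __name__ == "__main__":
    cutoff = date(2026, 6, 1)  # assumed freshness cutoff, adjust to taste
    for name, trust, pushed in shortlist(CANDIDATES, CURRENT_TRUST, cutoff):
        print(f"{name}: trust {trust}, last push {pushed.isoformat()}")
```

With this cutoff, Dolphin and Pdfplumber drop out on freshness even though both score above the current skill on trust.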

Next step

Compare top candidates side by side

Open the compare page, test the install commands in a sandbox, and check each repository before using a skill in production.