Alternatives

Dedoc alternatives for AI agents.

Compare similar skills by workflow fit, trust score, quality, GitHub adoption, maintenance, and install readiness.

Current skill

Dedoc

Dedoc is a library (service) for automate documents parsing and bringing to a uniform format. It automatically extracts content, logical structure, tables, and meta information from textual electronic documents. (Parse document; Document content extraction; Logical structure extraction; PDF parser; Scanned document parser; DOCX parser; HTML parser

Quality

Trust

712

Stars

MinerU

Similarity 136Trust 94Excellent 100

Transforms complex documents like PDFs and Office docs into LLM-ready markdown/JSON for your Agentic workflows.

68K starsJun 15, 2026 pushdocument-processingPythonPDF

$ npx skills add opendatalab/MinerU

OpenOCR

Similarity 129Trust 96Excellent 100

OpenOCR: An Open-Source Toolkit for General-OCR Research and Applications, integrates a unified training and evaluation benchmark, commercial-grade OCR and Document Parsing systems, and faithful reproductions of the core implementations from a wide range of academic papers.

1.4K starsMay 20, 2026 pushdocument-processingPythonOCR

$ npx skills add Topdu/OpenOCR

Docext

Similarity 128Trust 91Excellent 98

An on-premises, OCR-free unstructured data extraction, markdown conversion and benchmarking toolkit. (https://idp-leaderboard.org/)

2.0K starsMar 17, 2026 pushdocument-processingPythonOCR

$ npx skills add NanoNets/docext

Docling

Similarity 127Trust 92Excellent 100

Get your documents ready for gen AI

62K starsJun 15, 2026 pushdocument-processingPythonPDF

$ npx skills add docling-project/docling

PHPWord

Similarity 127Trust 90Excellent 100

A pure PHP library for reading and writing word processing documents

7.6K starsMay 18, 2026 pushdocument-processingPHPPDF

$ npx skills add PHPOffice/PHPWord

PaddleOCR

Similarity 126Trust 98Excellent 100

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

82K starsJun 12, 2026 pushdocument-processingPythonOCR

$ npx skills add PaddlePaddle/PaddleOCR

Umi OCR

Similarity 125Trust 91Excellent 100

OCR software, free and offline. 开源、免费的离线OCR软件。支持截屏/批量导入图片，PDF文档识别，排除水印/页眉页脚，扫描/生成二维码。内置多国语言库。

45K starsNov 20, 2025 pushdocument-processingPythonOCR

$ npx skills add hiroi-sora/Umi-OCR

EasyOCR

Similarity 125Trust 91Excellent 100

Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.

30K starsDec 5, 2025 pushdocument-processingPythonOCR

$ npx skills add JaidedAI/EasyOCR

Dolphin

Similarity 125Trust 90Excellent 100

The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.

9.0K starsMar 25, 2026 pushdocument-processingPythonPDF

$ npx skills add bytedance/Dolphin

#10

Manga Image Translator

Similarity 124Trust 98Excellent 100

Translate manga/image 一键翻译各类图片内文字 https://cotrans.touhou.ai/ (no longer working)

10K starsMay 24, 2026 pushdocument-processingPythonOCR

$ npx skills add zyddnys/manga-image-translator

#11

X AnyLabeling

Similarity 124Trust 97Excellent 100

Effortless data labeling with AI support from Segment Anything and other awesome models.

9.4K starsJun 6, 2026 pushdocument-processingPythonOCR

$ npx skills add CVHub520/X-AnyLabeling

#12

Ddddocr

Similarity 124Trust 91Excellent 100

带带弟弟通用验证码识别OCR pypi版

14K starsMar 10, 2026 pushdocument-processingPythonOCR

$ npx skills add sml2h3/ddddocr

#13

Chandra

Similarity 124Trust 93Excellent 100

OCR model that handles complex tables, forms, handwriting with full layout.

11K starsApr 22, 2026 pushdocument-processingPythonOCR

$ npx skills add datalab-to/chandra

#14

Video Subtitle Extractor

Similarity 123Trust 94Excellent 100

视频硬字幕提取，生成srt文件。无需申请第三方API，本地实现文本识别。基于深度学习的视频字幕提取框架，包含字幕区域检测、字幕内容提取。A GUI tool for extracting hard-coded subtitle (hardsub) from videos and generating srt files.

9.0K starsApr 9, 2026 pushdocument-processingPythonOCR

$ npx skills add YaoFANGUK/video-subtitle-extractor

#15

Doctr

Similarity 123Trust 97Excellent 100

docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.

6.1K starsJun 11, 2026 pushdocument-processingPythonOCR

$ npx skills add mindee/doctr

#16

PaddleX

Similarity 123Trust 94Excellent 100

All-in-One Development Tool based on PaddlePaddle

6.2K starsJun 12, 2026 pushdocument-processingPythonOCR

$ npx skills add PaddlePaddle/PaddleX

How to choose

When should you switch?

Use an alternative when it has a clearer install path, higher trust score, fresher maintenance, or better platform fit for your current agent stack. Keep Dedoc if it already passes your workflow test and repository review.

Next step

Compare top candidates side by side

Open the compare page, test the install commands in a sandbox, and check each repository before using a skill in production.