Alternatives

Unstructured alternatives for AI agents.

Compare similar skills by workflow fit, trust score, quality, GitHub adoption, maintenance, and install readiness.

Current skill

Unstructured

Convert documents to structured data effortlessly. Unstructured is an open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website to learn more about our enterprise-grade Platform product for production-grade workflows, partitioning, enrichments, chunking, and embedding.

Quality 100 · Trust 100 · 15K stars
#1

Deepdoctection

Similarity 133 · Trust 100 · Excellent 100

A Repo For Document AI

3.2K stars · May 15, 2026 push · document-processing · Python · OCR
$ npx skills add deepdoctection/deepdoctection
#2

Docling

Similarity 131 · Trust 100 · Excellent 100

Get your documents ready for gen AI

61K stars · Jun 9, 2026 push · document-processing · Python · PDF
$ npx skills add docling-project/docling
#3

Tesseract.js

Similarity 129 · Trust 100 · Excellent 100

Pure Javascript OCR for more than 100 Languages 📖🎉🖥

38K stars · May 17, 2026 push · document-processing · JavaScript · OCR
$ npx skills add naptha/tesseract.js
#4

Opendataloader Pdf

Similarity 128 · Trust 100 · Excellent 100

PDF Parser for AI-ready data. Automate PDF accessibility. Open-source.

24K stars · Jun 5, 2026 push · document-processing · Java · OCR
$ npx skills add opendataloader-project/opendataloader-pdf
#5

Manga Image Translator

Similarity 127 · Trust 100 · Excellent 100

Translate manga/image. One-click translation of text in all kinds of images. https://cotrans.touhou.ai/ (no longer working)

10.0K stars · May 24, 2026 push · document-processing · Python · OCR
$ npx skills add zyddnys/manga-image-translator
#6

Video Subtitle Extractor

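Similarity 126 · Trust 100 · Excellent 100

A GUI tool for extracting hard-coded subtitles (hardsubs) from videos and generating srt files. No third-party API registration is needed; text recognition runs locally. A deep-learning-based video subtitle extraction framework covering subtitle region detection and subtitle content extraction.

8.9K stars · Apr 9, 2026 push · document-processing · Python · OCR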
$ npx skills add YaoFANGUK/video-subtitle-extractor
#7

docTR

Similarity 126 · Trust 100 · Excellent 100

docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.

6.1K stars · Jun 6, 2026 push · document-processing · Python · OCR
$ npx skills add mindee/doctr
#8

BallonsTranslator

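Similarity 125 · Trust 100 · Excellent 100

Deep-learning-assisted comic translation tool supporting one-click machine translation and simple image/text editing | Yet another computer-aided comic/manga translation tool powered by deep learning

4.8K stars · Jun 9, 2026 push · document-processing · Python · OCR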
$ npx skills add dmMaze/BallonsTranslator
#9

Comic Translate

Similarity 124 · Trust 100 · Excellent 100

AI comic and manga translator app for automatically translating comics, manga, manhwa, BDs, fumetti, and more in multiple languages and formats (Images, PDF, EPUB, CBR, CBZ etc).

2.8K stars · Jun 4, 2026 push · document-processing · Python · OCR
$ npx skills add ogkalu2/comic-translate
#10

Yomitoku

Similarity 123 · Trust 100 · Excellent 98

YomiToku is a Python package that provides an AI-powered document image analysis engine designed specifically for the Japanese language.

1.4K stars · Jun 5, 2026 push · document-processing · Python · OCR
$ npx skills add kotaro-kinoshita/yomitoku
#11

MinerU

Similarity 123 · Trust 100 · Excellent 100

Transforms complex documents like PDFs and Office docs into LLM-ready markdown/JSON for your Agentic workflows.

67K stars · Jun 6, 2026 push · document-processing · Python · PDF
$ npx skills add opendatalab/MinerU
#12

Tesseract

Similarity 121 · Trust 100 · Excellent 100

Tesseract Open Source OCR Engine (main repository)

75K stars · Jun 4, 2026 push · document-processing · C++ · OCR
$ npx skills add tesseract-ocr/tesseract
#13

Umi OCR

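Similarity 121 · Trust 100 · Excellent 100

Free, open-source, offline OCR software. Supports screenshots and batch image import, PDF document recognition, exclusion of watermarks and headers/footers, and scanning/generating QR codes. Includes built-in multilingual libraries.

45K stars · Nov 20, 2025 push · document-processing · Python · OCR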
$ npx skills add hiroi-sora/Umi-OCR
#14

ShareX

Similarity 121 · Trust 100 · Excellent 100

ShareX is a free and open-source application that enables users to capture or record any area of their screen with a single keystroke. It also supports uploading images, text, and various file types to a wide range of destinations.

38K stars · Jun 9, 2026 push · document-processing · C# · OCR
$ npx skills add ShareX/ShareX
#15

OCRmyPDF

Similarity 121 · Trust 100 · Excellent 100

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched.

34K stars · Jun 5, 2026 push · document-processing · Python · OCR
$ npx skills add ocrmypdf/OCRmyPDF
#16

Pot Desktop

Similarity 120 · Trust 100 · Excellent 100

🌈 Cross-platform software for selected-text translation and OCR.

19K stars · May 25, 2026 push · document-processing · JavaScript · OCR
$ npx skills add pot-app/pot-desktop

How to choose

When should you switch?

Use an alternative when it has a clearer install path, higher trust score, fresher maintenance, or better platform fit for your current agent stack. Keep Unstructured if it already passes your workflow test and repository review.
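
One way to apply these criteria is to shortlist candidates by trust score and maintenance freshness before a manual review. The sketch below is illustrative only: the `Candidate` class, the `shortlist` function, the 90-day freshness window, and the reference date are hypothetical choices, and the star counts are rounded from the listing above.

```python
from dataclasses import dataclass
from datetime import date


@dataclass
class Candidate:
    name: str
    trust: int
    stars: int        # rounded, as shown in the listing (e.g. 61K -> 61_000)
    last_push: date


def shortlist(candidates, today, min_trust=100, max_age_days=90):
    """Keep candidates that meet the trust floor and were pushed recently,
    ordered by star count (highest first). Thresholds are assumptions."""
    fresh = [
        c for c in candidates
        if c.trust >= min_trust and (today - c.last_push).days <= max_age_days
    ]
    return sorted(fresh, key=lambda c: c.stars, reverse=True)


# Three entries copied from the listing; the reference date is an assumption.
candidates = [
    Candidate("Docling", 100, 61_000, date(2026, 6, 9)),
    Candidate("MinerU", 100, 67_000, date(2026, 6, 6)),
    Candidate("Umi OCR", 100, 45_000, date(2025, 11, 20)),
]
picked = shortlist(candidates, today=date(2026, 6, 10))
print([c.name for c in picked])  # ['MinerU', 'Docling']
```

Umi OCR drops out here only because its last push falls outside the assumed 90-day window. A filter like this narrows the list; it does not replace testing the install command and reviewing the repository.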

Next step

Compare top candidates side by side

Open the compare page, test the install commands in a sandbox, and check each repository before using a skill in production.