A Repo For Document AI
$ npx skills add deepdoctection/deepdoctection
Alternatives
Compare similar skills by workflow fit, trust score, quality, GitHub adoption, maintenance, and install readiness.
Current skill
A curated list of resources on Document Understanding (DU)
A Repo For Document AI
$ npx skills add deepdoctection/deepdoctection
A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.
$ npx skills add AlibabaResearch/AdvancedLiterateMachinery
zacharywhitley/awesome-ocr is a high-star GitHub project relevant to AI agent workflows.
$ npx skills add zacharywhitley/awesome-ocr
Pure Javascript OCR for more than 100 Languages 📖🎉🖥
$ npx skills add naptha/tesseract.js
Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.
$ npx skills add JaidedAI/EasyOCR
A Unified Toolkit for Deep Learning Based Document Image Analysis
$ npx skills add Layout-Parser/layout-parser
Translate manga/image: one-click translation of text in all kinds of images. https://cotrans.touhou.ai/ (no longer working)
$ npx skills add zyddnys/manga-image-translator
Effortless data labeling with AI support from Segment Anything and other awesome models.
$ npx skills add CVHub520/X-AnyLabeling
A GUI tool for extracting hard-coded subtitles (hardsub) from videos and generating srt files. No third-party API is required; text recognition runs locally. A deep-learning-based video subtitle extraction framework that includes subtitle region detection and subtitle content extraction.
$ npx skills add YaoFANGUK/video-subtitle-extractor
docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.
$ npx skills add mindee/doctr
Yet another computer-aided comic/manga translation tool powered by deep learning, with support for one-click machine translation and simple image/text editing.
$ npx skills add dmMaze/BallonsTranslator
AI comic and manga translator app/browser extension for automatically translating comics, manga, manhwa, BDs, fumetti, and more in multiple languages and formats (Images, PDF, EPUB, CBR, CBZ, etc.).
$ npx skills add ogkalu2/comic-translate
A lightweight OCR system based on PaddleOCR, decoupled from the PaddlePaddle deep learning training framework, with ultra-fast inference speed.
$ npx skills add jingsongliujing/OnnxOCR
OpenOCR: An Open-Source Toolkit for General-OCR Research and Applications that integrates a unified training and evaluation benchmark, commercial-grade OCR and Document Parsing systems, and faithful reproductions of the core implementations from a wide range of academic papers.
$ npx skills add Topdu/OpenOCR
pix2tex: Using a ViT to convert images of equations into LaTeX code.
$ npx skills add lukas-blecher/LaTeX-OCR
An on-premises, OCR-free unstructured data extraction, markdown conversion and benchmarking toolkit. (https://idp-leaderboard.org/)
$ npx skills add NanoNets/docext
How to choose
Use an alternative when it has a clearer install path, higher trust score, fresher maintenance, or better platform fit for your current agent stack. Keep Awesome Document Understanding if it already passes your workflow test and repository review.
Next step
Open the compare page, test the install commands in a sandbox, and check each repository before using a skill in production.
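As a rough aid for the repository check, the sketch below turns an install command from this page into a URL to review by hand. It assumes that the owner/repo identifier in each command names a GitHub repository, which this page suggests but does not state outright; the helper name review_url and the pattern INSTALL_RE are hypothetical and not part of any skills tooling.

```python
import re

# Matches the "owner/repo" identifier in an install command of the form
# "$ npx skills add owner/repo", as the commands appear on this page.
INSTALL_RE = re.compile(r"^\$ npx skills add ([\w.-]+)/([\w.-]+)$")


def review_url(command: str) -> str:
    """Return the assumed GitHub URL for the repository named in an install command."""
    match = INSTALL_RE.match(command.strip())
    if match is None:
        raise ValueError(f"not an install command: {command!r}")
    owner, repo = match.groups()
    # Assumption: the identifier is a GitHub owner/repo pair.
    return f"https://github.com/{owner}/{repo}"


if __name__ == "__main__":
    commands = [
        "$ npx skills add mindee/doctr",
        "$ npx skills add JaidedAI/EasyOCR",
    ]
    for command in commands:
        print(review_url(command))
```

The script only builds URLs; it does not install anything, so it can run outside the sandbox before any install command is tried.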