OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
$ npx skills add ocrmypdf/OCRmyPDFAlternatives
Compare similar skills by workflow fit, trust score, quality, GitHub adoption, maintenance, and install readiness.
Current skill
Official Implementation of Web-based Visual Corpus Builder (Webvicob), ICDAR 2023
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
$ npx skills add ocrmypdf/OCRmyPDFOCR model that handles complex tables, forms, handwriting with full layout.
$ npx skills add datalab-to/chandraPure Javascript OCR for more than 100 Languages 📖🎉🖥
$ npx skills add naptha/tesseract.js超轻量级中文ocr,支持竖排文字识别, 支持ncnn、mnn、tnn推理 ( dbnet(1.8M) + crnn(2.5M) + anglenet(378KB)) 总模型仅4.7M
$ npx skills add DayBreak-u/chineseocr_liteTurn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
$ npx skills add PaddlePaddle/PaddleOCR视觉小说翻译器 / Visual Novel Translator
$ npx skills add HIllya51/LunaTranslatorPython tool for converting files and office documents to Markdown.
$ npx skills add microsoft/markitdownTransforms complex documents like PDFs and Office docs into LLM-ready markdown/JSON for your Agentic workflows.
$ npx skills add opendatalab/MinerUA ready-to-go translation ocr tool developed with WPF/WPF 开发的一款即用即走的翻译、OCR工具
$ npx skills add STranslate/STranslatePyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
$ npx skills add pymupdf/PyMuPDFTranslate manga/image 一键翻译各类图片内文字 https://cotrans.touhou.ai/ (no longer working)
$ npx skills add zyddnys/manga-image-translatorEffortless data labeling with AI support from Segment Anything and other awesome models.
$ npx skills add CVHub520/X-AnyLabelingA pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files
$ npx skills add py-pdf/pypdfThe Sphinx documentation generator
$ npx skills add sphinx-doc/sphinxShareX is a free and open-source application that enables users to capture or record any area of their screen with a single keystroke. It also supports uploading images, text, and various file types to a wide range of destinations.
$ npx skills add ShareX/ShareXTesseract Open Source OCR Engine (main repository)
$ npx skills add tesseract-ocr/tesseractHow to choose
Use an alternative when it has a clearer install path, higher trust score, fresher maintenance, or better platform fit for your current agent stack. Keep Webvicob if it already passes your workflow test and repository review.
Next step
Open the compare page, test the install commands in a sandbox, and check each repository before using a skill in production.