OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
$ npx skills add ocrmypdf/OCRmyPDFAlternatives
Compare similar skills by workflow fit, trust score, quality, GitHub adoption, maintenance, and install readiness.
Current skill
PDF craft can convert PDF files into various other formats. This project will focus on processing PDF files of scanned books.
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
$ npx skills add ocrmypdf/OCRmyPDFThe official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.
$ npx skills add bytedance/DolphinSmall python-gtk application, which helps the user to merge or split PDF documents and rotate, crop and rearrange their pages using an interactive and intuitive graphical interface.
$ npx skills add pdfarranger/pdfarranger带带弟弟 通用验证码识别OCR pypi版
$ npx skills add sml2h3/ddddocrOCR model that handles complex tables, forms, handwriting with full layout.
$ npx skills add datalab-to/chandraPython tool for converting files and office documents to Markdown.
$ npx skills add microsoft/markitdownTransforms complex documents like PDFs and Office docs into LLM-ready markdown/JSON for your Agentic workflows.
$ npx skills add opendatalab/MinerUGLM-OCR: Accurate × Fast × Comprehensive
$ npx skills add zai-org/GLM-OCRTransforms PDF, Documents and Images into Enriched Structured Data
$ npx skills add axa-group/ParsrA pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files
$ npx skills add py-pdf/pypdfPyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
$ npx skills add pymupdf/PyMuPDFOCR powered screen-capture tool to capture information instead of images
$ npx skills add dynobo/normcapborb is a library for reading, creating and manipulating PDF files in python.
$ npx skills add borb-pdf/borbA Python library for reading and writing PDF, powered by QPDF
$ npx skills add pikepdf/pikepdfSpecify a github or local repo, github pull request, arXiv or Sci-Hub paper, Youtube transcript or documentation URL on the web and scrape into a text file and clipboard for easier LLM ingestion
$ npx skills add jimmc414/onefilellm在保留版面、公式与结构的前提下进行 PDF 翻译,适用于科研与技术文档
$ npx skills add wxyhgk/retain-pdfHow to choose
Use an alternative when it has a clearer install path, higher trust score, fresher maintenance, or better platform fit for your current agent stack. Keep Pdf Craft if it already passes your workflow test and repository review.
Next step
Open the compare page, test the install commands in a sandbox, and check each repository before using a skill in production.