Python tool for converting files and office documents to Markdown.
$ npx skills add microsoft/markitdownAlternatives
Compare similar skills by workflow fit, trust score, quality, GitHub adoption, maintenance, and install readiness.
Current skill
A high-quality PDF to Markdown tool based on large language model visual recognition. 一款基于大模型视觉识别的高质量PDF转Markdown工具
Python tool for converting files and office documents to Markdown.
$ npx skills add microsoft/markitdownThe official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.
$ npx skills add bytedance/Dolphinborb is a library for reading, creating and manipulating PDF files in python.
$ npx skills add borb-pdf/borbOCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
$ npx skills add ocrmypdf/OCRmyPDFFile Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.
$ npx skills add QuivrHQ/MegaParseGet your documents ready for gen AI
$ npx skills add docling-project/docling👏极客时间 pdf & markdown 文档
$ npx skills add uaxe/geektime-docsPlumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.
$ npx skills add jsvine/pdfplumberA pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files
$ npx skills add py-pdf/pypdfPDF craft can convert PDF files into various other formats. This project will focus on processing PDF files of scanned books.
$ npx skills add oomol-lab/pdf-craftSmall python-gtk application, which helps the user to merge or split PDF documents and rotate, crop and rearrange their pages using an interactive and intuitive graphical interface.
$ npx skills add pdfarranger/pdfarrangerCommunity maintained fork of pdfminer - we fathom PDF
$ npx skills add pdfminer/pdfminer.sixFast Rust library for PDF inspection, classification, and text extraction. Intelligently detects scanned vs text-based PDFs to enable smart routing decisions.
$ npx skills add firecrawl/pdf-inspectorA Python library for reading and writing PDF, powered by QPDF
$ npx skills add pikepdf/pikepdfA web interface to extract tabular data from PDFs
$ npx skills add camelot-dev/excaliburThin wrapper for "pandoc" (MIT)
$ npx skills add JessicaTegner/pypandocHow to choose
Use an alternative when it has a clearer install path, higher trust score, fresher maintenance, or better platform fit for your current agent stack. Keep Markpdfdown if it already passes your workflow test and repository review.
Next step
Open the compare page, test the install commands in a sandbox, and check each repository before using a skill in production.