Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
$ npx skills add microsoft/unilmAlternatives
Compare similar skills by workflow fit, trust score, quality, GitHub adoption, maintenance, and install readiness.
Current skill
Fast GPU OCR server. 270 img/s on FUNSD. TensorRT FP16, PP-OCRv5, HTTP + gRPC.
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
$ npx skills add microsoft/unilmParseBench - A Document Parsing Benchmark for AI Agents
$ npx skills add run-llama/ParseBench在保留版面、公式与结构的前提下进行 PDF 翻译,适用于科研与技术文档
$ npx skills add wxyhgk/retain-pdfTurn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
$ npx skills add PaddlePaddle/PaddleOCRReady-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
$ npx skills add JaidedAI/EasyOCRGet your documents ready for gen AI
$ npx skills add docling-project/doclingConvert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website to learn more about our enterprise grade Platform product for production grade workflows, partitioning, enrichments, chunking and embedding.
$ npx skills add Unstructured-IO/unstructuredTesseract Open Source OCR Engine (main repository)
$ npx skills add tesseract-ocr/tesseractOfficial Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
$ npx skills add clovaai/donutXournal++ is a handwriting notetaking software with PDF annotation support. Written in C++ with GTK3, supporting Linux (e.g. Ubuntu, Debian, Arch, SUSE), macOS and Windows 10. Supports pen input from devices such as Wacom Tablets.
$ npx skills add xournalpp/xournalpp超轻量级中文ocr,支持竖排文字识别, 支持ncnn、mnn、tnn推理 ( dbnet(1.8M) + crnn(2.5M) + anglenet(378KB)) 总模型仅4.7M
$ npx skills add DayBreak-u/chineseocr_lite视觉小说翻译器 / Visual Novel Translator
$ npx skills add HIllya51/LunaTranslatorKnowledge Agents and Management in the Cloud
$ npx skills add run-llama/llama_cloud_services开箱即用的AI标书编写工具,标书AI生成工具,投标工具箱、知识库、标书查重、废标项检查,完全开源免费,欢迎使用
$ npx skills add FB208/OpenBidKit_YibiaoA Repo For Document AI
$ npx skills add deepdoctection/deepdoctectionqpdf: A content-preserving PDF document transformer
$ npx skills add qpdf/qpdfHow to choose
Use an alternative when it has a clearer install path, higher trust score, fresher maintenance, or better platform fit for your current agent stack. Keep TurboOCR if it already passes your workflow test and repository review.
Next step
Open the compare page, test the install commands in a sandbox, and check each repository before using a skill in production.