Effortless data labeling with AI support from Segment Anything and other awesome models.
$ npx skills add CVHub520/X-AnyLabeling
Alternatives
Compare similar skills by workflow fit, trust score, quality, GitHub adoption, maintenance, and install readiness.
Current skill
Scene Text Recognition with Permuted Autoregressive Sequence Models (ECCV 2022)
Effortless data labeling with AI support from Segment Anything and other awesome models.
$ npx skills add CVHub520/X-AnyLabeling
docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.
$ npx skills add mindee/doctr
Deep-learning-assisted comic translation tool with one-click machine translation and simple image/text editing. Yet another computer-aided comic/manga translation tool powered by deep learning.
$ npx skills add dmMaze/BallonsTranslator
AI comic and manga translator app/browser extension for automatically translating comics, manga, manhwa, BDs, fumetti, and more in multiple languages and formats (images, PDF, EPUB, CBR, CBZ, etc.).
$ npx skills add ogkalu2/comic-translate
Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks.
$ npx skills add adithya-s-k/omniparse
OpenOCR: an open-source toolkit for general OCR research and applications that integrates a unified training and evaluation benchmark, commercial-grade OCR and document parsing systems, and faithful reproductions of the core implementations from a wide range of academic papers.
$ npx skills add Topdu/OpenOCR
A Python wrapper for the tesseract-ocr API.
$ npx skills add sirfz/tesserocr
OCR engine for all the languages.
$ npx skills add mittagessen/kraken
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
$ npx skills add PaddlePaddle/PaddleOCR
Open-source, free, offline OCR software. Supports screenshots and batch image import, PDF document recognition, exclusion of watermarks and headers/footers, and scanning/generating QR codes. Includes built-in multi-language libraries.
$ npx skills add hiroi-sora/Umi-OCR
Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.
$ npx skills add JaidedAI/EasyOCR
A Unified Toolkit for Deep Learning Based Document Image Analysis.
$ npx skills add Layout-Parser/layout-parser
Translate manga/images: one-click translation of text in all kinds of images. https://cotrans.touhou.ai/ (no longer working)
$ npx skills add zyddnys/manga-image-translator
带带弟弟 (ddddocr): general-purpose CAPTCHA recognition OCR, PyPI version.
$ npx skills add sml2h3/ddddocr
OCR model that handles complex tables, forms, and handwriting with full layout.
$ npx skills add datalab-to/chandra
Optical character recognition for Japanese text, with the main focus being Japanese manga.
$ npx skills add kha-white/manga-ocr
How to choose
Switch to an alternative when it has a clearer install path, a higher trust score, more recent maintenance, or a better platform fit for your current agent stack. Keep Parseq if it already passes your workflow test and repository review.
Next step
Open the compare page, test the install commands in a sandbox, and check each repository before using a skill in production.
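As a rough illustration of trying the install commands in isolation, the sketch below only builds a dry-run plan: one scratch directory and one printable command per repository. The helper names (`install_argv`, `plan_trials`) and the directory naming are hypothetical; only the `npx skills add <owner>/<repo>` form comes from this page. A scratch directory is not real isolation, so a container or VM is still assumed for actual execution.

```python
import re
import shlex
import tempfile
from pathlib import Path

# Assumption: skills are addressed by a GitHub-style "owner/repo" slug,
# as in every command listed on this page.
SLUG = re.compile(r"[A-Za-z0-9_.-]+/[A-Za-z0-9_.-]+")


def install_argv(repo: str) -> list[str]:
    """Build the argv for one install command, rejecting malformed slugs."""
    if not SLUG.fullmatch(repo):
        raise ValueError(f"not an owner/repo slug: {repo!r}")
    return ["npx", "skills", "add", repo]


def plan_trials(repos: list[str], root: Path) -> list[tuple[Path, str]]:
    """Pair each repo with its own scratch directory and a printable command."""
    plan = []
    for repo in repos:
        workdir = root / repo.replace("/", "__")  # hypothetical naming scheme
        plan.append((workdir, shlex.join(install_argv(repo))))
    return plan


if __name__ == "__main__":
    root = Path(tempfile.mkdtemp(prefix="skill-trial-"))
    for workdir, command in plan_trials(["mindee/doctr", "JaidedAI/EasyOCR"], root):
        workdir.mkdir(parents=True)
        # Dry run: the command is printed for review, never executed here.
        print(f"cd {workdir} && {command}")
```

Printing rather than executing keeps the repository review step ahead of any install; each printed line can be run by hand inside whatever sandbox you trust.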