189 skills matching "doc"
Best blend of quality, stars, freshness, and agent usage
Convert documents into Markdown for agent-readable context
$ npx skills add microsoft/markitdown
Build document intelligence and RAG workflows for agents
$ npx skills add infiniflow/ragflow
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
$ npx skills add PaddlePaddle/PaddleOCR
Installable GitHub library of 1,273+ agentic skills for Claude Code, Cursor, Codex CLI, Gemini CLI, Antigravity, and more. Includes installer CLI, bundles, workflows, and official/community skill collections.
$ npx skills add sickn33/antigravity-awesome-skills
AI coding assistant skill (Claude Code, Codex, OpenCode, Cursor, Gemini CLI, and more). Turn any folder of code, SQL schemas, R scripts, shell scripts, docs, papers, images, or videos into a queryable knowledge graph. App code + database schema + infrastructure in one graph.
$ npx skills add safishamsi/graphify
🎨 Local-first, open-source alternative to Anthropic's Claude Design. ⚡ 19 Skills · ✨ 71 brand-grade Design Systems 🖼 Generate web · desktop · mobile prototypes · slides · images · videos · HyperFrames 📦 Sandboxed preview · HTML/PDF/PPTX/MP4 export 🤖 Runs on Claude Code / Codex / Cursor / Gemini / OpenCode / Qwen / Copilot / Hermes / Kimi CLI.
$ npx skills add nexu-io/open-design
A curated list of awesome skills, hooks, slash-commands, agent orchestrators, applications, and plugins for Claude Code by Anthropic
$ npx skills add hesreallyhim/awesome-claude-code
ToolJet is the open-source foundation of ToolJet AI - the enterprise app generation platform for building internal tools, dashboards, business applications, workflows and AI agents 🚀
$ npx skills add ToolJet/ToolJet
📑 PageIndex: Document Index for Vectorless, Reasoning-based RAG
$ npx skills add VectifyAI/PageIndex
A set of ready to use Agent Skills for research, science, engineering, analysis, finance and writing.
$ npx skills add K-Dense-AI/claude-scientific-skills
PDF Parser for AI-ready data. Automate PDF accessibility. Open-source.
$ npx skills add opendataloader-project/opendataloader-pdf
Specification and documentation for Agent Skills
$ npx skills add agentskills/agentskills
AI generates natively editable PPTX from any document: real PowerPoint shapes with native animations, not images · by Hugo He
$ npx skills add hugohe3/ppt-master
Claude Code Skills and 380+ agent skills from official dev teams and the community, compatible with Codex, Antigravity, Gemini CLI, Cursor and others.
$ npx skills add VoltAgent/awesome-agent-skills
The container platform tailored for Kubernetes multi-cloud, datacenter, and edge management ⎈ 🖥 ☁️
$ npx skills add kubesphere/kubesphere
Private AI platform for agents, assistants and enterprise search. Built-in Agent Builder, Deep research, Document analysis, Multi-model support, and API connectivity for agents.
$ npx skills add arc53/DocsGPT
+192 Claude Code skills & agent plugins for Claude Code, Codex, Gemini CLI, Cursor, and 8 more coding agents: engineering, marketing, product, compliance, C-level advisory.
$ npx skills add alirezarezvani/claude-skills
Open-source LLM knowledge platform: turn raw documents into a queryable RAG, an autonomous reasoning agent, and a self-maintaining Wiki.
$ npx skills add Tencent/WeKnora
newspaper3k is a news, full-text, and article metadata extraction in Python 3. Advanced docs:
$ npx skills add codelucas/newspaper
Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (gpt, claude, gemini, llama, qwen, mistral). Get started - free.
$ npx skills add khoj-ai/khoj
All parts of Claude Code's system prompt, 27 builtin tool descriptions, sub agent prompts (Plan/Explore/Task), utility prompts (CLAUDE.md, compact, statusline, magic docs, WebFetch, Bash cmd, security review, agent creation). Updated for each Claude Code version.
$ npx skills add Piebald-AI/claude-code-system-prompts
PM Skills Marketplace: 100+ agentic skills, commands, and plugins, from discovery to strategy, execution, launch, and growth.
$ npx skills add phuryn/pm-skills
An open-source RAG-based tool for chatting with your documents.
$ npx skills add Cinnamon/kotaemon
280+ free n8n automation templates: ready-to-use workflows for Gmail, Telegram, Slack, Discord, WhatsApp, Google Drive, Notion, OpenAI, and more. AI agents, RAG chatbots, email automation, social media, DevOps, and document processing. The largest open-source n8n template collection.
$ npx skills add enescingoz/awesome-n8n-templates
A lightweight, lightning-fast, in-process vector database
$ npx skills add alibaba/zvec
Anthony Fu's curated collection of agent skills.
$ npx skills add antfu/skills
Python API for JMComic | Provides a Python API for accessing JMComic (禁漫天堂), supporting both the web and mobile endpoints | GitHub Actions downloader for JMComic 🚀
$ npx skills add hect0x7/JMComic-Crawler-Python
AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automation
$ npx skills add Marker-Inc-Korea/AutoRAG
🦛 CHONK docs with Chonkie ✨ - The lightweight ingestion library for fast, efficient and robust RAG pipelines
$ npx skills add chonkie-inc/chonkie
The first open-source agent skills builder. Define skills by vibe workflow, run on Claude Code, Cursor, Codex & more. Build Clawdbot 🦞 · APIs for Lovable · Bots for Slack & Lark/Feishu · Skills are infrastructure, not prompts.
$ npx skills add refly-ai/refly
Easiest and laziest way for building multi-agent LLM applications.
$ npx skills add LazyAGI/LazyLLM
The most accurate document search and store for building AI apps
$ npx skills add morphik-org/morphik-core
High accuracy RAG for answering questions from scientific documents with citations
$ npx skills add Future-House/paper-qa
README file generator, powered by AI.
$ npx skills add eli64s/readme-ai
https://spatie.be/docs/crawler
$ npx skills add spatie/crawler
A Python library for reading and writing PDF, powered by QPDF
$ npx skills add pikepdf/pikepdf
A maroto way to create PDFs. Maroto is inspired by Bootstrap and uses gofpdf. Fast and simple.
$ npx skills add johnfercher/maroto
A curated list of skills, tools, tutorials, and capabilities for AI coding agents (Claude, Codex, Antigravity, Copilot, VS Code)
$ npx skills add heilcheng/awesome-agent-skills
Apache Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows that encode lineage/tracing and metadata. Runs and scales everywhere Python does.
$ npx skills add apache/hamilton
Read and extract text and other content from PDFs in C# (port of PDFBox)
$ npx skills add UglyToad/PdfPig
Fulling is an AI-powered Full-stack Engineer Agent. Built with Next.js, Claude, shadcn/ui, and PostgreSQL. Uses Kubernetes as infra.
$ npx skills add FullAgent/fulling
PDF exporter for HTML presentations
$ npx skills add astefanutti/decktape
Open-source AI video workspace powered by AI agents (Nano Banana 2 & Veo 3.1 / Grok / Seedance / OpenAI): novel → character/scene/prop design → script → storyboard → video, with consistent characters and scenes across shots
$ npx skills add ArcReel/ArcReel
Distributed vector search for AI-native applications
$ npx skills add vearch/vearch
iText for Java represents the next level of SDKs for developers that want to take advantage of the benefits PDF can bring. Equipped with a better document engine, high and low-level programming capabilities and the ability to create, edit and enhance PDF documents, iText can be a boon to nearly every workflow.
$ npx skills add itext/itext-java
Assist in organizing your piles of documents, resulting from scanners, e-mails and other sources with minimal effort.
$ npx skills add eikek/docspell
🏕️ Reproducible development environment for humans and agents
$ npx skills add tensorchord/envd
Document scanning app
$ npx skills add ossappscollective/OSS-DocumentScanner
Distributed web crawler admin platform for managing spiders regardless of language or framework.
$ npx skills add crawlab-team/crawlab
A search engine that "just works" for Obsidian. Supports OCR and PDF indexing.
$ npx skills add scambier/obsidian-omnisearch
📐⚙ 2D vector line drawing and shape modeling for CNC and laser cutters.
$ npx skills add microsoft/maker.js
EdgeQuake 🌋 High-performance GraphRAG inspired by LightRag, written in Rust; Transform documents into intelligent knowledge graphs for superior retrieval and generation
$ npx skills add raphaelmansuy/edgequake
PDF editor for Windows. Install or run portable. GPLv3. No account, no subscription, no telemetry.
$ npx skills add SteveTheKiller/KillerPDF
iText for .NET is the .NET version of the iText library, formerly known as iTextSharp, which it replaces. iText represents the next level of SDKs for developers that want to take advantage of the benefits PDF can bring. Equipped with a better document engine, high and low-level programming capabilities and the ability to create, edit and enha
$ npx skills add itext/itext-dotnet
Comic and Manga reader, written with Node.js and using Electron
$ npx skills add ollm/OpenComic
PHP PDF Library (official TCPDF successor)
$ npx skills add tecnickcom/tc-lib-pdf
Minimal PDF creation library. <400 LOC, zero dependencies, makes real PDFs.
$ npx skills add Lulzx/tinypdf
Vector graphics in Go
$ npx skills add tdewolff/canvas
A web interface to extract tabular data from PDFs
$ npx skills add camelot-dev/excalibur
A <Pdf /> component for react-native
$ npx skills add wonday/react-native-pdf
Rust Bindings for the Skia Graphics Library
$ npx skills add rust-skia/rust-skia
The SILE Typesetter: Simon’s Improved Layout Engine
$ npx skills add sile-typesetter/sile
PDF translation that preserves layout, formulas, and structure, suited to scientific and technical documents
$ npx skills add wxyhgk/retain-pdf
Opinionated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM: GPT4, Groq, Llama. Any Vectorstore: PGVector, Faiss. Any Files. Anyway you want.
$ npx skills add QuivrHQ/quivr
A modern PDF library for TypeScript. Parse, modify, and generate PDFs with a clean, intuitive API.
$ npx skills add LibPDF-js/core
Open-source manga translation tool based on manga-image-translator. Supports automatic translation of Japanese, Korean, and American comics, ships with 5 translation engines including OpenAI and Gemini, and provides a visual editor for freely adjusting text styles. One-click install, works out of the box. If you like it, a ⭐ Star is welcome!
$ npx skills add hgmzhn/manga-translator-ui
An extensible Markdown Editor, Viewer and Weblog Publisher for Windows
$ npx skills add RickStrahl/MarkdownMonster
Interactive architecture diagrams for codebases
$ npx skills add CodeBoarding/CodeBoarding
Rust library to read, manipulate and write PDF files.
$ npx skills add pdf-rs/pdf
Local AI anywhere, for everyone: LLM inference, chat UI, voice, agents, workflows, RAG, and image generation. No cloud, no subscriptions.
$ npx skills add Light-Heart-Labs/DreamServer
Building blocks for rapid development of GenAI applications
$ npx skills add deepsense-ai/ragbits
PageLM is a community driven version of NotebookLM & an education platform that transforms study materials into interactive resources like quizzes, flashcards, notes, and podcasts.
$ npx skills add CaviraOSS/PageLM
🤖 Learn for free how to build an end-to-end production-ready LLM & RAG system using LLMOps best practices: ~ source code + 12 hands-on lessons
$ npx skills add decodingai-magazine/llm-twin-course
Database Reporting Tool and Tasks (.Net)
$ npx skills add ariacom/Seal-Report
A lightning fast image processing and resizing library for Go
$ npx skills add davidbyttow/govips
E-books for learning computer science
$ npx skills add tolerious/Programming_learning_resource
MORT translator project - Real-time game translator with OCR
$ npx skills add killkimno/MORT
A lightweight 2D graphics library for modern GPUs, delivering high-performance text, image, and vector rendering across major platforms.
$ npx skills add Tencent/tgfx
Next-gen phpDoc parser with support for intersection types and generics
$ npx skills add phpstan/phpdoc-parser
Declarative way to run AI models in React Native on device, powered by ExecuTorch.
$ npx skills add software-mansion/react-native-executorch
Python wrapper for the arXiv API
$ npx skills add lukasschwab/arxiv.py
Collection of open-source libraries and tools for Robotic Process Automation (RPA), designed to be used with both Robot Framework and Python
$ npx skills add robocorp/rpaframework
YomiToku is a Python package providing an AI-powered document image analysis engine designed specifically for the Japanese language.
$ npx skills add kotaro-kinoshita/yomitoku
Versatile PDF creation and manipulation for Ruby
$ npx skills add gettalong/hexapdf
📰 Binary distribution of PDFium
$ npx skills add bblanchon/pdfium-binaries
Open source PDF editor.
$ npx skills add JakubMelka/PDF4QT
OpenOCR: An Open-Source Toolkit for General-OCR Research and Applications, integrates a unified training and evaluation benchmark, commercial-grade OCR and Document Parsing systems, and faithful reproductions of the core implementations from a wide range of academic papers.
$ npx skills add Topdu/OpenOCR
A self-hosted file conversion server & share tool that supports 445 file formats in 13 languages.
$ npx skills add zelon88/HRConvert2
JasperReports® - Free Java Reporting Library
$ npx skills add Jaspersoft/jasperreports
Docker image that provides static analysis tools for PHP
$ npx skills add jakzal/phpqa
JavaScript based business reporting platform 🚀
$ npx skills add jsreport/jsreport
An iOS OCR Server Using Apple’s Vision Framework
$ npx skills add riddleling/iOS-OCR-Server
A curated collection of practical AI projects implementing OCR systems, RAG, AI agents, and other AI use cases.
$ npx skills add Sumanth077/Hands-On-AI-Engineering
Mouseover Translate Any Language At Once - Chrome Extension: PDF Translator, EBOOK, EPUB, OCR, TTS, NETFLIX, YOUTUBE DUAL SUBTITLES, GOOGLE DOCS, AI, VIEWER, GMAIL, WRITING, IMAGE, DUAL SUBS, MANGA, HOVER, DICTIONARY, WEBTOON, EDGE, JAPANESE, ENGLISH
$ npx skills add ttop32/MouseTooltipTranslator
🌝 MLKit is a powerful, easy-to-use toolkit. With ML Kit you can easily implement text recognition, barcode scanning, image labeling, face detection, object detection, and more.
$ npx skills add jenly1314/MLKit
A group of notebooks and other files which can help you learn AI from scratch.
$ npx skills add Ramakm/ai-hands-on
Run a high-fidelity browser-based web archiving crawler in a single Docker container
$ npx skills add webrecorder/browsertrix-crawler
SPEC-First Agentic Development Kit for Claude Code: 24 AI agents + 52 skills with TDD/DDD quality gates, 16-language projects, 4-language docs. Go CLI, zero deps.
$ npx skills add modu-ai/moai-adk
PDF references add-on for Zotero.
$ npx skills add MuiseDestiny/zotero-reference
OCR engine for all the languages
$ npx skills add mittagessen/kraken
Cross-platform desktop GUI app to clean image metadata
$ npx skills add szTheory/exifcleaner
AI coding workstation: Claude Code + web UI + 7 AI CLIs + headless browser + 50+ tools
$ npx skills add CoderLuii/HolyClaude
Converts binary PDF to JSON and text, for server-side PDF processing and command-line use. Zero dependency.
$ npx skills add modesty/pdf2json
An on-premises, OCR-free unstructured data extraction, markdown conversion and benchmarking toolkit. (https://idp-leaderboard.org/)
$ npx skills add NanoNets/docext
Create chatbots with ease
$ npx skills add n4ze3m/dialoqbase
Selfhosted PDF manager, viewer and editor offering a seamless user experience on multiple devices.
$ npx skills add mrmn2/PdfDing
Enjoy reading with your favorite style.
$ npx skills add jesselau76/ebook-GPT-translator
Document reader
$ npx skills add baskerville/plato
Trench: Open-Source Analytics Infrastructure. A single production-ready Docker image built on ClickHouse, Kafka, and Node.js for tracking events. Easily build product analytics dashboards, LLM RAGs, observability platforms, or any other analytics product.
$ npx skills add FrigadeHQ/trench
Tool for making vertically typeset e-books in the style of woodblock-printed Chinese classics (Chinese Ancient eBooks Generator)
$ npx skills add shanleiguang/vRain
Get clean data from tricky documents, powered by vision-language models ⚡
$ npx skills add emcf/thepipe
PDF creation module for Dart/Flutter
$ npx skills add DavBfr/dart_pdf
A CLI text-to-speech tool using the Kokoro model, supporting multiple languages, voices (with blending), and various input formats including EPUB books and PDF documents.
$ npx skills add nazdridoy/kokoro-tts
Convert a PDF to an image
$ npx skills add spatie/pdf-to-image
Bulk downloader for WeChat Official Account articles, with export of read counts and comment data. No environment setup required; supports online use, private Docker deployment, and Cloudflare deployment. Exports to multiple formats; the HTML format reproduces article layout and styling 100%.
$ npx skills add wechat-article/wechat-article-exporter
Display paginated content in the browser and generate print books using web technology
$ npx skills add pagedjs/pagedjs
A multi-threaded PDF password cracking utility equipped with commonly encountered password format builders and dictionary attacks.
$ npx skills add mufeedvh/pdfrip
A blazing fast, async-first, undetectable webscraping/web automation framework based on ultrafunkamsterdam/nodriver. Now with Docker support!
$ npx skills add cdpdriver/zendriver
A library for converting HTML into PDFs using ReportLab
$ npx skills add xhtml2pdf/xhtml2pdf
An Open source app to download and read books from shadow library (Anna’s Archive)
$ npx skills add dstark5/Openlib
Specify a GitHub or local repo, GitHub pull request, arXiv or Sci-Hub paper, YouTube transcript or documentation URL on the web and scrape into a text file and clipboard for easier LLM ingestion
$ npx skills add jimmc414/onefilellm
Hackable CLI tool for converting Markdown files to PDF using Node.js and headless Chrome.
$ npx skills add simonhaenisch/md-to-pdf
Offline markdown to pdf, choose -> edit -> transform 🥂
$ npx skills add realdennis/md2pdf
kramdown is a fast, pure Ruby Markdown superset converter, using a strict syntax definition and supporting several common extensions.
$ npx skills add gettalong/kramdown
A high-quality PDF to Markdown tool based on large language model visual recognition.
$ npx skills add MarkPDFdown/markpdfdown
Read Japanese manga inside browser with selectable text.
$ npx skills add kha-white/mokuro
SVG file parsing / rendering library
$ npx skills add dompdf/php-svg-lib
CLI tool for saving a faithful copy of a complete web page in a single HTML file (based on SingleFile)
$ npx skills add gildas-lormeau/single-file-cli
Free Offline OCR: offline Chinese text detection + recognition SDK
$ npx skills add myhub/tr
📄 PDF Viewer Component for Angular
$ npx skills add VadimDez/ng2-pdf-viewer
Kibana Alert & Report App for Elasticsearch
$ npx skills add sentinl/sentinl
An app to convert images to PDF file!
$ npx skills add Swati4star/Images-to-PDF
AI-powered news digest that curates thousands of sources down to the highlights that matter. Generates structured summaries (4H/daily/weekly/monthly) from Twitter, RSS, HackerNews, Reddit, GitHub Trending and more. Features multi-user support, bookmarks, source packs, and feed output.
$ npx skills add kevinho/clawfeed
Open Source Document Management System for Digital Archives (Scanned Documents)
$ npx skills add ciur/papermerge
RMT (RuoMengTu) is a free, open-source macro tool built on AHKv2. Let the code handle the tedious work; you have more meaningful things to do.
$ npx skills add zclucas/RMT
SnapX is a free, open-source, cross-platform tool that lets you capture or record any area of your screen and instantly share it with a single keypress. Upload images, videos, text, and more to multiple supported destinations, all with ease. ShareX fork
$ npx skills add SnapXL/SnapX
CCExtractor - Official version maintained by the core team
$ npx skills add CCExtractor/ccextractor
📜 A Cheat-Sheet Collection from the WWW
$ npx skills add sk3pp3r/cheat-sheet-pdf
Open-source screenshot and screen recording for macOS. The free, native alternative to CleanShot X. Built with Swift 6.0 and SwiftUI.
$ npx skills add lzhgus/Capso
Local-first, open-source AI assistant for your data. Unify tasks, notes, docs, photos, and bookmarks. Private, self-hosted, and extensible via APIs.
$ npx skills add eclaire-labs/eclaire
Note Companion: AI assistant for Obsidian that goes beyond just a chat. (prev File Organizer 2000)
$ npx skills add Nexus-JPF/note-companion
PDF++: the most Obsidian-native PDF annotation & viewing tool ever. Comes with optional Vim keybindings.
$ npx skills add RyotaUshio/obsidian-pdf-plus
A minimalist SOTA LaTeX OCR model with only 20M parameters, running in browser. Full training pipeline available for self-reproduction.
$ npx skills add alephpi/Texo
Free Open Source Document Management System (mirror, no pull request or issues)
$ npx skills add mayan-edms/Mayan-EDMS
Download your resume from resume.io as PDF
$ npx skills add felipeall/resumeio-to-pdf
CnSTD: a Python 3 package based on PyTorch/MXNet for Chinese/English Scene Text Detection, Mathematical Formula Detection (MFD), and Layout Analysis
$ npx skills add breezedeus/CnSTD
AI Agent Development Platform - Supports multiple models (OpenAI/DeepSeek/Wenxin/Tongyi), knowledge base management, workflow automation, and enterprise-grade security. Built with Flask + Vue3 + LangChain, featuring one-click Docker deployment.
$ npx skills add Haohao-end/openagent
Zotero Plugin for OCR
$ npx skills add UB-Mannheim/zotero-ocr
Web interface for recognizing text, proofreading OCR, and creating fully-digitized documents.
$ npx skills add scribeocr/scribeocr
A local, offline (after initial setup), portable OCR software that can process images and PDF files, using DeepSeek-OCR AI (running directly on your machine).
$ npx skills add th1nhhdk/local_ai_ocr
Quick, painless, intuitive OCR platform written in Rust and TypeScript. Modern UI with modern API, with an emphasis on intuitive user experience.
$ npx skills add readur/readur
Open source SEO audit tool.
$ npx skills add StJudeWasHere/seonaut
Dedoc is a library (service) for automating document parsing and bringing documents to a uniform format. It automatically extracts content, logical structure, tables, and meta information from textual electronic documents. (Parse document; Document content extraction; Logical structure extraction; PDF parser; Scanned document parser; DOCX parser; HTML parser
$ npx skills add ispras/dedoc
Multi-modal OCR pipeline optimized for ML training (text, figure, math, tables, diagrams)
$ npx skills add raphael-seo/Versatile-OCR-Program
Doctrine extensions for PHPStan
$ npx skills add phpstan/phpstan-doctrine
Self-healing infrastructure for AI agent payments. 90.3% auto-recovery.
$ npx skills add adrianhihi/helix
ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows.
$ npx skills add enoch3712/ExtractThinker
Curated Agent Skills for Microsoft & Azure – giving AI coding assistants structured, real-time expertise from Microsoft Learn docs.
$ npx skills add MicrosoftDocs/Agent-Skills
Extract and convert data from any document, images, PDFs, Word doc, PPT or URL into multiple formats (Markdown, JSON, CSV, HTML) with intelligent structured data extraction and advanced OCR.
$ npx skills add NanoNets/docstrange
A packaged and flexible version of the CRAFT text detector and Keras CRNN recognition model.
$ npx skills add faustomorales/keras-ocr
A book on JavaScript Promises
$ npx skills add azu/promises-book
Translate scientific papers in LaTeX, especially arXiv papers
$ npx skills add SUSYUSTC/MathTranslate
(eBook, PDFs Translation) A multilingual eBook processing tool supporting all eBook formats. Features online and offline translation while preserving original layouts. Compatible with both scanned and digital PDFs. Elegant user interface. The world's highest-performing open-source layout-preserving eBook translator.
$ npx skills add CBIhalsen/PolyglotPDF
The easiest way to use Agentic RAG in any enterprise
$ npx skills add ragapp/ragapp
PaddleOCR inference in PyTorch. Converted from [PaddleOCR](https://github.com/PaddlePaddle/PaddleOCR)
$ npx skills add frotms/PaddleOCR2Pytorch
Streamer-Sales (销冠, "sales champion"): a live-commerce host LLM 🛒🎁 that presents products based on their given features in a way that stimulates viewers' desire to buy. 🚀⭐ Includes a detailed data generation pipeline ❗ 📦 Also integrates LMDeploy accelerated inference 🚀, RAG retrieval-augmented generation 📚, TTS text-to-speech 🔊, digital human generation 🦸, an agent that queries the web for real-time information 🌐, ASR speech-to-text 🎙️, a Vue-based frontend 🍍, a FastAPI backend 🗝️, and Docker Compose packaging and deployment 🐋
$ npx skills add PeterH0323/Streamer-Sales
ONNX Model Exporter for PaddlePaddle
$ npx skills add PaddlePaddle/Paddle2ONNX
Hypernetworks that update LLMs to remember factual information
$ npx skills add SakanaAI/doc-to-lora
📘 E-book: a distilled summary of "Real-Time Rendering 3rd" | Over 97,000 Chinese characters in total. You can treat it as an accessible Chinese-language edition of "Real-Time Rendering 3rd", as a commentary and study companion to it, or as preparatory reading for "Real-Time Rendering 4th".
$ npx skills add QianMo/Real-Time-Rendering-3rd-CN-Summary-Ebook
🤖 A Node queue API for generating PDFs using headless Chrome. Comes with a CLI, S3 storage and webhooks for notifying subscribers about generated PDFs
$ npx skills add esbenp/pdf-bot
AI VTuber with LLM, ASR, TTS, OCR, CV and more technologies to live stream or play Minecraft with you.
$ npx skills add AkagawaTsurunaki/ZerolanLiveRobot
Scan, index, and archive all of your paper documents (acquired by Mayan EDMS)
$ npx skills add zhoubear/open-paperless
AIRecon is an autonomous cybersecurity agent that combines a self-hosted Large Language Model (Ollama) with a Kali Linux Docker sandbox and a Textual TUI. It is designed to automate security assessments, penetration testing, and bug bounty reconnaissance, without any API keys or cloud dependency.
$ npx skills add pikpikcu/airecon
Simple wrapper of tabula-java: extract table from PDF into pandas DataFrame
$ npx skills add chezou/tabula-py
vue.js pdf viewer
$ npx skills add FranckFreiburger/vue-pdf
A set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents.
$ npx skills add WZBSocialScienceCenter/pdftabextract
books pdf
$ npx skills add huyubing/books-pdf
AI Bank Statement Document Automation By LLM model and Personal Financial Analysis
$ npx skills add johnsonhk88/AI-Bank-Statement-Document-Automation-By-LLM-And-Personal-Finanical-Analysis-Prediction
An HTML to PDF library for the JVM. Based on Flying Saucer and Apache PDF-BOX 2. With SVG image support. Now also with accessible PDF support (WCAG, Section 508, PDF/UA)!
$ npx skills add danfickle/openhtmltopdf
A Python module that wraps the pdftoppm utility to convert PDF to PIL Image object
$ npx skills add Belval/pdf2image
Text-To-Speech, RAG, and LLMs. All local!
$ npx skills add alexpinel/Dot
Open Source Virtual (Network) Printer for Windows that allows you to create PDFs, OCR text, and print images, with advanced features usually available only in enterprise solutions.
$ npx skills add clawsoftware/clawPDF
Converts PDF, DOC, DOCX, XML, HTML, RTF, etc to plain text
$ npx skills add sajari/docconv
Python tool for grabbing text via screenshot
$ npx skills add ianzhao/textshot
Vision utilities for web interaction agents 👀
$ npx skills add reworkd/tarsier
CAJ to PDF converter (GUI version)
$ npx skills add sainnhe/caj2pdf-qt
Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages.
$ npx skills add yobix-ai/extractous
A plugin for reading and annotating PDFs and EPUBs in Obsidian.
$ npx skills add elias-sundqvist/obsidian-annotator
Android widget that can render PDF documents stored on SD card, linked as assets, or downloaded from a remote URL.
$ npx skills add voghDev/PdfViewPager