OpenAgentSkill guide
Best document processing skills for AI agents
Find skills for parsing PDFs, extracting tables, running OCR, converting documents, and preparing file content for agent workflows.
When to use this guide
Start from the job, then shortlist the tools.
Extract tables from PDFs
Use quality and freshness signals to decide whether a skill belongs in this workflow.
Convert files to markdown
Use quality and freshness signals to decide whether a skill belongs in this workflow.
Run OCR over scans
Use quality and freshness signals to decide whether a skill belongs in this workflow.
Normalize document metadata
Use quality and freshness signals to decide whether a skill belongs in this workflow.
Shortlist
Top skills to evaluate
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
Best fit: High-confidence pick with strong adoption and healthy maintenance signals.
PDF Parser for AI-ready data. Automate PDF accessibility. Open-source.
Best fit: High-confidence pick with strong adoption and healthy maintenance signals.
Convert documents into Markdown for agent-readable context
Best fit: High-confidence pick with strong adoption and healthy maintenance signals.
Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Parsel, BeautifulSoup, Playwright, and raw HTTP. Both headful and headless mode. With proxy rotation.
Best fit: High-confidence pick with strong adoption and healthy maintenance signals.
🔥 Search, scrape, and clean the web for AI agents.
Best fit: High-confidence pick with strong adoption and healthy maintenance signals.
A marketplace for AI-assisted security analysis and auditing plugins.
Best fit: High-confidence pick with strong adoption and healthy maintenance signals.
Build document intelligence and RAG workflows for agents
Best fit: High-confidence pick with strong adoption and healthy maintenance signals.
Open-source LLM-friendly web crawler and scraper
Best fit: High-confidence pick with strong adoption and healthy maintenance signals.
To extract main article from given URL with Node.js
Best fit: High-confidence pick with strong adoption and healthy maintenance signals.
A curated collection of over 380 agent skills from official teams and the community, enhancing AI capabilities.
Best fit: High-confidence pick with strong adoption and healthy maintenance signals.