Web scraping skills

AI agent skills for web scraping.

Find reusable crawler, browser automation, HTML-to-markdown, and structured extraction skills that agents can compare before scraping public websites.

Built for builders searching for reliable web scraping skills, crawler skills, and browser automation skills for AI agents.

Matched

16

Stars

459K

Workflow

Crawl

Output

Structured data

Agent jobs

Start from a real workflow, not a keyword.

These pages are built for high-intent search and for agents that need a structured shortlist before installing third-party code.

01

Scrape public pricing pages into JSON or markdown

02

Crawl documentation sites for RAG ingestion

03

Monitor public web pages and extract changed fields

04

Use browser automation when static HTML is not enough

Ranked shortlist

High-signal skills to inspect first.

Open best list
24K stars

Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.

100

Quality

100

Trust

74

Fit

browser-automationJun 12, 2026 pushApache-2.0
$ npx skills add apify/crawlee
9.2K stars

Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Parsel, BeautifulSoup, Playwright, and raw HTTP. Both headful and headless mode. With proxy rotation.

100

Quality

100

Trust

69

Fit

browser-automationJun 12, 2026 pushApache-2.0
$ npx skills add apify/crawlee-python
132K stars

The API to search, scrape, and interact with the web at scale. 🔥

100

Quality

100

Trust

78

Fit

agent-frameworksJun 12, 2026 pushAGPL-3.0
$ npx skills add firecrawl/firecrawl
27K stars

Python scraper based on AI

100

Quality

100

Trust

74

Fit

web-automationJun 11, 2026 pushMIT
$ npx skills add ScrapeGraphAI/Scrapegraph-ai
16K stars

🔥 The open-source no-code platform for web scraping, crawling, search and AI data extraction • Turn websites into structured APIs in minutes 🔥

100

Quality

100

Trust

73

Fit

web-automationJun 11, 2026 pushAGPL-3.0
$ npx skills add getmaxun/maxun
66K stars

Open-source LLM-friendly web crawler and scraper

100

Quality

100

Trust

79

Fit

web-automationMay 22, 2026 pushApache-2.0
$ npx skills add unclecode/crawl4ai

Evaluation

How to choose the right skill.

Scope controls for crawl depth and allowed domains

Clean markdown or structured output support

Fresh maintenance and visible repository activity

Clear install command and sandbox-friendly usage

Questions

What is the best web scraping skill for an AI agent?

Start with skills that return clean markdown or structured data, expose crawl limits, and have strong maintenance signals. OpenAgentSkill ranks candidates by task fit, trust, quality, stars, and install readiness.

Can an agent use these skills automatically?

Yes. Agents can call the Resolve API with a scraping task, inspect the ranked shortlist, and fetch install handoffs before running anything.