Web scraping skills

AI agent skills for web scraping.

Find reusable crawler, browser automation, HTML-to-markdown, and structured extraction skills that agents can compare before scraping public websites.

Built for builders searching for reliable web scraping skills, crawler skills, and browser automation skills for AI agents.

Resolve via agent API Browse matching skills

Matched

Stars

459K

Workflow

Crawl

Output

Structured data

Agent jobs

Start from a real workflow, not a keyword.

These pages are built for high-intent search and for agents that need a structured shortlist before installing third-party code.

Scrape public pricing pages into JSON or markdown

Crawl documentation sites for RAG ingestion

Monitor public web pages and extract changed fields

Use browser automation when static HTML is not enough

Task routes

Task pages agents can call.

View all tasks

Scrape pricing pages

Scrape competitor pricing pages

Extract pricing data from public competitor pages and turn it into clean structured output.

Open task

Crawl docs

Crawl a documentation site

Turn documentation pages into clean markdown or records that an agent can search and reuse.

Open task

Browser workflow

Automate a browser workflow

Navigate web apps, fill forms, take screenshots, and verify state through browser automation.

Open task

Ranked shortlist

High-signal skills to inspect first.

Open best list

#01

Crawlee

24K stars

Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.

100

Quality

100

Trust

Fit

browser-automationJun 12, 2026 pushApache-2.0

$ npx skills add apify/crawlee

#02

Crawlee Python

9.2K stars

Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Parsel, BeautifulSoup, Playwright, and raw HTTP. Both headful and headless mode. With proxy rotation.

100

Quality

100

Trust

Fit

browser-automationJun 12, 2026 pushApache-2.0

$ npx skills add apify/crawlee-python

#03

Firecrawl

132K stars

The API to search, scrape, and interact with the web at scale. 🔥

100

Quality

100

Trust

Fit

agent-frameworksJun 12, 2026 pushAGPL-3.0

$ npx skills add firecrawl/firecrawl

#04

Scrapegraph AI

27K stars

Python scraper based on AI

100

Quality

100

Trust

Fit

web-automationJun 11, 2026 pushMIT

$ npx skills add ScrapeGraphAI/Scrapegraph-ai

#05

Maxun

16K stars

🔥 The open-source no-code platform for web scraping, crawling, search and AI data extraction • Turn websites into structured APIs in minutes 🔥

100

Quality

100

Trust

Fit

web-automationJun 11, 2026 pushAGPL-3.0

$ npx skills add getmaxun/maxun

#06

Crawl4AI

66K stars

Open-source LLM-friendly web crawler and scraper

100

Quality

100

Trust

Fit

web-automationMay 22, 2026 pushApache-2.0

$ npx skills add unclecode/crawl4ai

Evaluation

How to choose the right skill.

Scope controls for crawl depth and allowed domains

Clean markdown or structured output support

Fresh maintenance and visible repository activity

Clear install command and sandbox-friendly usage

Questions

What is the best web scraping skill for an AI agent?

Start with skills that return clean markdown or structured data, expose crawl limits, and have strong maintenance signals. OpenAgentSkill ranks candidates by task fit, trust, quality, stars, and install readiness.

Can an agent use these skills automatically?

Yes. Agents can call the Resolve API with a scraping task, inspect the ranked shortlist, and fetch install handoffs before running anything.

Best web scraping skills Web scraping use case Scrape pricing task Browser automation skills