Decision filters

Choose skills by scenario, quality, and trust signals.

29 skills matching "crawling"

Best blend of quality, stars, freshness, and agent usage

1

Crawl4AI

VERIFIEDEXCELLENT · 100

Web crawling built for AI

$ npx skills add unclecode/crawl4ai
3 agent calls100% success66.1K stars79 qualityClaude Code + OpenAI Agents31.0K installs
High-confidence pick with strong adoption and healthy maintenance signals.
claudegpt-4langchaincrewaiopenclaw
by unclecodeQuick view
2

Scrapy

VERIFIEDEXCELLENT · 100

High-throughput crawling and scraping for agent data pipelines

$ npx skills add scrapy/scrapy
61.8K stars77 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
pythoncrawlerrag
by scrapyQuick view
3

Firecrawl

VERIFIEDEXCELLENT · 100

Web data for AI applications

$ npx skills add firecrawl/firecrawl
123.1K stars76 qualityClaude Code + OpenAI Agents25.0K installs
High-confidence pick with strong adoption and healthy maintenance signals.
claudegpt-4langchainopenclaw
by mendableaiQuick view
4

Colly

VERIFIEDEXCELLENT · 100

Elegant Scraper and Crawler Framework for Golang

$ npx skills add gocolly/colly
25.3K stars74 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
gocrawler
by gocollyQuick view
5

Katana

VERIFIEDEXCELLENT · 100

A next-generation crawling and spidering framework.

$ npx skills add projectdiscovery/katana
16.7K stars73 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
gocrawler
by projectdiscoveryQuick view
6

Newspaper

VERIFIEDEXCELLENT · 100

newspaper3k is a news, full-text, and article metadata extraction in Python 3. Advanced docs:

$ npx skills add codelucas/newspaper
15.1K stars72 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
pythoncrawler
by codelucasQuick view
7

Crawlee Python

VERIFIEDEXCELLENT · 100

Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Parsel, BeautifulSoup, Playwright, and raw HTTP. Both headful and headless mode. With proxy rotation.

$ npx skills add apify/crawlee-python
9.1K stars69 qualityClaude Code + OpenAI Agents
High-confidence pick with strong adoption and healthy maintenance signals.
pythonweb-automation
by apifyQuick view
8

Ferret

VERIFIEDEXCELLENT · 100

Declarative web scraping

$ npx skills add MontFerret/ferret
6.0K stars68 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
gocrawler
by MontFerretQuick view
9

Puppeteer Sharp

VERIFIEDEXCELLENT · 100

Headless Chrome .NET API

$ npx skills add hardkoded/puppeteer-sharp
3.9K stars67 qualityClaude Code + Browser agents
High-confidence pick with strong adoption and healthy maintenance signals.
c#crawler
by hardkodedQuick view
10

Cariddi

VERIFIEDEXCELLENT · 100

Take a list of domains, crawl urls and scan for endpoints, secrets, api keys, file extensions, tokens and more

$ npx skills add edoardottt/cariddi
3.4K stars66 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
gocrawler
by edoardotttQuick view
11

Awesome Web Scraping

VERIFIEDEXCELLENT · 99

List of libraries, tools and APIs for web scraping and data processing.

$ npx skills add lorien/awesome-web-scraping
7.9K stars66 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
makefileweb-automation
by lorienQuick view
12

Skycaiji

VERIFIEDEXCELLENT · 100

蓝天采集器是一款开源免费的爬虫系统,仅需点选编辑规则即可采集数据,可运行在本地、虚拟主机或云服务器中,几乎能采集所有类型的网页,无缝对接各类CMS建站程序,免登录实时发布数据,全自动无需人工干预!是网页大数据采集软件中完全跨平台的云端爬虫系统

$ npx skills add zorlan/skycaiji
2.1K stars65 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
phpcrawler
by zorlanQuick view
13

Gain

VERIFIEDEXCELLENT · 98

Web crawling framework based on asyncio.

$ npx skills add elliotgao2/gain
2.0K stars65 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
pythoncrawler
by elliotgao2Quick view
14

Crawlab

VERIFIEDEXCELLENT · 100

Distributed web crawler admin platform for spiders management regardless of languages and frameworks. 分布式爬虫管理平台,支持任何语言和框架

$ npx skills add crawlab-team/crawlab
12.2K stars65 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
gocrawler
by crawlab-teamQuick view
15

WaterCrawl

VERIFIEDEXCELLENT · 93

Transform Web Content into LLM-Ready Data

$ npx skills add watercrawl/WaterCrawl
1.8K stars65 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
typescriptcrawler
by watercrawlQuick view
16

DotnetSpider

VERIFIEDEXCELLENT · 100

DotnetSpider, a .NET standard web crawling library. It is lightweight, efficient and fast high-level web crawling & scraping framework

$ npx skills add dotnetcore/DotnetSpider
4.1K stars64 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
c#crawler
by dotnetcoreQuick view
17

Rod

VERIFIEDEXCELLENT · 100

A Chrome DevTools Protocol driver for web automation and scraping.

$ npx skills add go-rod/rod
6.9K stars62 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
goweb-automation
by go-rodQuick view
18

Oxylabs AI Studio Py

VERIFIEDEXCELLENT · 96

Structured data gathering from any website using AI-powered scraper, crawler, and browser automation. Scraping and crawling with natural language prompts. Equip your LLM agents with fresh data. AI Studio python SDK for intelligent web data gathering.

$ npx skills add oxylabs/oxylabs-ai-studio-py
2.9K stars59 qualityClaude Code + Browser agents
High-confidence pick with strong adoption and healthy maintenance signals.
pythonweb-automation
by oxylabsQuick view
19

Trafilatura

VERIFIEDEXCELLENT · 91

Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML

$ npx skills add adbar/trafilatura
6.0K stars57 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
pythonweb-automation
by adbarQuick view
20

Scrapfly Scrapers

STRONG · 82

Scalable Python web scraping scripts for +40 popular domains

$ npx skills add scrapfly/scrapfly-scrapers
983 stars55 qualityClaude Code
Solid option that is likely worth shortlisting for production workflows.
pythonweb-automation
by scrapflyQuick view
21

Grab

VERIFIEDSTRONG · 81

Web Scraping Framework

$ npx skills add lorien/grab
2.5K stars54 qualityClaude Code
Solid option that is likely worth shortlisting for production workflows.
pythonweb-automation
by lorienQuick view
22

Headless Chrome Crawler

VERIFIEDSTRONG · 72

Distributed crawler powered by Headless Chrome

$ npx skills add yujiosaka/headless-chrome-crawler
5.6K stars53 qualityClaude Code + Browser agents
Solid option that is likely worth shortlisting for production workflows.Check: Repository looks stale
javascriptcrawler
by yujiosakaQuick view
23

Core

VERIFIEDSTRONG · 74

The complete web scraping toolkit for PHP.

$ npx skills add roach-php/core
1.5K stars53 qualityClaude Code
Solid option that is likely worth shortlisting for production workflows.
phpweb-automation
by roach-phpQuick view
24

RED HAWK

VERIFIEDSTRONG · 76

All in one tool for Information Gathering, Vulnerability Scanning and Crawling. A must have tool for all penetration testers

$ npx skills add Tuhinshubhra/RED_HAWK
3.7K stars52 qualityClaude Code
Solid option that is likely worth shortlisting for production workflows.Check: Repository looks stale
phpcrawler
by TuhinshubhraQuick view
25

Geziyor

VERIFIEDSTRONG · 75

Geziyor, blazing fast web crawling & scraping framework for Go. Supports JS rendering.

$ npx skills add geziyor/geziyor
2.8K stars51 qualityClaude Code
Solid option that is likely worth shortlisting for production workflows.Check: Repository looks stale
gocrawler
by geziyorQuick view
26

Deepcrawl

STRONG · 79

100% free and full open-source edge Firecrawl alternative with better links extraction for agents - that you can deploy to cloudflare or vercel by yourself.

$ npx skills add lumpinif/deepcrawl
576 stars50 qualityClaude Code
Solid option that is likely worth shortlisting for production workflows.
typescriptweb-automation
by lumpinifQuick view
27

Ruia

VERIFIEDSTRONG · 73

Async Python 3.6+ web scraping micro-framework based on asyncio

$ npx skills add howie6879/ruia
1.7K stars49 qualityClaude Code
Solid option that is likely worth shortlisting for production workflows.Check: Repository looks stale
pythoncrawler
by howie6879Quick view
28

Mlscraper

VERIFIEDPROMISING · 67

🤖 Scrape data from HTML websites automatically by just providing examples

$ npx skills add lorey/mlscraper
1.4K stars49 qualityClaude Code
Useful candidate, but compare it with alternatives before adopting.Check: Repository looks stale
pythoncrawler
by loreyQuick view
29

Rebrowser Patches

VERIFIEDPROMISING · 67

Collection of patches for puppeteer and playwright to avoid automation detection and leaks. Helps to avoid Cloudflare and DataDome CAPTCHA pages. Easy to patch/unpatch, can be enabled/disabled on demand.

$ npx skills add rebrowser/rebrowser-patches
1.4K stars49 qualityClaude Code + Browser agents
Useful candidate, but compare it with alternatives before adopting.Check: Repository looks stale
javascriptweb-automation
by rebrowserQuick view