Decision filters

Choose skills by scenario, quality, and trust signals.

109 skills matching "extract"

Best blend of quality, stars, freshness, and agent usage

1

Crawl4AI

VERIFIED · EXCELLENT · 100

Web crawling built for AI

$ npx skills add unclecode/crawl4ai
3 agent calls · 100% success · 66.1K stars · 79 quality · Claude Code + OpenAI Agents · 31.0K installs
High-confidence pick with strong adoption and healthy maintenance signals.
claude · gpt-4 · langchain · crewai · openclaw
by unclecode
2

Firecrawl

VERIFIED · EXCELLENT · 100

🔥 Search, scrape, and clean the web for AI agents.

$ npx skills add firecrawl/firecrawl
123.2K stars · 78 quality · Claude Code
High-confidence pick with strong adoption and healthy maintenance signals.
typescript · ai-agents
by firecrawl
3

PaddleOCR

VERIFIED · EXCELLENT · 100

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

$ npx skills add PaddlePaddle/PaddleOCR
78.4K stars · 77 quality · Claude Code
High-confidence pick with strong adoption and healthy maintenance signals.
python · rag
by PaddlePaddle
4

Scrapy

VERIFIED · EXCELLENT · 100

High-throughput crawling and scraping for agent data pipelines

$ npx skills add scrapy/scrapy
61.8K stars · 77 quality · Claude Code
High-confidence pick with strong adoption and healthy maintenance signals.
python · crawler · rag
by scrapy
5

EasySpider

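VERIFIED · EXCELLENT · 100

A visual no-code web crawler/spider. EasySpider is visual software for browser automation testing, data collection, and web crawling that lets you design and run crawling tasks graphically without writing code. Also known as ServiceWrapper, an intelligent service-wrapping system for web applications.

$ npx skills add NaiboWang/EasySpider
43.9K stars · 76 quality · Claude Code
High-confidence pick with strong adoption and healthy maintenance signals.
javascript · crawler
by NaiboWang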
6

Browser Use

VERIFIED · EXCELLENT · 100

Give your AI agent a web browser

$ npx skills add browser-use/browser-use
95.1K stars · 75 quality · Claude Code + OpenAI Agents · 28.0K installs
High-confidence pick with strong adoption and healthy maintenance signals.
claude · gpt-4 · langchain · openclaw
by browser-use
7

ScrapeGraphAI

VERIFIED · EXCELLENT · 100

Extract web data with LLM-guided scraping graphs

$ npx skills add ScrapeGraphAI/Scrapegraph-ai
25.8K stars · 74 quality · Claude Code
High-confidence pick with strong adoption and healthy maintenance signals.
python · llm · web-automation
by ScrapeGraphAI
8

Colly

VERIFIED · EXCELLENT · 100

Elegant Scraper and Crawler Framework for Golang

$ npx skills add gocolly/colly
25.3K stars · 74 quality · Claude Code
High-confidence pick with strong adoption and healthy maintenance signals.
go · crawler
by gocolly
9

Proxy Pool

VERIFIED · EXCELLENT · 100

Python ProxyPool for web spider

$ npx skills add jhao104/proxy_pool
23.4K stars · 74 quality · Claude Code
High-confidence pick with strong adoption and healthy maintenance signals.
python · crawler
by jhao104
10

Katana

VERIFIED · EXCELLENT · 100

A next-generation crawling and spidering framework.

$ npx skills add projectdiscovery/katana
16.7K stars · 73 quality · Claude Code
High-confidence pick with strong adoption and healthy maintenance signals.
go · crawler
by projectdiscovery
11

Newspaper

VERIFIED · EXCELLENT · 100

newspaper3k is a news, full-text, and article metadata extraction in Python 3. Advanced docs:

$ npx skills add codelucas/newspaper
15.1K stars · 72 quality · Claude Code
High-confidence pick with strong adoption and healthy maintenance signals.
python · crawler
by codelucas
12

Lux

VERIFIED · EXCELLENT · 100

👾 Fast and simple video download library and CLI tool written in Go

$ npx skills add iawia002/lux
31.4K stars · 72 quality · Claude Code
High-confidence pick with strong adoption and healthy maintenance signals.
go · crawler
by iawia002
13

Python

VERIFIED · EXCELLENT · 100

Python scripts: simulated Zhihu login, web crawlers, Excel operations, WeChat official accounts, and remote power-on.

$ npx skills add injetlee/Python
10.6K stars · 71 quality · Claude Code
High-confidence pick with strong adoption and healthy maintenance signals.
python · crawler
by injetlee
14

Crawlee Python

VERIFIED · EXCELLENT · 100

Crawlee: A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Parsel, BeautifulSoup, Playwright, and raw HTTP. Both headful and headless mode. With proxy rotation.

$ npx skills add apify/crawlee-python
9.1K stars · 69 quality · Claude Code + OpenAI Agents
High-confidence pick with strong adoption and healthy maintenance signals.
python · web-automation
by apify
15

Wiseflow

VERIFIED · EXCELLENT · 100

A team of "workhorses in the cloud" that makes money for you online 24/7

$ npx skills add TeamWiseFlow/wiseflow
8.2K stars · 69 quality · Claude Code
High-confidence pick with strong adoption and healthy maintenance signals.
typescript · crawler
by TeamWiseFlow
16

Skills

VERIFIED · EXCELLENT · 100

Trail of Bits Claude Code skills for security research, vulnerability detection, and audit workflows

$ npx skills add trailofbits/skills
5.3K stars · 69 quality · Claude Code
High-confidence pick with strong adoption and healthy maintenance signals.
by trailofbits
17

Ferret

VERIFIED · EXCELLENT · 100

Declarative web scraping

$ npx skills add MontFerret/ferret
6.0K stars · 68 quality · Claude Code
High-confidence pick with strong adoption and healthy maintenance signals.
go · crawler
by MontFerret
18

JMComic Crawler Python

VERIFIED · EXCELLENT · 100

Python API for JMComic | Provides a Python API for accessing JMComic, supporting both the web and mobile endpoints | JMComic GitHub Actions downloader 🚀

$ npx skills add hect0x7/JMComic-Crawler-Python
5.8K stars · 68 quality · Claude Code
High-confidence pick with strong adoption and healthy maintenance signals.
python · crawler
by hect0x7
19

Scrapy Redis

VERIFIED · EXCELLENT · 100

Redis-based components for Scrapy.

$ npx skills add rmax/scrapy-redis
5.6K stars · 68 quality · Claude Code
High-confidence pick with strong adoption and healthy maintenance signals.
python · crawler
by rmax
20

Sparrow

VERIFIED · EXCELLENT · 100

Structured data extraction and instruction calling with ML, LLM and Vision LLM

$ npx skills add katanaml/sparrow
5.2K stars · 68 quality · Claude Code + OpenAI Agents
High-confidence pick with strong adoption and healthy maintenance signals.
python · rag
by katanaml
21

Browser Fingerprinting

VERIFIED · EXCELLENT · 100

Analysis of Bot Protection systems with available countermeasures 🚿. How to defeat anti-bot system 👻 and get around browser fingerprinting scripts 🕵️‍♂️ when scraping the web?

$ npx skills add niespodd/browser-fingerprinting
5.0K stars · 68 quality · Claude Code + Browser agents
High-confidence pick with strong adoption and healthy maintenance signals.
javascript · crawler
by niespodd
22

Weibo Crawler

VERIFIED · EXCELLENT · 97

Sina Weibo crawler: uses Python to crawl Sina Weibo data and download Weibo images and videos.

$ npx skills add dataabc/weibo-crawler
4.5K stars · 67 quality · Claude Code
High-confidence pick with strong adoption and healthy maintenance signals.
python · crawler
by dataabc
23

Google Maps Scraper

VERIFIED · EXCELLENT · 100

Scrape data from Google Maps. Extracts data such as the name, address, phone number, website URL, rating, number of reviews, latitude and longitude, reviews, email, and more for each place.

$ npx skills add gosom/google-maps-scraper
4.1K stars · 67 quality · Claude Code + OpenAI Agents
High-confidence pick with strong adoption and healthy maintenance signals.
go · web-automation
by gosom
24

Puppeteer Sharp

VERIFIED · EXCELLENT · 100

Headless Chrome .NET API

$ npx skills add hardkoded/puppeteer-sharp
3.9K stars · 67 quality · Claude Code + Browser agents
High-confidence pick with strong adoption and healthy maintenance signals.
c# · crawler
by hardkoded
25

Feapder

VERIFIED · EXCELLENT · 100

🚀🚀🚀 feapder is an easy-to-use, powerful Python crawler framework. It ships four built-in spiders (AirSpider, Spider, TaskSpider, and BatchSpider) for different scenarios, and supports resumable crawling, monitoring and alerting, browser rendering, and large-scale deduplication. The companion crawler management system feaplat provides convenient deployment and scheduling.

$ npx skills add Boris-code/feapder
3.7K stars · 67 quality · Claude Code
High-confidence pick with strong adoption and healthy maintenance signals.
python · crawler
by Boris-code
26

Toapi

VERIFIED · EXCELLENT · 100

Every web site provides APIs.

$ npx skills add elliotgao2/toapi
3.5K stars · 67 quality · Claude Code
High-confidence pick with strong adoption and healthy maintenance signals.
python · crawler
by elliotgao2
27

Cariddi

VERIFIED · EXCELLENT · 100

Take a list of domains, crawl urls and scan for endpoints, secrets, api keys, file extensions, tokens and more

$ npx skills add edoardottt/cariddi
3.4K stars · 66 quality · Claude Code
High-confidence pick with strong adoption and healthy maintenance signals.
go · crawler
by edoardottt
28

Amazon Scraper

VERIFIED · EXCELLENT · 100

Free Trial Amazon Scraper API for extracting search, product, offer listing, reviews, question and answers, best sellers and sellers data.

$ npx skills add oxylabs/amazon-scraper
3.0K stars · 66 quality · Claude Code
High-confidence pick with strong adoption and healthy maintenance signals.
python · web-automation
by oxylabs
29

Crawler

VERIFIED · EXCELLENT · 100

https://spatie.be/docs/crawler

$ npx skills add spatie/crawler
2.8K stars · 66 quality · Claude Code
High-confidence pick with strong adoption and healthy maintenance signals.
php · crawler
by spatie
30

FinalRecon

VERIFIED · EXCELLENT · 100

All In One Web Recon

$ npx skills add thewhiteh4t/FinalRecon
2.8K stars · 66 quality · Claude Code
High-confidence pick with strong adoption and healthy maintenance signals.
python · crawler
by thewhiteh4t
31

QueryList

VERIFIED · EXCELLENT · 100

🕷 The progressive PHP crawler framework: an elegant, progressive scraping framework for PHP.

$ npx skills add jae-jae/QueryList
2.7K stars · 66 quality · Claude Code
High-confidence pick with strong adoption and healthy maintenance signals.
php · crawler
by jae-jae
32

Crawler Detect

VERIFIED · EXCELLENT · 100

🕷 CrawlerDetect is a PHP class for detecting bots/crawlers/spiders via the user agent

$ npx skills add JayBizzle/Crawler-Detect
2.4K stars · 65 quality · Claude Code
High-confidence pick with strong adoption and healthy maintenance signals.
php · crawler
by JayBizzle
33

WechatSogou

VERIFIED · EXCELLENT · 97

A crawler interface for WeChat official accounts, based on Sogou WeChat search.

$ npx skills add chyroc/WechatSogou
6.3K stars · 65 quality · Claude Code
High-confidence pick with strong adoption and healthy maintenance signals.
python · crawler
by chyroc
34

Photon

VERIFIED · EXCELLENT · 98

Incredibly fast crawler designed for OSINT.

$ npx skills add s0md3v/Photon
12.9K stars · 65 quality · Claude Code
High-confidence pick with strong adoption and healthy maintenance signals.
python · crawler
by s0md3v
35

Videodl

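VERIFIED · EXCELLENT · 100

Videodl: A lightweight video downloader written in pure Python. It prefers high-definition, watermark-free sources and supports a large number of streaming platforms, including Douyin, Kuaishou, Xiaohongshu, Bilibili, TikTok, YouTube, FIFA+, Youku, Tencent Video, iQIYI, 1905.com, LeTV, Mango TV, Migu, PPTV, Sohu, Facebook, Twitter, Sina Weibo, Toutiao, NetEase Open Course, WeSing, CCTV Yangshipin, Kugou Music MV, Xinpianchang, Zhihu, Baidu Tieba, and TED.

$ npx skills add CharlesPikachu/videodl
2.1K stars · 65 quality · Claude Code
High-confidence pick with strong adoption and healthy maintenance signals.
python · crawler
by CharlesPikachu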
36

Skycaiji

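VERIFIED · EXCELLENT · 100

Skycaiji is a free, open-source crawler system. You collect data simply by pointing and clicking to edit rules. It runs locally, on shared hosting, or on a cloud server, can scrape almost any type of web page, integrates seamlessly with a wide range of CMS site builders, publishes data in real time without logging in, and runs fully automatically with no manual intervention. It is a fully cross-platform, cloud-based crawler system for large-scale web data collection.

$ npx skills add zorlan/skycaiji
2.1K stars · 65 quality · Claude Code
High-confidence pick with strong adoption and healthy maintenance signals.
php · crawler
by zorlan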
37

SCrawler

VERIFIED · EXCELLENT · 100

🏳️‍🌈 Media downloader from any sites, including Twitter, Reddit, Instagram, BlueSky, TikTok, Threads, Facebook, OnlyFans, YouTube, Pinterest, PornHub, XHamster, XVIDEOS, ThisVid etc.

$ npx skills add AAndyProgram/SCrawler
2.0K stars · 65 quality · Claude Code
High-confidence pick with strong adoption and healthy maintenance signals.
visual-basic-.net · crawler
by AAndyProgram
38

Gain

VERIFIED · EXCELLENT · 98

Web crawling framework based on asyncio.

$ npx skills add elliotgao2/gain
2.0K stars · 65 quality · Claude Code
High-confidence pick with strong adoption and healthy maintenance signals.
python · crawler
by elliotgao2
39

Crawlab

VERIFIED · EXCELLENT · 100

Distributed web crawler admin platform for managing spiders regardless of language or framework.

$ npx skills add crawlab-team/crawlab
12.2K stars · 65 quality · Claude Code
High-confidence pick with strong adoption and healthy maintenance signals.
go · crawler
by crawlab-team
40

Webmagic

VERIFIED · EXCELLENT · 98

A scalable web crawler framework for Java.

$ npx skills add code4craft/webmagic
11.7K stars · 65 quality · Claude Code
High-confidence pick with strong adoption and healthy maintenance signals.
java · crawler
by code4craft
41

Article Extractor

VERIFIED · EXCELLENT · 100

To extract main article from given URL with Node.js

$ npx skills add extractus/article-extractor
1.9K stars · 65 quality · Claude Code
High-confidence pick with strong adoption and healthy maintenance signals.
javascript · crawler
by extractus
42

NewPipeExtractor

VERIFIED · EXCELLENT · 100

NewPipe's core library for extracting data from streaming sites

$ npx skills add TeamNewPipe/NewPipeExtractor
1.9K stars · 65 quality · Claude Code
High-confidence pick with strong adoption and healthy maintenance signals.
java · crawler
by TeamNewPipe
43

X Crawl

VERIFIED · EXCELLENT · 98

Flexible Node.js AI-assisted crawler library

$ npx skills add coder-hxl/x-crawl
1.9K stars · 65 quality · Claude Code
High-confidence pick with strong adoption and healthy maintenance signals.
typescript · crawler
by coder-hxl
44

WaterCrawl

VERIFIED · EXCELLENT · 93

Transform Web Content into LLM-Ready Data

$ npx skills add watercrawl/WaterCrawl
1.8K stars · 65 quality · Claude Code
High-confidence pick with strong adoption and healthy maintenance signals.
typescript · crawler
by watercrawl
45

Browserless

VERIFIED · EXCELLENT · 100

The headless Chrome/Chromium driver on top of Puppeteer. Take screenshots, generate PDFs, extract text and HTML with a production-ready API.

$ npx skills add microlinkhq/browserless
1.8K stars · 65 quality · Claude Code + Browser agents
High-confidence pick with strong adoption and healthy maintenance signals.
javascript · browser-automation
by microlinkhq
46

Crawler Illegal Cases In China

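VERIFIED · EXCELLENT · 97

A collection of legal cases in China involving web crawlers. This project compiles news, materials, laws, and regulations related to lawsuits and violations involving crawler developers in mainland China. It aims to help crawler practitioners working in mainland China understand the relevant laws and avoid crossing data-compliance red lines.

$ npx skills add hiddendevj/Crawler_Illegal_Cases_In_China
4.6K stars · 64 quality · Claude Code
High-confidence pick with strong adoption and healthy maintenance signals.
html · crawler
by hiddendevj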
47

Douyin

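VERIFIED · EXCELLENT · 97

Douyin crawler: collects public data such as account home pages, likes, favorites, original sounds, topics, search results, collections, posts, followings, and followers.

$ npx skills add erma0/douyin
1.6K stars · 64 quality · Claude Code
High-confidence pick with strong adoption and healthy maintenance signals.
typescript · crawler
by erma0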
48

DotnetSpider

VERIFIED · EXCELLENT · 100

DotnetSpider, a .NET Standard web crawling library. It is a lightweight, efficient, and fast high-level web crawling & scraping framework.

$ npx skills add dotnetcore/DotnetSpider
4.1K stars · 64 quality · Claude Code
High-confidence pick with strong adoption and healthy maintenance signals.
c# · crawler
by dotnetcore
49

ScopeSentry

VERIFIED · EXCELLENT · 98

ScopeSentry-Cyberspace mapping, subdomain enumeration, port scanning, sensitive information discovery, vulnerability scanning, distributed nodes

$ npx skills add Autumn-27/ScopeSentry
1.5K stars · 64 quality · Claude Code
High-confidence pick with strong adoption and healthy maintenance signals.
go · crawler
by Autumn-27
50

Work Crawler

VERIFIED · EXCELLENT · 96

A download tool for comics and novels. Comic sites: 腾讯漫画, 大角虫漫画, 有妖气, 咪咕, SF漫画, 哦漫画, 看漫画, 漫画柜, 汗汗酷漫, 動漫伊甸園, 快看漫画, 微博动漫, 733动漫网, 大古漫画网, 漫画DB, 無限動漫, 動漫狂, 卡推漫画, 动漫之家, 动漫屋, 古风漫画网, 36漫画网, 亲亲漫画网, 乙女漫画, webtoons, 咚漫, ニコニコ静画, ComicWalker, ヤングエースUP, モアイ, pixivコミック, サイコミ. Novel sites (saved as EPUB): アルファポリス, カクヨム, ハーメルン, 小説家になろう, 起点中文网, 八一中文网, 顶点小说, 落霞小说网, 努努书坊, 笔趣阁.

$ npx skills add kanasimi/work_crawler
4.0K stars · 64 quality · Claude Code
High-confidence pick with strong adoption and healthy maintenance signals.
javascript · crawler
by kanasimi
51

Fscrawler

VERIFIED · EXCELLENT · 97

Elasticsearch File System Crawler (FS Crawler)

$ npx skills add dadoonet/fscrawler
1.4K stars · 64 quality · Claude Code
High-confidence pick with strong adoption and healthy maintenance signals.
java · crawler
by dadoonet
52

Skills

VERIFIED · EXCELLENT · 100

Give your AI the power to browse, scrape, and extract structured data from complex websites, with faster execution, lower cost, and more reliable results.

$ npx skills add browser-act/skills
1.4K stars · 64 quality · Claude Code + Cursor
High-confidence pick with strong adoption and healthy maintenance signals.
python · web-automation
by browser-act
53

OpenWPM

VERIFIED · EXCELLENT · 92

A web privacy measurement framework

$ npx skills add openwpm/OpenWPM
1.4K stars · 64 quality · Claude Code
High-confidence pick with strong adoption and healthy maintenance signals.
python · crawler
by openwpm
54

Agentql

VERIFIED · EXCELLENT · 100

AgentQL is a suite of tools for connecting your AI to the web. Featuring a query language and Playwright integrations for interacting with elements and extracting data quickly, precisely, and at scale. Includes REST API, Python and JavaScript SDKs, browser debugger.

$ npx skills add tinyfish-io/agentql
1.4K stars · 64 quality · Claude Code + Browser agents
High-confidence pick with strong adoption and healthy maintenance signals.
python · web-automation
by tinyfish-io
55

Google Play Scraper

VERIFIED · EXCELLENT · 94

Node.js scraper to get data from Google Play

$ npx skills add facundoolano/google-play-scraper
2.9K stars · 63 quality · Claude Code
High-confidence pick with strong adoption and healthy maintenance signals.
javascript · crawler
by facundoolano
56

News Please

VERIFIED · EXCELLENT · 99

news-please - an integrated web crawler and information extractor for news that just works

$ npx skills add fhamborg/news-please
2.5K stars · 62 quality · Claude Code
High-confidence pick with strong adoption and healthy maintenance signals.
python · crawler
by fhamborg
57

Goclone

VERIFIED · EXCELLENT · 99

Website Cloner - Utilizes powerful Go routines to clone websites to your computer within seconds.

$ npx skills add goclone-dev/goclone
2.1K stars · 62 quality · Claude Code
High-confidence pick with strong adoption and healthy maintenance signals.
go · crawler
by goclone-dev
58

Diskover Community

VERIFIED · EXCELLENT · 98

Diskover Community Edition - Open source file indexer, file search engine and data management and analytics powered by Elasticsearch

$ npx skills add diskoverdata/diskover-community
1.8K stars · 61 quality · Claude Code
High-confidence pick with strong adoption and healthy maintenance signals.
php · crawler
by diskoverdata
59

Examples Of Web Crawlers

VERIFIED · EXCELLENT · 97

Some interesting examples of Python crawlers that are friendly to beginners, mainly crawling sites such as Taobao, Tmall, WeChat, WeChat Read, Douban, and QQ.

$ npx skills add shengqiangzhang/examples-of-web-crawlers
14.6K stars · 61 quality · Claude Code + Browser agents
High-confidence pick with strong adoption and healthy maintenance signals.
html · crawler
by shengqiangzhang
60

Sperm

VERIFIED · EXCELLENT · 91

A roundup of excellent reverse-engineering articles I have read; worth a look.

$ npx skills add darbra/sperm
1.4K stars · 61 quality · Claude Code
High-confidence pick with strong adoption and healthy maintenance signals.
crawler
by darbra
61

AI Map Py

VERIFIED · EXCELLENT · 91

AI Map is an AI-powered website mapping tool by Oxylabs AI Studio that uses natural language prompts to intelligently discover and extract relevant URLs from any website.

$ npx skills add oxylabs/ai-map-py
1.2K stars · 60 quality · Claude Code
High-confidence pick with strong adoption and healthy maintenance signals.
web-automation
by oxylabs
62

MyGPTReader

VERIFIED · EXCELLENT · 98

A community-driven way to read and chat with AI bots - powered by chatGPT.

$ npx skills add myreader-io/myGPTReader
4.4K stars · 60 quality · Claude Code + OpenAI Agents
High-confidence pick with strong adoption and healthy maintenance signals.
python · crawler
by myreader-io
63

TorBot

VERIFIED · EXCELLENT · 87

Dark Web OSINT Tool

$ npx skills add DedSecInside/TorBot
4.1K stars · 60 quality · Claude Code
High-confidence pick with strong adoption and healthy maintenance signals.
python · crawler
by DedSecInside
64

How To Scrape Google Trends

VERIFIED · EXCELLENT · 90

Learn step-by-step how to scrape Google Trends data and make a result comparison using Python and Oxylabs SERP API. Extract keywords, their popularity, breakdown by region, related queries, and more.

$ npx skills add oxylabs/how-to-scrape-google-trends
2.6K stars · 59 quality · Claude Code
High-confidence pick with strong adoption and healthy maintenance signals.
python · web-automation
by oxylabs
65

Gecco

VERIFIED · EXCELLENT · 89

Easy-to-use lightweight web crawler

$ npx skills add xtuhcy/gecco
2.5K stars · 59 quality · Claude Code
High-confidence pick with strong adoption and healthy maintenance signals.
java · crawler
by xtuhcy
66

Node Crawler

VERIFIED · EXCELLENT · 92

Web Crawler/Spider for NodeJS + server-side jQuery ;-)

$ npx skills add bda-research/node-crawler
6.8K stars · 58 quality · Claude Code
High-confidence pick with strong adoption and healthy maintenance signals.
typescript · crawler
by bda-research
67

How To Scrape Google Scholar

VERIFIED · EXCELLENT · 88

A guide for extracting titles, authors, and citations from Google Scholar using Python and Oxylabs SERP Scraper API.

$ npx skills add oxylabs/how-to-scrape-google-scholar
1.6K stars · 57 quality · Claude Code
High-confidence pick with strong adoption and healthy maintenance signals.
python · web-automation
by oxylabs
68

Trafilatura

VERIFIED · EXCELLENT · 91

Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML

$ npx skills add adbar/trafilatura
6.0K stars · 57 quality · Claude Code
High-confidence pick with strong adoption and healthy maintenance signals.
python · web-automation
by adbar
69

XHS Spider

VERIFIED · EXCELLENT · 93

A tool for collecting Xiaohongshu data and batch-downloading site images and videos; a great-looking data collection tool (batch download, video extraction, images). Telegram: https://t.me/+ZtLSwuIKTo44MDY1

$ npx skills add xisuo67/XHS-Spider
1.4K stars · 57 quality · Claude Code
High-confidence pick with strong adoption and healthy maintenance signals.
crawler
by xisuo67
70

Spider Flow

VERIFIED · STRONG · 77

A new-generation crawler platform that defines crawl flows graphically, so you can build a crawler without writing code.

$ npx skills add ssssssss-team/spider-flow
11.3K stars · 57 quality · Claude Code
Solid option that is likely worth shortlisting for production workflows. Check: Repository looks stale
java · crawler
by ssssssss-team
71

Scylla

VERIFIED · EXCELLENT · 89

Intelligent proxy pool for Humans™ to extract content from the internet and build your own Large Language Models in this new AI era

$ npx skills add MikeChongCan/scylla
4.0K stars · 56 quality · Claude Code
High-confidence pick with strong adoption and healthy maintenance signals.
python · crawler
by MikeChongCan
72

Mdcx

VERIFIED · STRONG · 83

Movie metadata scraper

$ npx skills add sqzw-x/mdcx
3.6K stars · 56 quality · Claude Code
Solid option that is likely worth shortlisting for production workflows.
python · crawler
by sqzw-x
73

The process of extracting product data from Amazon using Python, including titles, ratings, prices, images, and descriptions.

$ npx skills add oxylabs/how-to-scrape-amazon-product-data
2.9K stars · 55 quality · Claude Code
Solid option that is likely worth shortlisting for production workflows.
web-automation
by oxylabs
74

Avbook

VERIFIED · STRONG · 75

AV movie management system; crawlers for avmoo, javbus, and javlibrary; an online Japanese adult video library and magnet-link database.

$ npx skills add guyueyingmu/avbook
10.0K stars · 55 quality · Claude Code
Solid option that is likely worth shortlisting for production workflows. Check: Repository looks stale
php · crawler
by guyueyingmu
75

Awesome Crawler

VERIFIED · STRONG · 79

A collection of awesome web crawlers and spiders in different languages

$ npx skills add BruceDone/awesome-crawler
7.2K stars · 54 quality · Claude Code
Solid option that is likely worth shortlisting for production workflows. Check: Repository looks stale
crawler
by BruceDone
76

How To Scrape Amazon Prices

VERIFIED · STRONG · 81

Code for extracting best-selling items, search results, and currently available deals from Amazon using Python and Oxylabs E-Commerce Scraper API.

$ npx skills add oxylabs/how-to-scrape-amazon-prices
1.7K stars · 53 quality · Claude Code
Solid option that is likely worth shortlisting for production workflows.
python · web-automation
by oxylabs
77

Grab Site

VERIFIED · STRONG · 80

The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns

$ npx skills add ArchiveTeam/grab-site
1.6K stars · 53 quality · Claude Code
Solid option that is likely worth shortlisting for production workflows.
python · crawler
by ArchiveTeam
78

Headless Chrome Crawler

VERIFIED · STRONG · 72

Distributed crawler powered by Headless Chrome

$ npx skills add yujiosaka/headless-chrome-crawler
5.6K stars · 53 quality · Claude Code + Browser agents
Solid option that is likely worth shortlisting for production workflows. Check: Repository looks stale
javascript · crawler
by yujiosaka
79

Haipproxy

VERIFIED · STRONG · 78

💖 Highly available distributed IP proxy pool, powered by Scrapy and Redis

$ npx skills add SpiderClub/haipproxy
5.5K stars · 53 quality · Claude Code
Solid option that is likely worth shortlisting for production workflows. Check: Repository looks stale
python · crawler
by SpiderClub
80

ECommerceCrawlers

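VERIFIED · STRONG · 78

Hands-on 🐍 crawlers 🕷 for a variety of websites and e-commerce data. Includes 🕸: Taobao products, WeChat official accounts, Dianping, Qichacha, job sites, Xianyu, Ali tasks, Cnblogs, Weibo, Baidu Tieba, Douban Movies, ibaotu.com, Quanjing, Douban Music, a provincial drug administration, Sohu News, text collection for machine learning, FOFA asset collection, Autohome, the National Bureau of Statistics, Baidu keyword index counts, spider pan-directories, Toutiao, Douban movie reviews, Ctrip, the Xiaomi app store, Anjuke, and Tujia homestays ❤️❤️❤️. WeChat crawler showcase project:

$ npx skills add DropsDevopsOrg/ECommerceCrawlers
5.5K stars · 53 quality · Claude Code
Solid option that is likely worth shortlisting for production workflows. Check: Repository looks stale
python · crawler
by DropsDevopsOrg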
81

Reader

STRONG · 84

Open source web infrastructure for AI. Scrape, crawl, and automate the web, clean markdown, browser sessions, ready for your agents.

$ npx skills add vakra-dev/reader
529 stars · 53 quality · Claude Code + Browser agents
Solid option that is likely worth shortlisting for production workflows.
typescript · ai-agents
by vakra-dev
82

ProxyBroker

VERIFIED · STRONG · 77

Proxy [Finder | Checker | Server]. HTTP(S) & SOCKS 🎭

$ npx skills add constverum/ProxyBroker
4.2K stars · 52 quality · Claude Code
Solid option that is likely worth shortlisting for production workflows. Check: Repository looks stale
python · crawler
by constverum
83

Proxypool

VERIFIED · STRONG · 76

Automatically crawls proxy nodes on the public internet, de-duplicates and tests for usability and then provides a list of nodes

$ npx skills add zu1k/proxypool
4.0K stars · 52 quality · Claude Code
Solid option that is likely worth shortlisting for production workflows. Check: Repository looks stale
go · crawler
by zu1k
84

How To Scrape Google Finance

VERIFIED · STRONG · 78

Use Web Scraper API to extract data from Google Finance, including stock titles, pricing, and price changes in percentages.

$ npx skills add oxylabs/how-to-scrape-google-finance
1.0K stars · 52 quality · Claude Code
Solid option that is likely worth shortlisting for production workflows.
python · web-automation
by oxylabs
85

RED HAWK

VERIFIED · STRONG · 76

All in one tool for Information Gathering, Vulnerability Scanning and Crawling. A must have tool for all penetration testers

$ npx skills add Tuhinshubhra/RED_HAWK
3.7K stars · 52 quality · Claude Code
Solid option that is likely worth shortlisting for production workflows. Check: Repository looks stale
php · crawler
by Tuhinshubhra
86

Python3 Spider

VERIFIED · STRONG · 71

Hands-on Python crawling: simulated logins for major websites, including but not limited to slider CAPTCHA verification, Pinduoduo, Meituan, Baidu, bilibili, Dianping, and Taobao. If you like it, please star ❤️

$ npx skills add wkunzhi/Python3-Spider
3.4K stars · 51 quality · Claude Code
Solid option that is likely worth shortlisting for production workflows. Check: Repository looks stale
python · crawler
by wkunzhi
87

Crawlergo

VERIFIED · STRONG · 75

A powerful browser crawler for web vulnerability scanners

$ npx skills add Qianlitp/crawlergo
3.0K stars · 51 quality · Claude Code + Browser agents
Solid option that is likely worth shortlisting for production workflows. Check: Repository looks stale
go · crawler
by Qianlitp
88

Gospider

VERIFIED · PROMISING · 69

Gospider - Fast web spider written in Go

$ npx skills add jaeles-project/gospider
3.0K stars · 51 quality · Claude Code
Useful candidate, but compare it with alternatives before adopting. Check: Repository looks stale
go · crawler
by jaeles-project
89

DecryptLogin

VERIFIED · STRONG · 75

DecryptLogin: APIs for logging in to some websites using requests.

$ npx skills add CharlesPikachu/DecryptLogin
2.9K stars · 51 quality · Claude Code
Solid option that is likely worth shortlisting for production workflows. Check: Repository looks stale
python · crawler
by CharlesPikachu
90

Owllook

VERIFIED · PROMISING · 69

owllook: a novel search engine

$ npx skills add howie6879/owllook
2.8K stars · 51 quality · Claude Code
Useful candidate, but compare it with alternatives before adopting. Check: Repository looks stale
python · crawler
by howie6879
91

GoogleScraper

VERIFIED · STRONG · 75

A Python module to scrape several search engines (like Google, Yandex, Bing, Duckduckgo, ...). Including asynchronous networking support.

$ npx skills add NikolaiT/GoogleScraper
2.8K stars · 51 quality · Claude Code
Solid option that is likely worth shortlisting for production workflows. Check: Repository looks stale
html · crawler
by NikolaiT
92

Geziyor

VERIFIED · STRONG · 75

Geziyor, blazing fast web crawling & scraping framework for Go. Supports JS rendering.

$ npx skills add geziyor/geziyor
2.8K stars · 51 quality · Claude Code
Solid option that is likely worth shortlisting for production workflows. Check: Repository looks stale
go · crawler
by geziyor
93

Leaked GPTs

VERIFIED · PROMISING · 69

Leaked GPTs prompts. Bypass the 25-message limit or try out GPTs without a Plus subscription.

$ npx skills add friuns2/Leaked-GPTs
2.4K stars · 50 quality · Claude Code + OpenAI Agents
Useful candidate, but compare it with alternatives before adopting. Check: Repository looks stale
python · crawler
by friuns2
94

Abot

VERIFIED · STRONG · 74

Cross Platform C# web crawler framework built for speed and flexibility. Please star this project! +1.

$ npx skills add sjdirect/abot
2.3K stars · 50 quality · Claude Code
Solid option that is likely worth shortlisting for production workflows. Check: Repository looks stale
c# · crawler
by sjdirect
95

Deepcrawl

STRONG · 79

100% free and fully open-source edge Firecrawl alternative with better link extraction for agents, which you can deploy to Cloudflare or Vercel yourself.

$ npx skills add lumpinif/deepcrawl
576 stars · 50 quality · Claude Code
Solid option that is likely worth shortlisting for production workflows.
typescript · web-automation
by lumpinif
96

Vulnx

VERIFIED · STRONG · 74

vulnx 🕷️ is an intelligent bot and shell that can perform automatic injection and help researchers detect security vulnerabilities in CMS systems. It can perform quick CMS security detection, information collection (including subdomain names, IP address, country information, organizational information, time zone, etc.), and vulnerability scanning.

$ npx skills add anouarbensaad/vulnx
2.1K stars · 50 quality · Claude Code
Solid option that is likely worth shortlisting for production workflows. Check: Repository looks stale
python · crawler
by anouarbensaad
97

Gocrawl

VERIFIEDPROMISING · 67

Polite, slim and concurrent web crawler.

$ npx skills add PuerkitoBio/gocrawl
2.1K stars · 50 quality · Claude Code
Useful candidate, but compare it with alternatives before adopting. Check: Repository looks stale
go · crawler
by PuerkitoBio
98

Dirhunt

VERIFIED · PROMISING · 67

Find web directories without bruteforce

$ npx skills add Nekmo/dirhunt
2.0K stars · 50 quality · Claude Code
Useful candidate, but compare it with alternatives before adopting. Check: Repository looks stale
python · crawler
by Nekmo
99

LxSpider

VERIFIED · STRONG · 73

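A collection of crawler examples, including but not limited to: Taobao, JD.com, Tmall, Douban, Douyin, Kuaishou, Weibo, WeChat, Alibaba, Toutiao, pdd, Youku, iQIYI, Ctrip, 12306, 58, Sohu, various indices, CQVIP and Wanfang, Zlibraty, Oalib, novels, bidding sites, procurement sites, Xiaohongshu, Dianping, Twitter, Maimai, and Zhihu.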

$ npx skills add lixi5338619/lxSpider
1.9K stars · 50 quality · Claude Code
Solid option that is likely worth shortlisting for production workflows. Check: Repository looks stale
python · crawler
by lixi5338619
100

Ast Hook For Js RE

VERIFIED · PROMISING · 62

A browser memory roaming solution (still exploring...)

$ npx skills add JSREI/ast-hook-for-js-RE
1.9K stars · 50 quality · Claude Code
Useful candidate, but compare it with alternatives before adopting. Check: Repository looks stale
javascript · crawler
by JSREI
101

BT Btt

VERIFIED · PROMISING · 67

Introduction to the magnet-link site U3C3, plus domain updates

$ npx skills add u3c3/BT-btt
1.8K stars · 50 quality · Claude Code
Useful candidate, but compare it with alternatives before adopting. Check: Repository looks stale
crawler
by u3c3
102

PSpider

VERIFIED · PROMISING · 67

A simple, easy-to-use Python crawler framework. QQ discussion group: 597510560

$ npx skills add xianhu/PSpider
1.8K stars · 50 quality · Claude Code
Useful candidate, but compare it with alternatives before adopting. Check: Repository looks stale
python · crawler
by xianhu
103

Go Spider

VERIFIED · STRONG · 73

[Crawler framework (Golang)] An awesome concurrent crawler (spider) framework for Go. The crawler is flexible and modular. It can easily be extended into a customized crawler, or you can use only the default crawl components.

$ npx skills add hu17889/go_spider
1.8K stars · 50 quality · Claude Code
Solid option that is likely worth shortlisting for production workflows. Check: Repository looks stale
go · crawler
by hu17889
104

Ruia

VERIFIED · STRONG · 73

Async Python 3.6+ web scraping micro-framework based on asyncio

$ npx skills add howie6879/ruia
1.7K stars · 49 quality · Claude Code
Solid option that is likely worth shortlisting for production workflows. Check: Repository looks stale
python · crawler
by howie6879
105

AutoCrawler

VERIFIED · STRONG · 73

Google, Naver multiprocess image web crawler (Selenium)

$ npx skills add YoongiKim/AutoCrawler
1.7K stars · 49 quality · Claude Code + Browser agents
Solid option that is likely worth shortlisting for production workflows. Check: Repository looks stale
python · crawler
by YoongiKim
106

Spider Collection

VERIFIED · STRONG · 72

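Python crawlers. Current inventory: NetEase Cloud Music song scraping, Bilibili video scraping, Zhihu Q&A scraping, wallpaper scraping, xvideos video scraping, audiobook scraping, a Weibo crawler, Anjuke listing scraping plus data visualization, a Bilibili video cover extractor, an IP proxy pool wrapper, a Zhihu million-user crawler plus data analysis, and a GitHub user crawler.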

$ npx skills add srx-2000/spider_collection
1.6K stars · 49 quality · Claude Code
Solid option that is likely worth shortlisting for production workflows. Check: Repository looks stale
python · crawler
by srx-2000
107

Mlscraper

VERIFIED · PROMISING · 67

🤖 Scrape data from HTML websites automatically by just providing examples

$ npx skills add lorey/mlscraper
1.4K stars · 49 quality · Claude Code
Useful candidate, but compare it with alternatives before adopting. Check: Repository looks stale
python · crawler
by lorey
108

Scrapecraft

STRONG · 75

🤖 AI-powered web scraping editor with visual workflow builder. Build, test & deploy web scrapers using natural language. Powered by ScrapeGraphAI & LangGraph.

$ npx skills add ScrapeGraphAI/scrapecraft
641 stars · 46 quality · Claude Code
Solid option that is likely worth shortlisting for production workflows.
python · web-automation
by ScrapeGraphAI
109

Uscrapper

PROMISING · 55

Uscrapper Vanta: Dive deeper into the web with this powerful open-source tool. Extract valuable insights with ease and efficiency, from both surface and deep web sources. Empower your data mining and analysis with Vanta's advanced capabilities. Fast, reliable, and user-friendly, Uscrapper Vanta is the ultimate choice for researchers and analysts.

$ npx skills add z0m31en7/Uscrapper
778 stars · 39 quality · Claude Code
Useful candidate, but compare it with alternatives before adopting. Check: Repository looks stale
python · web-automation
by z0m31en7