Decision filters

Choose skills by scenario, quality, and trust signals.

194 skills matching "data"

Best blend of quality, stars, freshness, and agent usage

1

Crawl4AI

VERIFIEDEXCELLENT · 100

Web crawling built for AI

$ npx skills add unclecode/crawl4ai
3 agent calls100% success66.1K stars79 qualityClaude Code + OpenAI Agents31.0K installs
High-confidence pick with strong adoption and healthy maintenance signals.
claudegpt-4langchaincrewaiopenclaw
by unclecodeQuick view
2

Firecrawl

VERIFIEDEXCELLENT · 100

🔥 Search, scrape, and clean the web for AI agents.

$ npx skills add firecrawl/firecrawl
123.2K stars78 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
typescriptai-agents
by firecrawlQuick view
3

PaddleOCR

VERIFIEDEXCELLENT · 100

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

$ npx skills add PaddlePaddle/PaddleOCR
78.4K stars77 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
pythonrag
by PaddlePaddleQuick view
4

Scrapy

VERIFIEDEXCELLENT · 100

High-throughput crawling and scraping for agent data pipelines

$ npx skills add scrapy/scrapy
61.8K stars77 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
pythoncrawlerrag
by scrapyQuick view
5

Scrapling

VERIFIEDEXCELLENT · 100

Adaptive web scraping for agent data collection

$ npx skills add D4Vinci/Scrapling
53.2K stars77 qualityClaude Code + Browser agents
High-confidence pick with strong adoption and healthy maintenance signals.
pythonweb-automationrag
by D4VinciQuick view
6

LlamaIndex

VERIFIEDEXCELLENT · 100

Connect agents to private data and retrieval workflows

$ npx skills add run-llama/llama_index
49.6K stars76 qualityClaude Code + LlamaIndex
High-confidence pick with strong adoption and healthy maintenance signals.
llamaindexpythonrag
by run-llamaQuick view
7

Hello Agents

VERIFIEDEXCELLENT · 100

📚 《从零开始构建智能体》——从零开始的智能体原理与实践教程

$ npx skills add datawhalechina/hello-agents
52.5K stars76 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
pythonrag
by datawhalechinaQuick view
8

Graphify

VERIFIEDEXCELLENT · 100

AI coding assistant skill (Claude Code, Codex, OpenCode, Cursor, Gemini CLI, and more). Turn any folder of code, SQL schemas, R scripts, shell scripts, docs, papers, images, or videos into a queryable knowledge graph. App code + database schema + infrastructure in one graph.

$ npx skills add safishamsi/graphify
52.0K stars76 qualityClaude Code + OpenAI Agents
High-confidence pick with strong adoption and healthy maintenance signals.
pythonrag
by safishamsiQuick view
9

Milvus

VERIFIEDEXCELLENT · 100

Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search

$ npx skills add milvus-io/milvus
44.4K stars76 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
gorag
by milvus-ioQuick view
10

EasySpider

VERIFIEDEXCELLENT · 100

A visual no-code/code-free web crawler/spider易采集:一个可视化浏览器自动化测试/数据采集/网页爬虫软件,可以无代码图形化的设计和执行爬虫任务。别名:ServiceWrapper面向Web应用的智能化服务封装系统。

$ npx skills add NaiboWang/EasySpider
43.9K stars76 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
javascriptcrawler
by NaiboWangQuick view
11

Firecrawl

VERIFIEDEXCELLENT · 100

Web data for AI applications

$ npx skills add firecrawl/firecrawl
123.1K stars76 qualityClaude Code + OpenAI Agents25.0K installs
High-confidence pick with strong adoption and healthy maintenance signals.
claudegpt-4langchainopenclaw
by mendableaiQuick view
12

Browser Use

VERIFIEDEXCELLENT · 100

Give your AI agent a web browser

$ npx skills add browser-use/browser-use
95.1K stars75 qualityClaude Code + OpenAI Agents28.0K installs
High-confidence pick with strong adoption and healthy maintenance signals.
claudegpt-4langchainopenclaw
by browser-useQuick view
13

Happy Llm

VERIFIEDEXCELLENT · 100

📚 从零开始构建大模型

$ npx skills add datawhalechina/happy-llm
30.5K stars75 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
jupyter-notebookrag
by datawhalechinaQuick view
14

ScrapeGraphAI

VERIFIEDEXCELLENT · 100

Extract web data with LLM-guided scraping graphs

$ npx skills add ScrapeGraphAI/Scrapegraph-ai
25.8K stars74 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
pythonllmweb-automation
by ScrapeGraphAIQuick view
15

Chroma

VERIFIEDEXCELLENT · 100

Search infrastructure for AI

$ npx skills add chroma-core/chroma
28.1K stars74 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
rustai-agents
by chroma-coreQuick view
16

Scientific Agent Skills

VERIFIEDEXCELLENT · 100

A set of ready to use Agent Skills for research, science, engineering, analysis, finance and writing.

$ npx skills add K-Dense-AI/scientific-agent-skills
25.2K stars74 qualityClaude Code + OpenAI Agents
High-confidence pick with strong adoption and healthy maintenance signals.
by K-Dense-AIQuick view
17

Claude Scientific Skills

VERIFIEDEXCELLENT · 100

A set of ready to use Agent Skills for research, science, engineering, analysis, finance and writing.

$ npx skills add K-Dense-AI/claude-scientific-skills
25.2K stars74 qualityClaude Code + OpenAI Agents
High-confidence pick with strong adoption and healthy maintenance signals.
by K-Dense-AIQuick view
18

Mlflow

VERIFIEDEXCELLENT · 100

The open source AI engineering platform for agents, LLMs, and ML models. MLflow enables teams of all sizes to debug, evaluate, monitor, and optimize production-quality AI applications while controlling costs and managing access to models and data.

$ npx skills add mlflow/mlflow
26.1K stars74 qualityClaude Code + LangChain
High-confidence pick with strong adoption and healthy maintenance signals.
pythonllmops
by mlflowQuick view
19

Crawlee

VERIFIEDEXCELLENT · 100

Build reliable crawlers for LLM and RAG data ingestion

$ npx skills add apify/crawlee
23.4K stars74 qualityClaude Code + Browser agents
High-confidence pick with strong adoption and healthy maintenance signals.
typescriptplaywrightpuppeteer
by apifyQuick view
20

Colly

VERIFIEDEXCELLENT · 100

Elegant Scraper and Crawler Framework for Golang

$ npx skills add gocolly/colly
25.3K stars74 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
gocrawler
by gocollyQuick view
21

OpenViking

VERIFIEDEXCELLENT · 100

OpenViking is an open-source context database designed specifically for AI Agents(such as openclaw). OpenViking unifies the management of context (memory, resources, and skills) that Agents need through a file system paradigm, enabling hierarchical context delivery and self-evolving.

$ npx skills add volcengine/OpenViking
24.5K stars74 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
pythonai-agents
by volcengineQuick view
22

Proxy Pool

VERIFIEDEXCELLENT · 100

Python ProxyPool for web spider

$ npx skills add jhao104/proxy_pool
23.4K stars74 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
pythoncrawler
by jhao104Quick view
23

Dolt

VERIFIEDEXCELLENT · 100

Dolt – Git for Data

$ npx skills add dolthub/dolt
22.8K stars74 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
goai-agents
by dolthubQuick view
24

FinceptTerminal

VERIFIEDEXCELLENT · 100

FinceptTerminal is a modern finance application offering advanced market analytics, investment research, and economic data tools, designed for interactive exploration and data-driven decision-making in a user-friendly environment.

$ npx skills add Fincept-Corporation/FinceptTerminal
22.8K stars74 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
pythonai-agents
by Fincept-CorporationQuick view
25

Opendataloader Pdf

VERIFIEDEXCELLENT · 100

PDF Parser for AI-ready data. Automate PDF accessibility. Open-source.

$ npx skills add opendataloader-project/opendataloader-pdf
21.5K stars74 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
javarag
by opendataloader-projectQuick view
26

DB GPT

VERIFIEDEXCELLENT · 100

open-source agentic AI data assistant for the next generation of AI + Data products.

$ npx skills add eosphoros-ai/DB-GPT
18.8K stars73 qualityClaude Code + OpenAI Agents
High-confidence pick with strong adoption and healthy maintenance signals.
pythonrag
by eosphoros-aiQuick view
27

Kubesphere

VERIFIEDEXCELLENT · 100

The container platform tailored for Kubernetes multi-cloud, datacenter, and edge management ⎈ 🖥 ☁️

$ npx skills add kubesphere/kubesphere
16.9K stars73 quality
High-confidence pick with strong adoption and healthy maintenance signals.
by kubesphereQuick view
28

Katana

VERIFIEDEXCELLENT · 100

A next-generation crawling and spidering framework.

$ npx skills add projectdiscovery/katana
16.7K stars73 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
gocrawler
by projectdiscoveryQuick view
29

WrenAI

VERIFIEDEXCELLENT · 100

Turn any AI Agents into world-class data analysts through the open context layer that gives AI agents grounded, governed memory, context, SQL across 20+ data sources, that helps you build GenBI, agentic BI, text-to-sql, dashboards, and agentic analytics.

$ npx skills add Canner/WrenAI
15.3K stars72 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
pythonrag
by CannerQuick view
30

Newspaper

VERIFIEDEXCELLENT · 100

newspaper3k is a news, full-text, and article metadata extraction in Python 3. Advanced docs:

$ npx skills add codelucas/newspaper
15.1K stars72 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
pythoncrawler
by codelucasQuick view
31

Easy Dataset

VERIFIEDEXCELLENT · 100

A powerful tool for creating datasets for LLM fine-tuning 、RAG and Eval

$ npx skills add ConardLi/easy-dataset
14.3K stars72 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
javascriptrag
by ConardLiQuick view
32

SurfSense

VERIFIEDEXCELLENT · 100

An open source, privacy focused alternative to NotebookLM for teams with no data limits. Join our Discord: https://discord.gg/ejRNvftDp9

$ npx skills add MODSetter/SurfSense
14.3K stars72 qualityClaude Code + LangChain
High-confidence pick with strong adoption and healthy maintenance signals.
pythonrag
by MODSetterQuick view
33

Lux

VERIFIEDEXCELLENT · 100

👾 Fast and simple video download library and CLI tool written in Go

$ npx skills add iawia002/lux
31.4K stars72 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
gocrawler
by iawia002Quick view
34

Bisheng

VERIFIEDEXCELLENT · 100

BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, Unified model management, Evaluation, SFT, Dataset Management, Enterprise-level System Management, Observability and more.

$ npx skills add dataelement/bisheng
11.4K stars72 qualityClaude Code + OpenAI Agents
High-confidence pick with strong adoption and healthy maintenance signals.
typescriptrag
by dataelementQuick view
35

Python

VERIFIEDEXCELLENT · 100

Python脚本。模拟登录知乎, 爬虫,操作excel,微信公众号,远程开机

$ npx skills add injetlee/Python
10.6K stars71 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
pythoncrawler
by injetleeQuick view
36

InsForge

VERIFIEDEXCELLENT · 100

The all-in-one, open-source backend platform for agentic coding. InsForge gives your coding agent database, auth, storage, compute, hosting, and AI gateway to ship full-stack apps end-to-end.

$ npx skills add InsForge/InsForge
10.5K stars71 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
typescriptai-agents
by InsForgeQuick view
37

Metaflow

VERIFIEDEXCELLENT · 100

Build, Manage and Deploy AI/ML Systems

$ npx skills add Netflix/metaflow
10.1K stars71 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
pythonllmops
by NetflixQuick view
38

Zvec

VERIFIEDEXCELLENT · 100

A lightweight, lightning-fast, in-process vector database

$ npx skills add alibaba/zvec
9.7K stars71 quality
High-confidence pick with strong adoption and healthy maintenance signals.
by alibabaQuick view
39

Xget

VERIFIEDEXCELLENT · 100

Ultra-high-performance, secure, all-in-one acceleration engine for developer resources

$ npx skills add xixu-me/xget
8.1K stars70 quality
High-confidence pick with strong adoption and healthy maintenance signals.
by xixu-meQuick view
40

Cocoindex

VERIFIEDEXCELLENT · 100

Incremental engine for long horizon agents 🌟 Star if you like it!

$ npx skills add cocoindex-io/cocoindex
10.0K stars70 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
pythonai-agents
by cocoindex-ioQuick view
41

Phoenix

VERIFIEDEXCELLENT · 100

AI Observability & Evaluation

$ npx skills add Arize-ai/phoenix
9.8K stars70 qualityClaude Code + LangChain
High-confidence pick with strong adoption and healthy maintenance signals.
pythonllmops
by Arize-aiQuick view
42

Deeplake

VERIFIEDEXCELLENT · 100

Deeplake is AI Data Runtime for Agents. It provides serverless postgres with a multimodal datalake, enabling scalable retrieval and training.

$ npx skills add activeloopai/deeplake
9.1K stars69 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
c++rag
by activeloopaiQuick view
43

Crawlee Python

VERIFIEDEXCELLENT · 100

Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Parsel, BeautifulSoup, Playwright, and raw HTTP. Both headful and headless mode. With proxy rotation.

$ npx skills add apify/crawlee-python
9.1K stars69 qualityClaude Code + OpenAI Agents
High-confidence pick with strong adoption and healthy maintenance signals.
pythonweb-automation
by apifyQuick view
44

Wiseflow

VERIFIEDEXCELLENT · 100

为你 7*24 在线搞钱的“云上牛马”团队

$ npx skills add TeamWiseFlow/wiseflow
8.2K stars69 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
typescriptcrawler
by TeamWiseFlowQuick view
45

Llm Universe

VERIFIEDEXCELLENT · 100

本项目是一个面向小白开发者的大模型应用开发教程,在线阅读地址:https://datawhalechina.github.io/llm-universe/

$ npx skills add datawhalechina/llm-universe
13.1K stars69 qualityClaude Code + LangChain
High-confidence pick with strong adoption and healthy maintenance signals.
jupyter-notebookrag
by datawhalechinaQuick view
46

All In RAG

VERIFIEDEXCELLENT · 100

🔍大模型应用开发实战一:RAG 技术全栈指南,在线阅读地址:https://datawhalechina.github.io/all-in-rag/

$ npx skills add datawhalechina/all-in-rag
7.8K stars69 qualityClaude Code + LangChain
High-confidence pick with strong adoption and healthy maintenance signals.
pythonrag
by datawhalechinaQuick view
47

Skills

VERIFIEDEXCELLENT · 100

Trail of Bits Claude Code skills for security research, vulnerability detection, and audit workflows

$ npx skills add trailofbits/skills
5.3K stars69 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
by trailofbitsQuick view
48

Flyte

VERIFIEDEXCELLENT · 100

Dynamic, resilient AI orchestration. Coordinate data, models, and compute as you build AI workflows.

$ npx skills add flyteorg/flyte
7.0K stars69 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
goai-agents
by flyteorgQuick view
49

Vespa

VERIFIEDEXCELLENT · 100

AI + Data, online. https://vespa.ai

$ npx skills add vespa-engine/vespa
6.9K stars69 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
javarag
by vespa-engineQuick view
50

Plano

VERIFIEDEXCELLENT · 100

Plano is an AI-native proxy and data plane for agentic apps — with built-in orchestration, safety, observability, and smart LLM routing so you stay focused on your agents core logic.

$ npx skills add katanemo/plano
6.5K stars68 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
rustllmops
by katanemoQuick view
51

ChatLab

VERIFIEDEXCELLENT · 100

Local-first chat history analyzer with AI. | 本地优先的 AI 聊天记录分析工具

$ npx skills add ChatLab/ChatLab
6.5K stars68 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
typescriptai-agents
by ChatLabQuick view
52

Airweave

VERIFIEDEXCELLENT · 100

Open-source context retrieval layer for AI agents

$ npx skills add airweave-ai/airweave
6.4K stars68 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
pythonai-agents
by airweave-aiQuick view
53

SQLBot

VERIFIEDEXCELLENT · 100

🔥 基于大模型和 RAG 的智能问数系统,对话式数据分析神器。Text-to-SQL Generation via LLMs using RAG.

$ npx skills add dataease/SQLBot
6.1K stars68 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
javascriptrag
by dataeaseQuick view
54

Genkit

VERIFIEDEXCELLENT · 100

Open-source framework for building AI-powered apps in JavaScript, Go, and Python, built and used in production by Google

$ npx skills add genkit-ai/genkit
6.0K stars68 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
typescriptrag
by genkit-aiQuick view
55

Ferret

VERIFIEDEXCELLENT · 100

Declarative web scraping

$ npx skills add MontFerret/ferret
6.0K stars68 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
gocrawler
by MontFerretQuick view
56

JMComic Crawler Python

VERIFIEDEXCELLENT · 100

Python API for JMComic | 提供Python API访问禁漫天堂,同时支持网页端和移动端 | 禁漫天堂GitHub Actions下载器🚀

$ npx skills add hect0x7/JMComic-Crawler-Python
5.8K stars68 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
pythoncrawler
by hect0x7Quick view
57

Scrapy Redis

VERIFIEDEXCELLENT · 100

Redis-based components for Scrapy.

$ npx skills add rmax/scrapy-redis
5.6K stars68 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
pythoncrawler
by rmaxQuick view
58

Zenml

VERIFIEDEXCELLENT · 100

ZenML 🙏: One AI Platform from Pipelines to Agents. https://zenml.io.

$ npx skills add zenml-io/zenml
5.4K stars68 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
pythonllmops
by zenml-ioQuick view
59

Sparrow

VERIFIEDEXCELLENT · 100

Structured data extraction and instruction calling with ML, LLM and Vision LLM

$ npx skills add katanaml/sparrow
5.2K stars68 qualityClaude Code + OpenAI Agents
High-confidence pick with strong adoption and healthy maintenance signals.
pythonrag
by katanamlQuick view
60

Browser Fingerprinting

VERIFIEDEXCELLENT · 100

Analysis of Bot Protection systems with available countermeasures 🚿. How to defeat anti-bot system 👻 and get around browser fingerprinting scripts 🕵️‍♂️ when scraping the web?

$ npx skills add niespodd/browser-fingerprinting
5.0K stars68 qualityClaude Code + Browser agents
High-confidence pick with strong adoption and healthy maintenance signals.
javascriptcrawler
by niespoddQuick view
61

Html Anything

VERIFIEDEXCELLENT · 100

✨ The agentic HTML editor — your local AI agent writes the HTML, you ship it. 🚀 75 Skills × 9 Surfaces (magazine · deck · poster · XHS / tweet · prototype · data report · Hyperframes) 🛡️ Sandboxed preview · 📤 1-click to WeChat / X / Zhihu / HTML / PNG 🔑 Zero API key — Claude Code / Cursor / Codex / Gemini / Copilot / OpenCode / Qwen / Aider.

$ npx skills add nexu-io/html-anything
4.7K stars67 qualityClaude Code + OpenAI Agents
High-confidence pick with strong adoption and healthy maintenance signals.
htmlai-agents
by nexu-ioQuick view
62

Thunderbolt

VERIFIEDEXCELLENT · 100

AI You Control: Choose your models. Own your data. Eliminate vendor lock-in.

$ npx skills add thunderbird/thunderbolt
4.6K stars67 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
typescriptai-agents
by thunderbirdQuick view
63

Weibo Crawler

VERIFIEDEXCELLENT · 97

新浪微博爬虫,用python爬取新浪微博数据,并下载微博图片和微博视频

$ npx skills add dataabc/weibo-crawler
4.5K stars67 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
pythoncrawler
by dataabcQuick view
64

Google Maps Scraper

VERIFIEDEXCELLENT · 100

scrape data from Google Maps. Extracts data such as the name, address, phone number, website URL, rating, reviews number, latitude and longitude, reviews,email and more for each place

$ npx skills add gosom/google-maps-scraper
4.1K stars67 qualityClaude Code + OpenAI Agents
High-confidence pick with strong adoption and healthy maintenance signals.
goweb-automation
by gosomQuick view
65

Memgraph

VERIFIEDEXCELLENT · 100

High-performance open-source in-memory graph database for GraphRAG, AI memory, agentic AI, and real-time graph analytics. Cypher-compatible, built in C++.

$ npx skills add memgraph/memgraph
4.1K stars67 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
c++ai-agents
by memgraphQuick view
66

Puppeteer Sharp

VERIFIEDEXCELLENT · 100

Headless Chrome .NET API

$ npx skills add hardkoded/puppeteer-sharp
3.9K stars67 qualityClaude Code + Browser agents
High-confidence pick with strong adoption and healthy maintenance signals.
c#crawler
by hardkodedQuick view
67

LazyLLM

VERIFIEDEXCELLENT · 100

Easiest and laziest way for building multi-agent LLMs applications.

$ npx skills add LazyAGI/LazyLLM
3.8K stars67 qualityClaude Code + LangChain
High-confidence pick with strong adoption and healthy maintenance signals.
pythonai-agents
by LazyAGIQuick view
68

Feapder

VERIFIEDEXCELLENT · 100

🚀🚀🚀feapder is an easy to use, powerful crawler framework | feapder是一款上手简单,功能强大的Python爬虫框架。内置AirSpider、Spider、TaskSpider、BatchSpider四种爬虫解决不同场景的需求。且支持断点续爬、监控报警、浏览器渲染、海量数据去重等功能。更有功能强大的爬虫管理系统feaplat为其提供方便的部署及调度

$ npx skills add Boris-code/feapder
3.7K stars67 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
pythoncrawler
by Boris-codeQuick view
69

Toapi

VERIFIEDEXCELLENT · 100

Every web site provides APIs.

$ npx skills add elliotgao2/toapi
3.5K stars67 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
pythoncrawler
by elliotgao2Quick view
70

Cariddi

VERIFIEDEXCELLENT · 100

Take a list of domains, crawl urls and scan for endpoints, secrets, api keys, file extensions, tokens and more

$ npx skills add edoardottt/cariddi
3.4K stars66 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
gocrawler
by edoardotttQuick view
71

Acontext

VERIFIEDEXCELLENT · 100

Agent Skills as a Memory Layer

$ npx skills add memodb-io/Acontext
3.4K stars66 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
javascriptai-agents
by memodb-ioQuick view
72

Langwatch

VERIFIEDEXCELLENT · 100

The platform for LLM evaluations and AI agent testing

$ npx skills add langwatch/langwatch
3.3K stars66 qualityClaude Code + OpenAI Agents
High-confidence pick with strong adoption and healthy maintenance signals.
typescriptllmops
by langwatchQuick view
73

Datahaven

VERIFIEDEXCELLENT · 100

An EVM compatible Substrate chain, powered by StorageHub and secured by EigenLayer

$ npx skills add datahaven-xyz/datahaven
8.0K stars66 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
rustai-agents
by datahaven-xyzQuick view
74

Amazon Scraper

VERIFIEDEXCELLENT · 100

Free Trial Amazon Scraper API for extracting search, product, offer listing, reviews, question and answers, best sellers and sellers data.

$ npx skills add oxylabs/amazon-scraper
3.0K stars66 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
pythonweb-automation
by oxylabsQuick view
75

Awesome Web Scraping

VERIFIEDEXCELLENT · 99

List of libraries, tools and APIs for web scraping and data processing.

$ npx skills add lorien/awesome-web-scraping
7.9K stars66 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
makefileweb-automation
by lorienQuick view
76

Crawler

VERIFIEDEXCELLENT · 100

https://spatie.be/docs/crawler

$ npx skills add spatie/crawler
2.8K stars66 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
phpcrawler
by spatieQuick view
77

FinalRecon

VERIFIEDEXCELLENT · 100

All In One Web Recon

$ npx skills add thewhiteh4t/FinalRecon
2.8K stars66 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
pythoncrawler
by thewhiteh4tQuick view
78

QueryList

VERIFIEDEXCELLENT · 100

:spider: The progressive PHP crawler framework! 优雅的渐进式PHP采集框架。

$ npx skills add jae-jae/QueryList
2.7K stars66 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
phpcrawler
by jae-jaeQuick view
79

Llm Scraper

VERIFIEDEXCELLENT · 98

Turn any webpage into structured data using LLMs

$ npx skills add mishushakov/llm-scraper
6.7K stars66 qualityClaude Code + OpenAI Agents
High-confidence pick with strong adoption and healthy maintenance signals.
typescriptbrowser-automation
by mishushakovQuick view
80

Spider

VERIFIEDEXCELLENT · 99

Low latency web data collector

$ npx skills add spider-rs/spider
2.5K stars65 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
rustai-agents
by spider-rsQuick view
81

Hamilton

VERIFIEDEXCELLENT · 100

Apache Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage/tracing and metadata. Runs and scales everywhere python does.

$ npx skills add apache/hamilton
2.5K stars65 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
jupyter-notebookllmops
by apacheQuick view
82

Crawler Detect

VERIFIEDEXCELLENT · 100

🕷 CrawlerDetect is a PHP class for detecting bots/crawlers/spiders via the user agent

$ npx skills add JayBizzle/Crawler-Detect
2.4K stars65 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
phpcrawler
by JayBizzleQuick view
83

WechatSogou

VERIFIEDEXCELLENT · 97

基于搜狗微信搜索的微信公众号爬虫接口

$ npx skills add chyroc/WechatSogou
6.3K stars65 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
pythoncrawler
by chyrocQuick view
84

Photon

VERIFIEDEXCELLENT · 98

Incredibly fast crawler designed for OSINT.

$ npx skills add s0md3v/Photon
12.9K stars65 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
pythoncrawler
by s0md3vQuick view
85

Videodl

VERIFIEDEXCELLENT · 100

Videodl: A lightweight video downloader written in pure python. (轻量级视频下载器,优先高清无水印,支持抖音,快手,小红书,B站,TikTok,YouTube,FIFA+,优酷,腾讯,爱奇艺,1905电影网,乐视,芒果,咪咕,PPTV,搜狐,Facebook,Twitter,新浪微博,今日头条,网易公开课,全民K歌,CCTV央视频,酷狗音乐MV,新片场,知乎,百度贴吧,TED等海量流媒体平台)

$ npx skills add CharlesPikachu/videodl
2.1K stars65 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
pythoncrawler
by CharlesPikachuQuick view
86

Skycaiji

VERIFIEDEXCELLENT · 100

蓝天采集器是一款开源免费的爬虫系统,仅需点选编辑规则即可采集数据,可运行在本地、虚拟主机或云服务器中,几乎能采集所有类型的网页,无缝对接各类CMS建站程序,免登录实时发布数据,全自动无需人工干预!是网页大数据采集软件中完全跨平台的云端爬虫系统

$ npx skills add zorlan/skycaiji
2.1K stars65 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
phpcrawler
by zorlanQuick view
87

SCrawler

VERIFIEDEXCELLENT · 100

🏳️‍🌈 Media downloader from any sites, including Twitter, Reddit, Instagram, BlueSky, TikTok, Threads, Facebook, OnlyFans, YouTube, Pinterest, PornHub, XHamster, XVIDEOS, ThisVid etc.

$ npx skills add AAndyProgram/SCrawler
2.0K stars65 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
visual-basic-.netcrawler
by AAndyProgramQuick view
88

Gain

VERIFIEDEXCELLENT · 98

Web crawling framework based on asyncio.

$ npx skills add elliotgao2/gain
2.0K stars65 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
pythoncrawler
by elliotgao2Quick view
89

Crawlab

VERIFIEDEXCELLENT · 100

Distributed web crawler admin platform for spiders management regardless of languages and frameworks. 分布式爬虫管理平台,支持任何语言和框架

$ npx skills add crawlab-team/crawlab
12.2K stars65 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
gocrawler
by crawlab-teamQuick view
90

Neuron AI

VERIFIEDEXCELLENT · 100

The PHP Agentic Framework to build production-ready AI driven applications. Connect components (LLMs, vector DBs, memory) to agents that can interact with your data.

$ npx skills add neuron-core/neuron-ai
1.9K stars65 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
phpllm
by neuron-coreQuick view
91

Webmagic

VERIFIEDEXCELLENT · 98

A scalable web crawler framework for Java.

$ npx skills add code4craft/webmagic
11.7K stars65 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
javacrawler
by code4craftQuick view
92

RPA

VERIFIEDEXCELLENT · 99

Ui.Vision Open-Source RPA Software with Computer Vision, OCR, Anthropic Computer Use/LLM. Selenium IDE import/export.

$ npx skills add A9T9/RPA
1.9K stars65 qualityClaude Code + Browser agents
High-confidence pick with strong adoption and healthy maintenance signals.
javascriptbrowser-automation
by A9T9Quick view
93

Article Extractor

VERIFIEDEXCELLENT · 100

To extract main article from given URL with Node.js

$ npx skills add extractus/article-extractor
1.9K stars65 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
javascriptcrawler
by extractusQuick view
94

NewPipeExtractor

VERIFIEDEXCELLENT · 100

NewPipe's core library for extracting data from streaming sites

$ npx skills add TeamNewPipe/NewPipeExtractor
1.9K stars65 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
javacrawler
by TeamNewPipeQuick view
95

X Crawl

VERIFIEDEXCELLENT · 98

Flexible Node.js AI-assisted crawler library

$ npx skills add coder-hxl/x-crawl
1.9K stars65 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
typescriptcrawler
by coder-hxlQuick view
96

WaterCrawl

VERIFIEDEXCELLENT · 93

Transform Web Content into LLM-Ready Data

$ npx skills add watercrawl/WaterCrawl
1.8K stars65 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
typescriptcrawler
by watercrawlQuick view
97

Quivr

VERIFIEDEXCELLENT · 95

Opiniated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM: GPT4, Groq, Llama. Any Vectorstore: PGVector, Faiss. Any Files. Anyway you want.

$ npx skills add QuivrHQ/quivr
39.2K stars64 qualityClaude Code + OpenAI Agents
High-confidence pick with strong adoption and healthy maintenance signals.
pythonrag
by QuivrHQQuick view
98

Crawler Illegal Cases In China

VERIFIEDEXCELLENT · 97

Collection of China illegal cases about web crawler 本项目用来整理所有中国大陆爬虫开发者涉诉与违规相关的新闻、资料与法律法规。致力于帮助在中国大陆工作的爬虫行业从业者了解我国相关法律,避免触碰数据合规红线。

$ npx skills add hiddendevj/Crawler_Illegal_Cases_In_China
4.6K stars64 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
htmlcrawler
by hiddendevjQuick view
99

Adala

VERIFIEDEXCELLENT · 100

Adala: Autonomous DAta (Labeling) Agent framework

$ npx skills add HumanSignal/Adala
1.6K stars64 qualityClaude Code + OpenAI Agents
High-confidence pick with strong adoption and healthy maintenance signals.
pythonllm
by HumanSignalQuick view
100

Agently

VERIFIEDEXCELLENT · 100

[GenAI Application Development Framework] 🚀 Build GenAI application quick and easy 💬 Easy to interact with GenAI agent in code using structure data and chained-calls syntax 🧩 Use Event-Driven Flow *TriggerFlow* to manage complex GenAI working logic 🔀 Switch to any model without rewrite application code

$ npx skills add AgentEra/Agently
1.6K stars64 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
pythonllm
by AgentEraQuick view
101

Douyin

VERIFIEDEXCELLENT · 97

抖音爬虫——采集账号主页、喜欢、收藏、音乐原声、话题、搜索、合集、作品、关注、粉丝等公开数据。

$ npx skills add erma0/douyin
1.6K stars64 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
typescriptcrawler
by erma0Quick view
102

DotnetSpider

VERIFIEDEXCELLENT · 100

DotnetSpider, a .NET standard web crawling library. It is lightweight, efficient and fast high-level web crawling & scraping framework

$ npx skills add dotnetcore/DotnetSpider
4.1K stars64 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
c#crawler
by dotnetcoreQuick view
103

ScopeSentry

VERIFIEDEXCELLENT · 98

ScopeSentry-Cyberspace mapping, subdomain enumeration, port scanning, sensitive information discovery, vulnerability scanning, distributed nodes

$ npx skills add Autumn-27/ScopeSentry
1.5K stars64 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
gocrawler
by Autumn-27Quick view
104

Work Crawler

VERIFIEDEXCELLENT · 96

Download comics novels 小说漫画下载工具 小説漫画のダウンローダ 小說漫畫下載:腾讯漫画 大角虫漫画 有妖气 咪咕 SF漫画 哦漫画 看漫画 漫画柜 汗汗酷漫 動漫伊甸園 快看漫画 微博动漫 733动漫网 大古漫画网 漫画DB 無限動漫 動漫狂 卡推漫画 动漫之家 动漫屋 古风漫画网 36漫画网 亲亲漫画网 乙女漫画 webtoons 咚漫 ニコニコ静画 ComicWalker ヤングエースUP モアイ pixivコミック サイコミ;アルファポリス カクヨム ハーメルン 小説家になろう 起点中文网 八一中文网 顶点小说 落霞小说网 努努书坊 笔趣阁→epub.

$ npx skills add kanasimi/work_crawler
4.0K stars64 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
javascriptcrawler
by kanasimiQuick view
105

Scweet

VERIFIEDEXCELLENT · 100

Scrape tweets, profiles, followers and following from Twitter/X, no API key needed. Python library with smart multi-account pooling, proxy support and async.

$ npx skills add Altimis/Scweet
1.5K stars64 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
pythonweb-automation
by AltimisQuick view
106

Fscrawler

VERIFIEDEXCELLENT · 97

Elasticsearch File System Crawler (FS Crawler)

$ npx skills add dadoonet/fscrawler
1.4K stars64 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
javacrawler
by dadoonetQuick view
107

Skills

VERIFIEDEXCELLENT · 100

Give your AI the power to browse, scrape, and extract structured data from complex websites — with faster execution, lower cost, and more reliable results.

$ npx skills add browser-act/skills
1.4K stars64 qualityClaude Code + Cursor
High-confidence pick with strong adoption and healthy maintenance signals.
pythonweb-automation
by browser-actQuick view
108

OpenWPM

VERIFIEDEXCELLENT · 92

A web privacy measurement framework

$ npx skills add openwpm/OpenWPM
1.4K stars64 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
pythoncrawler
by openwpmQuick view
109

Agentql

VERIFIEDEXCELLENT · 100

AgentQL is a suite of tools for connecting your AI to the web. Featuring a query language and Playwright integrations for interacting with elements and extracting data quickly, precisely, and at scale. Includes REST API, Python and JavaScript SDKs, browser debugger.

$ npx skills add tinyfish-io/agentql
1.4K stars64 qualityClaude Code + Browser agents
High-confidence pick with strong adoption and healthy maintenance signals.
pythonweb-automation
by tinyfish-ioQuick view
110

Intellagent

VERIFIEDEXCELLENT · 100

A framework for comprehensive diagnosis and optimization of agents using simulated, realistic synthetic interactions

$ npx skills add plurai-ai/intellagent
1.2K stars63 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
pythonllmops
by plurai-aiQuick view
111

Decodo

VERIFIEDEXCELLENT · 100

HTTP(S)/SOCKS5 rotating residential proxies - code examples & general information.

$ npx skills add Decodo/Decodo
1.2K stars63 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
javaweb-automation
by DecodoQuick view
112

User Agents

VERIFIEDEXCELLENT · 97

A JavaScript library for generating random user agents with data that's updated daily.

$ npx skills add intoli/user-agents
1.2K stars63 qualityClaude Code + Browser agents
High-confidence pick with strong adoption and healthy maintenance signals.
typescriptbrowser-automation
by intoliQuick view
113

Faster Than Requests

VERIFIEDEXCELLENT · 96

Faster requests on Python 3

$ npx skills add juancarlospaco/faster-than-requests
1.1K stars63 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
nimweb-automation
by juancarlospacoQuick view
114

Google Play Scraper

VERIFIEDEXCELLENT · 94

Node.js scraper to get data from Google Play

$ npx skills add facundoolano/google-play-scraper
2.9K stars63 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
javascriptcrawler
by facundoolanoQuick view
115

Second Brain AI Assistant Course

VERIFIEDEXCELLENT · 100

Learn to build your Second Brain AI assistant with LLMs, agents, RAG, fine-tuning, LLMOps and AI systems techniques.

$ npx skills add decodingai-magazine/second-brain-ai-assistant-course
2.7K stars63 qualityClaude Code + OpenAI Agents
High-confidence pick with strong adoption and healthy maintenance signals.
jupyter-notebookllmops
by decodingai-magazineQuick view
116

News Please

VERIFIEDEXCELLENT · 99

news-please - an integrated web crawler and information extractor for news that just works

$ npx skills add fhamborg/news-please
2.5K stars62 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
pythoncrawler
by fhamborgQuick view
117

Goclone

VERIFIEDEXCELLENT · 99

Website Cloner - Utilizes powerful Go routines to clone websites to your computer within seconds.

$ npx skills add goclone-dev/goclone
2.1K stars62 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
gocrawler
by goclone-devQuick view
118

Diskover Community

VERIFIEDEXCELLENT · 98

Diskover Community Edition - Open source file indexer, file search engine and data management and analytics powered by Elasticsearch

$ npx skills add diskoverdata/diskover-community
1.8K stars61 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
phpcrawler
by diskoverdataQuick view
119

Examples Of Web Crawlers

VERIFIEDEXCELLENT · 97

一些非常有趣的python爬虫例子,对新手比较友好,主要爬取淘宝、天猫、微信、微信读书、豆瓣、QQ等网站。(Some interesting examples of python crawlers that are friendly to beginners. )

$ npx skills add shengqiangzhang/examples-of-web-crawlers
14.6K stars61 qualityClaude Code + Browser agents
High-confidence pick with strong adoption and healthy maintenance signals.
htmlcrawler
by shengqiangzhangQuick view
120

Mianshiya

VERIFIEDEXCELLENT · 99

持续维护的企业面试题库网站,帮你拿到满意 offer!⭐️ 2026年最新Java面试题、前端面试题、AI大模型面试题、AI Agent面试题、RAG面试题、C++面试题、Go面试题、Python面试题、测试面试题、运维面试题、后端面试题、操作系统面试题、计算机网络面试题、Redis面试题、MySQL数据库面试题、算法面试题、Spring面试题、JVM面试题、Java并发面试题、Linux面试题、LLM面试题、Prompt工程面试题、系统设计面试题等1万多道高频程序员求职必备八股文。面试刷题就选面试鸭 💎 React 前端 + Node 后端 + 云开发全栈项目 by 程序员鱼皮

$ npx skills add liyupi/mianshiya
5.5K stars61 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
typescriptai-agents
by liyupiQuick view
121

Sperm

VERIFIEDEXCELLENT · 91

浏览过的精彩逆向文章汇总,值得一看

$ npx skills add darbra/sperm
1.4K stars61 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
crawler
by darbraQuick view
122

Scrape Google Python

VERIFIEDEXCELLENT · 91

In this tutorial, we showcase how to scrape public Google data with Python and Oxylabs API.

$ npx skills add oxylabs/scrape-google-python
1.3K stars60 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
web-automation
by oxylabsQuick view
123

MyGPTReader

VERIFIEDEXCELLENT · 98

A community-driven way to read and chat with AI bots - powered by chatGPT.

$ npx skills add myreader-io/myGPTReader
4.4K stars60 qualityClaude Code + OpenAI Agents
High-confidence pick with strong adoption and healthy maintenance signals.
pythoncrawler
by myreader-ioQuick view
124

TorBot

VERIFIEDEXCELLENT · 87

Dark Web OSINT Tool

$ npx skills add DedSecInside/TorBot
4.1K stars60 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
pythoncrawler
by DedSecInsideQuick view
125

Oxylabs AI Studio Py

VERIFIEDEXCELLENT · 96

Structured data gathering from any website using AI-powered scraper, crawler, and browser automation. Scraping and crawling with natural language prompts. Equip your LLM agents with fresh data. AI Studio python SDK for intelligent web data gathering.

$ npx skills add oxylabs/oxylabs-ai-studio-py
2.9K stars59 qualityClaude Code + Browser agents
High-confidence pick with strong adoption and healthy maintenance signals.
pythonweb-automation
by oxylabsQuick view
126

How To Scrape Google Trends

VERIFIEDEXCELLENT · 90

Learn step-by-step how to scrape Google Trends data and make a result comparison using Python and Oxylabs SERP API. Extract keywords, their popularity, breakdown by region, related queries, and more.

$ npx skills add oxylabs/how-to-scrape-google-trends
2.6K stars59 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
pythonweb-automation
by oxylabsQuick view
127

Gecco

VERIFIEDEXCELLENT · 89

Easy to use lightweight web crawler(易用的轻量化网络爬虫)

$ npx skills add xtuhcy/gecco
2.5K stars59 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
javacrawler
by xtuhcyQuick view
128

Deep Searcher

VERIFIEDEXCELLENT · 92

Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.

$ npx skills add zilliztech/deep-searcher
7.8K stars58 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
pythonrag
by zilliztechQuick view
129

Node Crawler

VERIFIEDEXCELLENT · 92

Web Crawler/Spider for NodeJS + server-side jQuery ;-)

$ npx skills add bda-research/node-crawler
6.8K stars58 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
typescriptcrawler
by bda-researchQuick view
130

Trafilatura

VERIFIEDEXCELLENT · 91

Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML

$ npx skills add adbar/trafilatura
6.0K stars57 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
pythonweb-automation
by adbarQuick view
131

Superduper

VERIFIEDEXCELLENT · 91

Superduper: End-to-end framework for building custom AI applications and agents.

$ npx skills add superduper-io/superduper
5.3K stars57 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
pythonrag
by superduper-ioQuick view
132

XHS Spider

VERIFIEDEXCELLENT · 93

小红书数据采集、网站图片、视频资源批量下载工具,颜值超高的数据采集工具(批量下载,视频提取,图片)Telegram:https://t.me/+ZtLSwuIKTo44MDY1

$ npx skills add xisuo67/XHS-Spider
1.4K stars57 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
crawler
by xisuo67Quick view
133

Spider Flow

VERIFIEDSTRONG · 77

新一代爬虫平台,以图形化方式定义爬虫流程,不写代码即可完成爬虫。

$ npx skills add ssssssss-team/spider-flow
11.3K stars57 qualityClaude Code
Solid option that is likely worth shortlisting for production workflows.Check: Repository looks stale
javacrawler
by ssssssss-teamQuick view
134

Finance Skills

EXCELLENT · 100

A collection of agent skills for financial analysis and trading. Includes options payoff charts, stock correlation analysis, yfinance data fetching, Discord/Telegram/Twitter financial research, and generative UI for interactive visualizations.

$ npx skills add himself65/finance-skills
2.5K stars56 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
claudeclaude-code
by himself65Quick view
135

Kimuraframework

VERIFIEDEXCELLENT · 92

Write web scrapers in Ruby using a clean, AI-assisted DSL. Kimurai uses AI to figure out where the data lives, then caches the selectors and scrapes with pure Ruby. Get the intelligence of an LLM without the per-request latency or token costs.

$ npx skills add vifreefly/kimuraframework
1.1K stars56 qualityClaude Code + Browser agents
High-confidence pick with strong adoption and healthy maintenance signals.
rubyweb-automation
by vifreeflyQuick view
136

Scylla

VERIFIEDEXCELLENT · 89

Intelligent proxy pool for Humans™ to extract content from the internet and build your own Large Language Models in this new AI era

$ npx skills add MikeChongCan/scylla
4.0K stars56 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
pythoncrawler
by MikeChongCanQuick view
137

Mdcx

VERIFIEDSTRONG · 83

Movie metadata scraper

$ npx skills add sqzw-x/mdcx
3.6K stars56 qualityClaude Code
Solid option that is likely worth shortlisting for production workflows.
pythoncrawler
by sqzw-xQuick view
138

The process of extracting product data from Amazon using Python, including titles, ratings, prices, images, and descriptions.

$ npx skills add oxylabs/how-to-scrape-amazon-product-data
2.9K stars55 qualityClaude Code
Solid option that is likely worth shortlisting for production workflows.
web-automation
by oxylabsQuick view
139

Avbook

VERIFIEDSTRONG · 75

AV 电影管理系统, avmoo , javbus , javlibrary 爬虫,线上 AV 影片图书馆,AV 磁力链接数据库,Japanese Adult Video Library,Adult Video Magnet Links - Japanese Adult Video Database

$ npx skills add guyueyingmu/avbook
10.0K stars55 qualityClaude Code
Solid option that is likely worth shortlisting for production workflows.Check: Repository looks stale
phpcrawler
by guyueyingmuQuick view
140

Scrapfly Scrapers

STRONG · 82

Scalable Python web scraping scripts for +40 popular domains

$ npx skills add scrapfly/scrapfly-scrapers
983 stars55 qualityClaude Code
Solid option that is likely worth shortlisting for production workflows.
pythonweb-automation
by scrapflyQuick view
141

Openagent

STRONG · 81

AI Agent Development Platform - Supports multiple models (OpenAI/DeepSeek/Wenxin/Tongyi), knowledge base management, workflow automation, and enterprise-grade security. Built with Flask + Vue3 + LangChain, featuring one-click Docker deployment.

$ npx skills add Haohao-end/openagent
789 stars54 qualityClaude Code + OpenAI Agents
Solid option that is likely worth shortlisting for production workflows.
pythonllmops
by Haohao-endQuick view
142

Free Proxy List

EXCELLENT · 86

Free Proxy List ✅​🚀 HTTP, HTTPS, SOCKS4 & SOCKS5 | Updated every 5 minutes | Strict SSL, zero MITM, multi-country

$ npx skills add databay-labs/free-proxy-list
755 stars54 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
web-automation
by databay-labsQuick view
143

Awesome Crawler

VERIFIEDSTRONG · 79

A collection of awesome web crawler,spider in different languages

$ npx skills add BruceDone/awesome-crawler
7.2K stars54 qualityClaude Code
Solid option that is likely worth shortlisting for production workflows.Check: Repository looks stale
crawler
by BruceDoneQuick view
144

Swiftide

EXCELLENT · 86

Fast, streaming indexing, query, and agentic LLM applications in Rust

$ npx skills add bosun-ai/swiftide
700 stars54 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
rustllmops
by bosun-aiQuick view
145

Powermem

STRONG · 81

PowerMem: Your AI-Powered Long-Term Memory — Accurate, Agile, Affordable. Also friendly support for the OpenClaw Memory Plugin.

$ npx skills add oceanbase/powermem
674 stars54 qualityClaude Code
Solid option that is likely worth shortlisting for production workflows.
pythonai-agents
by oceanbaseQuick view
146

Dac

EXCELLENT · 86

DaC is a dashboard-as-code tool. Build interactive dashboards using YAML and JSX. Built-in semantic layer. Get your agents to build standardized, reviewable dashboards.

$ npx skills add bruin-data/dac
673 stars54 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
goai-agents
by bruin-dataQuick view
147

Rags

VERIFIEDSTRONG · 79

Build ChatGPT over your data, all with natural language

$ npx skills add run-llama/rags
6.5K stars53 qualityClaude Code + OpenAI Agents
Solid option that is likely worth shortlisting for production workflows.Check: Repository looks stale
pythonrag
by run-llamaQuick view
148

Self Host N8n On Gcr

EXCELLENT · 85

Self-host n8n on Google Cloud without the subscription fees or server headaches - because your automation workflows shouldn't cost more than your coffee budget

$ npx skills add datawranglerai/self-host-n8n-on-gcr
605 stars53 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
hclai-agents
by datawrangleraiQuick view
149

Grab Site

VERIFIEDSTRONG · 80

The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns

$ npx skills add ArchiveTeam/grab-site
1.6K stars53 qualityClaude Code
Solid option that is likely worth shortlisting for production workflows.
pythoncrawler
by ArchiveTeamQuick view
150

Headless Chrome Crawler

VERIFIEDSTRONG · 72

Distributed crawler powered by Headless Chrome

$ npx skills add yujiosaka/headless-chrome-crawler
5.6K stars53 qualityClaude Code + Browser agents
Solid option that is likely worth shortlisting for production workflows.Check: Repository looks stale
javascriptcrawler
by yujiosakaQuick view
151

Haipproxy

VERIFIEDSTRONG · 78

:sparkling_heart: High available distributed ip proxy pool, powerd by Scrapy and Redis

$ npx skills add SpiderClub/haipproxy
5.5K stars53 qualityClaude Code
Solid option that is likely worth shortlisting for production workflows.Check: Repository looks stale
pythoncrawler
by SpiderClubQuick view
152

ECommerceCrawlers

VERIFIEDSTRONG · 78

实战🐍多种网站、电商数据爬虫🕷。包含🕸:淘宝商品、微信公众号、大众点评、企查查、招聘网站、闲鱼、阿里任务、博客园、微博、百度贴吧、豆瓣电影、包图网、全景网、豆瓣音乐、某省药监局、搜狐新闻、机器学习文本采集、fofa资产采集、汽车之家、国家统计局、百度关键词收录数、蜘蛛泛目录、今日头条、豆瓣影评、携程、小米应用商店、安居客、途家民宿❤️❤️❤️。微信爬虫展示项目:

$ npx skills add DropsDevopsOrg/ECommerceCrawlers
5.5K stars53 qualityClaude Code
Solid option that is likely worth shortlisting for production workflows.Check: Repository looks stale
pythoncrawler
by DropsDevopsOrgQuick view
153

Jekyll

STRONG · 80

Jekyll-based static site for The Programming Historian

$ npx skills add programminghistorian/jekyll
544 stars53 qualityClaude Code
Solid option that is likely worth shortlisting for production workflows.
htmlweb-automation
by programminghistorianQuick view
154

Second Brain

STRONG · 80

Second Brain is an agentic framework that acts as an operating system, using local file intelligence, workflow automation, and LLMs to complete tasks and communicate over multiple modalities and messaging platforms.

$ npx skills add henrydaum/second-brain
532 stars53 qualityClaude Code
Solid option that is likely worth shortlisting for production workflows.
pythonai-agents
by henrydaumQuick view
155

Reader

STRONG · 84

Open source web infrastructure for AI. Scrape, crawl, and automate the web, clean markdown, browser sessions, ready for your agents.

$ npx skills add vakra-dev/reader
529 stars53 qualityClaude Code + Browser agents
Solid option that is likely worth shortlisting for production workflows.
typescriptai-agents
by vakra-devQuick view
156

🌟 A curated collection of free, high quality AI tools 🤖, APIs 🔗, datasets 📊, and learning resources 📚 covering machine learning 🧠, deep learning 🧩, generative AI 🎨, NLP 💬, and data science 📈. Designed to help developers 👩‍💻, researchers 🔬, and creators ✨ explore and build with AI faster ⚡.

$ npx skills add CelaDaniel/free-ai-resources-x
523 stars53 qualityClaude Code
Solid option that is likely worth shortlisting for production workflows.
ai-agents
by CelaDanielQuick view
157

ProxyBroker

VERIFIEDSTRONG · 77

Proxy [Finder | Checker | Server]. HTTP(S) & SOCKS :performing_arts:

$ npx skills add constverum/ProxyBroker
4.2K stars52 qualityClaude Code
Solid option that is likely worth shortlisting for production workflows.Check: Repository looks stale
pythoncrawler
by constverumQuick view
158

Proxypool

VERIFIEDSTRONG · 76

Automatically crawls proxy nodes on the public internet, de-duplicates and tests for usability and then provides a list of nodes

$ npx skills add zu1k/proxypool
4.0K stars52 qualityClaude Code
Solid option that is likely worth shortlisting for production workflows.Check: Repository looks stale
gocrawler
by zu1kQuick view
159

How To Scrape Google Finance

VERIFIEDSTRONG · 78

Use Web Scraper API to extract data from Google Finance, including stock titles, pricing, and price changes in percentages.

$ npx skills add oxylabs/how-to-scrape-google-finance
1.0K stars52 qualityClaude Code
Solid option that is likely worth shortlisting for production workflows.
pythonweb-automation
by oxylabsQuick view
160

RED HAWK

VERIFIEDSTRONG · 76

All in one tool for Information Gathering, Vulnerability Scanning and Crawling. A must have tool for all penetration testers

$ npx skills add Tuhinshubhra/RED_HAWK
3.7K stars52 qualityClaude Code
Solid option that is likely worth shortlisting for production workflows.Check: Repository looks stale
phpcrawler
by TuhinshubhraQuick view
161

Python3 Spider

VERIFIEDSTRONG · 71

Python爬虫实战 - 模拟登陆各大网站 包含但不限于:滑块验证、拼多多、美团、百度、bilibili、大众点评、淘宝,如果喜欢请start ❤️

$ npx skills add wkunzhi/Python3-Spider
3.4K stars51 qualityClaude Code
Solid option that is likely worth shortlisting for production workflows.Check: Repository looks stale
pythoncrawler
by wkunzhiQuick view
162

Crawlergo

VERIFIEDSTRONG · 75

A powerful browser crawler for web vulnerability scanners

$ npx skills add Qianlitp/crawlergo
3.0K stars51 qualityClaude Code + Browser agents
Solid option that is likely worth shortlisting for production workflows.Check: Repository looks stale
gocrawler
by QianlitpQuick view
163

Gospider

VERIFIEDPROMISING · 69

Gospider - Fast web spider written in Go

$ npx skills add jaeles-project/gospider
3.0K stars51 qualityClaude Code
Useful candidate, but compare it with alternatives before adopting.Check: Repository looks stale
gocrawler
by jaeles-projectQuick view
164

DecryptLogin

VERIFIEDSTRONG · 75

DecryptLogin: APIs for loginning some websites by using requests.

$ npx skills add CharlesPikachu/DecryptLogin
2.9K stars51 qualityClaude Code
Solid option that is likely worth shortlisting for production workflows.Check: Repository looks stale
pythoncrawler
by CharlesPikachuQuick view
165

Owllook

VERIFIEDPROMISING · 69

owllook-小说搜索引擎

$ npx skills add howie6879/owllook
2.8K stars51 qualityClaude Code
Useful candidate, but compare it with alternatives before adopting.Check: Repository looks stale
pythoncrawler
by howie6879Quick view
166

GoogleScraper

VERIFIEDSTRONG · 75

A Python module to scrape several search engines (like Google, Yandex, Bing, Duckduckgo, ...). Including asynchronous networking support.

$ npx skills add NikolaiT/GoogleScraper
2.8K stars51 qualityClaude Code
Solid option that is likely worth shortlisting for production workflows.Check: Repository looks stale
htmlcrawler
by NikolaiTQuick view
167

Geziyor

VERIFIEDSTRONG · 75

Geziyor, blazing fast web crawling & scraping framework for Go. Supports JS rendering.

$ npx skills add geziyor/geziyor
2.8K stars51 qualityClaude Code
Solid option that is likely worth shortlisting for production workflows.Check: Repository looks stale
gocrawler
by geziyorQuick view
168

Leaked GPTs

VERIFIEDPROMISING · 69

Leaked GPTs Prompts Bypass the 25 message limit or to try out GPTs without a Plus subscription.

$ npx skills add friuns2/Leaked-GPTs
2.4K stars50 qualityClaude Code + OpenAI Agents
Useful candidate, but compare it with alternatives before adopting.Check: Repository looks stale
pythoncrawler
by friuns2Quick view
169

Abot

VERIFIEDSTRONG · 74

Cross Platform C# web crawler framework built for speed and flexibility. Please star this project! +1.

$ npx skills add sjdirect/abot
2.3K stars50 qualityClaude Code
Solid option that is likely worth shortlisting for production workflows.Check: Repository looks stale
c#crawler
by sjdirectQuick view
170

LLMStack

VERIFIEDPROMISING · 69

No-code multi-agent framework to build LLM Agents, workflows and applications with your data

$ npx skills add trypromptly/LLMStack
2.3K stars50 qualityClaude Code
Useful candidate, but compare it with alternatives before adopting.Check: Repository looks stale
pythonllm
by trypromptlyQuick view
171

Vulnx

VERIFIEDSTRONG · 74

vulnx 🕷️ an intelligent Bot, Shell can achieve automatic injection, and help researchers detect security vulnerabilities CMS system. It can perform a quick CMS security detection, information collection (including sub-domain name, ip address, country information, organizational information and time zone, etc.) and vulnerability scanning.

$ npx skills add anouarbensaad/vulnx
2.1K stars50 qualityClaude Code
Solid option that is likely worth shortlisting for production workflows.Check: Repository looks stale
pythoncrawler
by anouarbensaadQuick view
172

Fetch user's data across social media

$ npx skills add shaikhsajid1111/social-media-profile-scrapers
553 stars50 qualityClaude Code
Solid option that is likely worth shortlisting for production workflows.
pythonweb-automation
by shaikhsajid1111Quick view
173

Gocrawl

VERIFIEDPROMISING · 67

Polite, slim and concurrent web crawler.

$ npx skills add PuerkitoBio/gocrawl
2.1K stars50 qualityClaude Code
Useful candidate, but compare it with alternatives before adopting.Check: Repository looks stale
gocrawler
by PuerkitoBioQuick view
174

Dirhunt

VERIFIEDPROMISING · 67

Find web directories without bruteforce

$ npx skills add Nekmo/dirhunt
2.0K stars50 qualityClaude Code
Useful candidate, but compare it with alternatives before adopting.Check: Repository looks stale
pythoncrawler
by NekmoQuick view
175

LxSpider

VERIFIEDSTRONG · 73

爬虫案例合集。包括但不限于《淘宝、京东、天猫、豆瓣、抖音、快手、微博、微信、阿里、头条、pdd、优酷、爱奇艺、携程、12306、58、搜狐、各种指数、维普万方、Zlibraty、Oalib、小说、招标网、采购网、小红书、大众点评、推特、脉脉、知乎》

$ npx skills add lixi5338619/lxSpider
1.9K stars50 qualityClaude Code
Solid option that is likely worth shortlisting for production workflows.Check: Repository looks stale
pythoncrawler
by lixi5338619Quick view
176

Ast Hook For Js RE

VERIFIEDPROMISING · 62

浏览器内存漫游解决方案(探索中...)

$ npx skills add JSREI/ast-hook-for-js-RE
1.9K stars50 qualityClaude Code
Useful candidate, but compare it with alternatives before adopting.Check: Repository looks stale
javascriptcrawler
by JSREIQuick view
177

BT Btt

VERIFIEDPROMISING · 67

磁力網站U3C3介紹以及域名更新

$ npx skills add u3c3/BT-btt
1.8K stars50 qualityClaude Code
Useful candidate, but compare it with alternatives before adopting.Check: Repository looks stale
crawler
by u3c3Quick view
178

PSpider

VERIFIEDPROMISING · 67

简单易用的Python爬虫框架,QQ交流群:597510560

$ npx skills add xianhu/PSpider
1.8K stars50 qualityClaude Code
Useful candidate, but compare it with alternatives before adopting.Check: Repository looks stale
pythoncrawler
by xianhuQuick view
179

Go Spider

VERIFIEDSTRONG · 73

[爬虫框架 (golang)] An awesome Go concurrent Crawler(spider) framework. The crawler is flexible and modular. It can be expanded to an Individualized crawler easily or you can use the default crawl components only.

$ npx skills add hu17889/go_spider
1.8K stars50 qualityClaude Code
Solid option that is likely worth shortlisting for production workflows.Check: Repository looks stale
gocrawler
by hu17889Quick view
180

Ruia

VERIFIEDSTRONG · 73

Async Python 3.6+ web scraping micro-framework based on asyncio

$ npx skills add howie6879/ruia
1.7K stars49 qualityClaude Code
Solid option that is likely worth shortlisting for production workflows.Check: Repository looks stale
pythoncrawler
by howie6879Quick view
181

AutoCrawler

VERIFIEDSTRONG · 73

Google, Naver multiprocess image web crawler (Selenium)

$ npx skills add YoongiKim/AutoCrawler
1.7K stars49 qualityClaude Code + Browser agents
Solid option that is likely worth shortlisting for production workflows.Check: Repository looks stale
pythoncrawler
by YoongiKimQuick view
182

DAT8

VERIFIEDPROMISING · 67

General Assembly's 2015 Data Science course in Washington, DC

$ npx skills add justmarkham/DAT8
1.6K stars49 qualityClaude Code
Useful candidate, but compare it with alternatives before adopting.Check: Repository looks stale
jupyter-notebookweb-automation
by justmarkhamQuick view
183

Spider Collection

VERIFIEDSTRONG · 72

python爬虫,目前库存:网易云音乐歌曲爬取,B站视频爬取,知乎问答爬取,壁纸爬取,xvideos视频爬取,有声书爬取,微博爬虫,安居客信息爬取+数据可视化,哔哩哔哩视频封面提取器,ip代理池封装,知乎百万级用户爬虫+数据分析,github用户爬虫

$ npx skills add srx-2000/spider_collection
1.6K stars49 qualityClaude Code
Solid option that is likely worth shortlisting for production workflows.Check: Repository looks stale
pythoncrawler
by srx-2000Quick view
184

Mlscraper

VERIFIEDPROMISING · 67

🤖 Scrape data from HTML websites automatically by just providing examples

$ npx skills add lorey/mlscraper
1.4K stars49 qualityClaude Code
Useful candidate, but compare it with alternatives before adopting.Check: Repository looks stale
pythoncrawler
by loreyQuick view
185

Rebrowser Patches

VERIFIEDPROMISING · 67

Collection of patches for puppeteer and playwright to avoid automation detection and leaks. Helps to avoid Cloudflare and DataDome CAPTCHA pages. Easy to patch/unpatch, can be enabled/disabled on demand.

$ npx skills add rebrowser/rebrowser-patches
1.4K stars49 qualityClaude Code + Browser agents
Useful candidate, but compare it with alternatives before adopting.Check: Repository looks stale
javascriptweb-automation
by rebrowserQuick view
186

Scrapecraft

STRONG · 75

🤖 AI-powered web scraping editor with visual workflow builder. Build, test & deploy web scrapers using natural language. Powered by ScrapeGraphAI & LangGraph.

$ npx skills add ScrapeGraphAI/scrapecraft
641 stars46 qualityClaude Code
Solid option that is likely worth shortlisting for production workflows.
pythonweb-automation
by ScrapeGraphAIQuick view
187

Web Scraping

PROMISING · 56

Detailed web scraping tutorials for dummies with financial data crawlers on Reddit WallStreetBets, CME (both options and futures), US Treasury, CFTC, LME, MacroTrends, SHFE and alternative data crawlers on Tomtom, BBC, Wall Street Journal, Al Jazeera, Reuters, Financial Times, Bloomberg, CNN, Fortune, The Economist

$ npx skills add je-suis-tm/web-scraping
880 stars39 qualityClaude Code
Useful candidate, but compare it with alternatives before adopting.Check: Repository looks stale
pythonweb-automation
by je-suis-tmQuick view
188

NeumAI

PROMISING · 56

Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.

$ npx skills add NeumTry/NeumAI
866 stars39 qualityClaude Code + OpenAI Agents
Useful candidate, but compare it with alternatives before adopting.Check: Repository looks stale
pythonllmops
by NeumTryQuick view
189

Till

PROMISING · 55

DataHen Till is a companion tool to your existing web scraper that instantly makes it scalable, maintainable, and more unblockable, with minimal code changes on your scraper. Integrates with any scraper in 5 minutes.

$ npx skills add DataHenHQ/till
815 stars39 qualityClaude Code
Useful candidate, but compare it with alternatives before adopting.Check: Repository looks stale
goweb-automation
by DataHenHQQuick view
190

Uscrapper

PROMISING · 55

Uscrapper Vanta: Dive deeper into the web with this powerful open-source tool. Extract valuable insights with ease and efficiency, from both surface and deep web sources. Empower your data mining and analysis with Vanta's advanced capabilities. Fast, reliable, and user-friendly, Uscrapper Vanta is the ultimate choice for researchers and analysts.

$ npx skills add z0m31en7/Uscrapper
778 stars39 qualityClaude Code
Useful candidate, but compare it with alternatives before adopting.Check: Repository looks stale
pythonweb-automation
by z0m31en7Quick view
191

Complete-Life-Cycle-of-a-Data-Science-Project

$ npx skills add achuthasubhash/Complete-Life-Cycle-of-a-Data-Science-Project
641 stars38 qualityClaude Code
Inspect the repository carefully before adding it to an agent workflow.Check: Repository looks stale
web-automation
by achuthasubhashQuick view
192

Scrape Linkedin Selenium

NEEDS REVIEW · 53

`scrape_linkedin` is a python package that allows you to scrape personal LinkedIn profiles & company pages - turning the data into structured json.

$ npx skills add austinoboyle/scrape-linkedin-selenium
529 stars38 qualityClaude Code + Browser agents
Inspect the repository carefully before adding it to an agent workflow.Check: Repository looks stale
htmlweb-automation
by austinoboyleQuick view
193

Continuous Eval

NEEDS REVIEW · 53

Data-Driven Evaluation for LLM-Powered Applications

$ npx skills add relari-ai/continuous-eval
516 stars38 qualityClaude Code
Inspect the repository carefully before adding it to an agent workflow.Check: Repository looks stale
pythonllmops
by relari-aiQuick view
194

Query crypto newsflashes, articles, and on-chain market data via BlockBeats Pro API. Covers 1,500+ information sources including AI-driven insights, Hyperliquid on-chain data, Polymarket analytics. Features market overview, capital flow analysis, macro environment assessment, derivatives analysis, and keyword search.

$ npx skills add https://clawhub.ai/BlockBeatsOfficial/blockbeats-skill
1 stars29 quality99 installs
High-confidence pick with strong adoption and healthy maintenance signals.Check: Low GitHub adoption signal
curlclawdis
by BlockBeatsOfficialQuick view