A collection of awesome web crawler,spider in different languages
$ npx skills add BruceDone/awesome-crawlerAlternatives
Compare similar skills by workflow fit, trust score, quality, GitHub adoption, maintenance, and install readiness.
Current skill
A versatile Ruby web spidering library that can spider a site, multiple domains, certain links or infinitely. Spidr is designed to be fast and easy to use.
A collection of awesome web crawler,spider in different languages
$ npx skills add BruceDone/awesome-crawlerWrite web scrapers in Ruby using a clean, AI-assisted DSL. Kimurai uses AI to figure out where the data lives, then caches the selectors and scrapes with pure Ruby. Get the intelligence of an LLM without the per-request latency or token costs.
$ npx skills add vifreefly/kimuraframeworkElegant Scraper and Crawler Framework for Golang
$ npx skills add gocolly/collyLightweight Ruby web crawler/scraper with an elegant DSL which extracts structured data from pages.
$ npx skills add felipecsl/wombatEvery web site provides APIs.
$ npx skills add elliotgao2/toapi:spider: The progressive PHP crawler framework! 优雅的渐进式PHP采集框架。
$ npx skills add jae-jae/QueryList新一代爬虫平台,以图形化方式定义爬虫流程,不写代码即可完成爬虫。
$ npx skills add ssssssss-team/spider-flow👾 Fast and simple video download library and CLI tool written in Go
$ npx skills add iawia002/luxPython ProxyPool for web spider
$ npx skills add jhao104/proxy_poolnewspaper3k is a news, full-text, and article metadata extraction in Python 3. Advanced docs:
$ npx skills add codelucas/newspaperDistributed web crawler admin platform for spiders management regardless of languages and frameworks. 分布式爬虫管理平台,支持任何语言和框架
$ npx skills add crawlab-team/crawlab一些非常有趣的python爬虫例子,对新手比较友好,主要爬取淘宝、天猫、微信、微信读书、豆瓣、QQ等网站。(Some interesting examples of python crawlers that are friendly to beginners. )
$ npx skills add shengqiangzhang/examples-of-web-crawlersCrawly, a high-level web crawling & scraping framework for Elixir.
$ npx skills add elixir-crawly/crawlyGeziyor, blazing fast web crawling & scraping framework for Go. Supports JS rendering.
$ npx skills add geziyor/geziyorWeb Crawler/Spider for NodeJS + server-side jQuery ;-)
$ npx skills add bda-research/node-crawlerA lightweight web crawler framework.(Java爬虫框架)
$ npx skills add xuxueli/xxl-crawlerHow to choose
Use an alternative when it has a clearer install path, higher trust score, fresher maintenance, or better platform fit for your current agent stack. Keep Spidr if it already passes your workflow test and repository review.
Next step
Open the compare page, test the install commands in a sandbox, and check each repository before using a skill in production.