CrawlProject
python爬虫项目合集,从基础到js逆向,包含基础篇、自动化篇、进阶篇以及验证码篇。案例涵盖各大网站(xhs douyin weibo ins boss job,jd...),你将会学到有关爬虫以及反爬虫、自动化和验证码的各方面知识
Supply asset profile
Coding and developer agents
Code review, repo analysis, testing, CI, GitHub, DevOps, and developer workflow skills.
Scenario
Testing and QA
I need my agent to test a web app, reproduce bugs, and verify fixes.
Agent fit
Claude Code + Browser agents + CLI
Codex, Claude Code, Cursor, CLI, or custom agents.
Install
Ready
npx skills add xishandong/crawlProject
Maintenance
stale
2y since push
Risk
Needs review
License is unclear
GitHub quality
1.7K
68/100 quality · 79/100 trust
Coverage tags
Review notes
License is unclear · Repository appears stale
Agent adoption scorecard
Trust, audit, and install readiness at a glance
These scores combine public repository metadata, OpenAgentSkill review signals, maintenance freshness, and install readiness. They are a shortlist signal, not a replacement for human review.
Quality
PromisingUseful candidate, but compare it with alternatives before adopting.
Trust
Strong shortlistGood trust signals with a few areas worth checking before rollout.
Audit
Needs reviewInstall readiness, security metadata, maintenance, and adoption risk.
Agent safety v2
57/100 · Review before install
Sparse or mixed signals. Useful for discovery, but not for autonomous installation.
Test manually in an isolated workspace and compare against safer alternatives.
medium
Browser automation
Skill may drive a browser or interact with web pages.
medium
Network access
Skill likely fetches remote pages, APIs, repositories, or external services.
- License is unclear
Install targets
Install this skill in your agent workflow
Copy the registry command or an agent-specific install prompt for Codex, Claude Code, and Cursor.
OpenAgentSkill CLI
Use the registry command when your workflow supports the OpenAgentSkill installer.
$ npx skills add xishandong/crawlProjectAgent resolve plan
Let an agent verify fit before installing.
The Resolve API returns the selected skill, alternatives, safety policy, audit notes, install target, and copy-paste prompt an agent can follow without scraping this page.
Resolve JSON
/api/agent/resolve?task=Use%20CrawlProject%20for%20an%20agent%20workflow&agent=codex&max_risk=medium
Resolve text
/api/agent/resolve?task=Use%20CrawlProject%20for%20an%20agent%20workflow&agent=codex&max_risk=medium&format=text
Install handoff
/api/skills/xishandong-crawlproject/install
Agent should check
- Task fit and alternatives from Resolve API.
- Audit score, trust score, and safety policy warnings.
- Install target compatibility for Codex, Claude Code, Cursor, or CLI.
Copy prompt
Task: Use CrawlProject in this workspace.
Resolve first: https://www.openagentskill.com/api/agent/resolve?task=Use%20CrawlProject%20for%20an%20agent%20workflow&agent=codex&max_risk=medium
Review install handoff: https://www.openagentskill.com/api/skills/xishandong-crawlproject/install
Install command: npx skills add xishandong/crawlProject
Before running it, summarize audit warnings, required permissions, and the fallback skill if install is risky.Agent handoff
Give an agent the install path, not another directory page.
Use the public install endpoint to fetch the command, safety checklist, target prompts, and canonical links for this skill.
Install handoff
/api/skills/xishandong-crawlproject/install
LLM text format
/api/skills/xishandong-crawlproject/install?format=text
Find alternatives
/api/skills/search?q=CrawlProject&limit=3
Agent prompt
Use CrawlProject for this task. Review https://www.openagentskill.com/api/skills/xishandong-crawlproject/install, then install with: npx skills add xishandong/crawlProjectRegistry metadata
Agent-readable profile for automatic skill selection.
This page exposes the same decision, trust, audit, use-case, and install signals through the Registry API, so agents can rank this skill without scraping the UI.
Manifest
/api/registry/manifest/xishandong-crawlproject
LLM text
/api/registry/manifest/xishandong-crawlproject?format=text
Install alias
/api/registry/install/xishandong-crawlproject
Recommend
/api/registry/recommend?task=Use%20CrawlProject%20in%20an%20agent%20workflow&limit=3
Agent fit
Web scraping
Use-case tags
Platforms
JavaScript, Playwright, Claude Code, Browser agents
Audit report
Needs review · 73/100
Review install readiness, maintenance, trust, quality, and metadata warnings before adding this skill to an agent workflow.
Agent decision cockpit
Fallback candidate for Web scraping
Prototype with this skill first; keep a fallback candidate ready.
Role in stack
Fallback candidate
Primary fit
Web scraping
Trust label
Prototype first
Install path
Command ready
Use when
- Web scraping workflows
- Claude Code teams
- teams that value GitHub adoption signals
Evidence
- 1,731 GitHub stars
- install command or GitHub repo available
- 68/100 quality profile
- 1 OpenAgentSkill engagement events
Review first
- Repository looks stale
Implementation path
- 1Install it in a sandbox agent and run one Web scraping task end to end.
- 2Compare output quality, latency, and failure behavior against at least one alternative.
- 3Promote it into production only after reviewing repository permissions, license, and maintenance signals.
Trust profile
Strong shortlist
Good trust signals with a few areas worth checking before rollout.
GitHub adoption
PASS1.7K GitHub stars
Recent maintenance
FIX2y since push
License clarity
CHECKUnknown
README/SKILL.md completeness
PASSMetadata includes enough usage and workflow context
Good signals
- Manually verified listing
- AI review approved
- Install path is available
- Repository evidence is available
- Meaningful GitHub adoption signal
Review before install
- License is unclear
- Repository looks stale
- Quality score needs review
- Recent maintenance: 2y since push
- License clarity: Unknown
Recommended action
Test in a sandbox workflow and compare its install path with close alternatives.
Quality profile
Promising candidate for agent workflows
Useful candidate, but compare it with alternatives before adopting.
Workflow fit
Use this skill in these scenarios
Collect structured data
Web scraping
I need my agent to scrape websites and extract structured data from pages.
Operate web apps
Browser automation
I need my agent to control a browser, fill forms, and verify web app workflows.
Verify behavior
Testing and QA
I need my agent to test a web app, reproduce bugs, and verify fixes.
Stack fit
Add it to a complete workflow
Operate and verify web apps
Browser QA agent
A stack for agents that navigate products, fill forms, take screenshots, and verify real user flows across web applications.
Scrape, clean, and reuse web data
Web data pipeline
A practical stack for agents that crawl public pages, extract clean content, normalize data, and hand it to downstream research or RAG workflows.
Inspect, patch, and verify code
Coding review agent
A stack for software agents that inspect repositories, review pull requests, generate tests, and turn findings into shippable patches.
Alternative shortlist
Compare before you install
Similar skills in this category, ranked with the same readiness and quality signals.
Playwright
Playwright is a framework for Web Testing and Automation. It allows testing Chromium, Firefox and WebKit with a single API.
Crawlee
Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.
Playwright Python
Python version of the Playwright testing and automation library.
FlareSolverr
Proxy server to bypass Cloudflare protection
Overview
python爬虫项目合集,从基础到js逆向,包含基础篇、自动化篇、进阶篇以及验证码篇。案例涵盖各大网站(xhs douyin weibo ins boss job,jd...),你将会学到有关爬虫以及反爬虫、自动化和验证码的各方面知识
Imported by the skill-only GitHub discovery pipeline because it matches agent skill, automation, RAG, or developer-tool signals. Protocol-server projects are excluded from automated imports.
Platform Compatibility
Technical Details
- Version
- 1.0.0
- License
- Unknown
- Last Updated
- 6/14/2026
- Published
- 6/5/2026
Frameworks & Tools
Decision snapshot
Fallback candidate
1,731 GitHub stars
Audit Snapshot
Install and adoption review
- Security
- 88/100
- Maintenance
- 38/100
- Install
- 92/100
Growth loop
Share this skill
Scenario-led draft for CrawlProject, with the OpenAgentSkill Update theme and canonical URL.
OpenAgentSkill Update Today: CrawlProject Use it when you need an agent to browse, extract, or monitor web pages without building a scraper fro... 1.7K stars - browser-automation Link: https://www.openagentskill.com/skills/xishandong-crawlproject?ref=x #AIAgents #OpenAgentSkill
Optional reply with install command
Link for CrawlProject: https://www.openagentskill.com/skills/xishandong-crawlproject?ref=x Install: npx skills add xishandong/crawlProject
Listing source
Community indexed
This listing was indexed from public sources and is not marked official until a maintainer claim is approved.
- Creator
- xishandong
- Source
- xishandong/crawlProject
- Indexed by
- OpenAgentSkill community index
Attribution links to the public repository or creator profile. Creators can claim the listing to update ownership signals.
Claim this skillOwner claim
Claim this skill listing
This community indexed listing is attributed to xishandong but is not marked official yet. Claim it to add a verified owner signal and make future launch, install, and audit updates easier to trust.
README badge
Add this badge to your GitHub README to show the listing, trust score, and install handoff.
[](https://www.openagentskill.com/skills/xishandong-crawlproject)Author
xishandong✓
@xishandong
Tags
Platform Fit
Health Signals
- GitHub stars
- 1.7K
- Quality score
- 49/100
- Last GitHub push
- Sep 23, 2024
- Framework hints
- 2
- OpenAgentSkill views
- 1
- Install copies
- 0
- Outbound clicks
- 0
Community Signal
Share whether this skill looks useful for your agent workflow. Aggregated feedback improves rankings over time.
Trust & Safety
Strong shortlist
- GitHub adoption1.7K GitHub starsPASS
- Recent maintenance2y since pushFIX
- License clarityUnknownCHECK
- README/SKILL.md completenessMetadata includes enough usage and workflow contextPASS
- Dependency risknetwork or browser surfacePASS
- Install availabilitynpx skills add xishandong/crawlProjectPASS
Related Skills
Playwright
Playwright is a framework for Web Testing and Automation. It allows testing Chromium, Firefox and WebKit with a single API.
91.0K stars · 0 installsCrawlee
Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.
23.8K stars · 0 installsPlaywright Python
Python version of the Playwright testing and automation library.
14.7K stars · 0 installsFlareSolverr
Proxy server to bypass Cloudflare protection
14.3K stars · 0 installs