Wayback Machine Scraper
A command-line utility and Scrapy middleware for scraping time series data from Archive.org's Wayback Machine.
Supply asset profile
Coding and developer agents
Code review, repo analysis, testing, CI, GitHub, DevOps, and developer workflow skills.
Scenario
Coding agents
I need a coding agent that can understand a repository, edit code, and review pull requests.
Agent fit
Claude Code + Browser agents + CLI
Codex, Claude Code, Cursor, CLI, or custom agents.
Install
Ready
npx skills add sangaline/wayback-machine-scraper
Maintenance
stale
2y since push
Risk
Needs review
Permission surface may require sandboxing
GitHub quality
480
53/100 quality · 70/100 trust
Review notes
Permission surface may require sandboxing · Repository appears stale
Agent adoption scorecard
Trust, audit, and install readiness at a glance
These scores combine public repository metadata, OpenAgentSkill review signals, maintenance freshness, and install readiness. They are a shortlist signal, not a replacement for human review.
Quality
Needs review
Inspect the repository carefully before adding it to an agent workflow.
Trust
Sandbox only
Useful candidate with missing or mixed trust signals. Keep it in an isolated workspace until the outcome loop proves task fit.
Audit
Needs review
Install readiness, security metadata, maintenance, and adoption risk.
Trust Score v5
Human review before install
Run only in a sandbox and compare close alternatives before using it for real work.
Stars
480 GitHub stars
Repo activity
480 stars, 81 forks
Maintenance
2y since push
License
ISC
Install
npx skills add sangaline/wayback-machine-scraper
Install safety
standard package or runtime install path
Permission surface
shell or command execution, filesystem or document access
Agent outcomes
No agent outcome data yet
Docs
Strong README/SKILL.md context
Risk summary
Review before production
- Repository looks stale
- Quality score needs review
- Permission surface needs review: shell or command execution, filesystem or document access
- Recent maintenance: 2y since push
Install readiness
Install path available
- Install path is available
- Repository evidence is available
- License is declared
- No Agent Proven outcome evidence yet
Agent-readable metadata
Machine-readable decision data for this skill.
Use this block or the embedded JSON to decide whether an agent should install this skill, choose an alternative, or ask for human review first.
Suited tasks
- Web scraping workflows
- Claude Code teams
- builders willing to evaluate younger projects
- Crawl target URLs
Install decision
- Command: npx skills add sangaline/wayback-machine-scraper
- Policy: review
- Human review: yes
Trust and risk
- Trust: 62/100
- Audit: 63/100
- Risk level: Needs review
Outcome loop
- Endpoint: /api/agent/outcome
- Event ID: resolve
- Outcomes: 5
Install command
npx skills add sangaline/wayback-machine-scraper
Do not use when
- teams that require actively maintained dependencies
- production agents without a repository review
- Repository looks stale
- High-risk permission hints: Shell or command execution
- Permission surface may require sandboxing
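A minimal sketch of how an agent might act on the decision data above. The values are copied from this listing, but the dictionary layout, the key names, the `install_decision` helper, and the 70-point thresholds are all assumptions made for illustration; they are not the registry's actual JSON schema or policy.

```python
# Values copied from this listing; the key names are hypothetical.
skill = {
    "command": "npx skills add sangaline/wayback-machine-scraper",
    "policy": "review",
    "human_review": True,
    "trust": 62,
    "audit": 63,
    "risk_level": "Needs review",
}


def install_decision(meta, min_trust=70, min_audit=70):
    """Return 'ask_human', 'sandbox', or 'install' for a skill.

    The thresholds are illustrative defaults, not values published
    by the registry.
    """
    # An explicit review policy or human-review flag always wins.
    if meta["human_review"] or meta["policy"] == "review":
        return "ask_human"
    # Otherwise fall back to the numeric scores.
    if meta["trust"] < min_trust or meta["audit"] < min_audit:
        return "sandbox"
    return "install"


print(install_decision(skill))
```

With the values listed for this skill, the sketch asks for human review before anything is installed, which matches the "Human review: yes" entry above.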
Agent safety v2
31/100 · Avoid automatic install
Sparse or mixed signals. Useful for discovery, but not for autonomous installation.
Test manually in an isolated workspace and compare against safer alternatives.
- High: Shell or command execution. Skill metadata references terminal, CLI, shell, subprocess, or command execution workflows.
- Medium: Browser automation. Skill may drive a browser or interact with web pages.
- Medium: Network access. Skill likely fetches remote pages, APIs, repositories, or external services.
- Medium: Filesystem access. Skill may read or write project files, documents, generated artifacts, or local workspace state.
Install targets
Install this skill in your agent workflow
Copy the registry command or an agent-specific install prompt for Codex, Claude Code, and Cursor.
OpenAgentSkill CLI
Use the registry command when your workflow supports the OpenAgentSkill installer.
$ npx skills add sangaline/wayback-machine-scraper
Agent resolve plan
Let an agent verify fit before installing.
The Resolve API returns the selected skill, alternatives, safety policy, audit notes, install target, and copy-paste prompt an agent can follow without scraping this page.
Resolve JSON
/api/agent/resolve?task=Use%20Wayback%20Machine%20Scraper%20for%20an%20agent%20workflow&agent=codex&max_risk=medium
Resolve text
/api/agent/resolve?task=Use%20Wayback%20Machine%20Scraper%20for%20an%20agent%20workflow&agent=codex&max_risk=medium&format=text
Install handoff
/api/skills/sangaline-wayback-machine-scraper/install
Agent should check
- Task fit and alternatives from Resolve API.
- Audit score, trust score, and safety policy warnings.
- Install target compatibility for Codex, Claude Code, Cursor, or CLI.
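The Resolve paths listed above can be assembled programmatically rather than copied by hand. The sketch below only builds the URL; it does not call the API, and the `build_resolve_url` helper is a hypothetical name. The base URL and the `task`, `agent`, `max_risk`, and `format` parameters are the ones that appear on this page.

```python
from urllib.parse import quote, urlencode

# Host as it appears in the copy prompt on this page.
BASE_URL = "https://www.openagentskill.com"


def build_resolve_url(task, agent="codex", max_risk="medium", text_format=False):
    """Build a Resolve API URL using the query parameters shown on this page."""
    params = {"task": task, "agent": agent, "max_risk": max_risk}
    if text_format:
        params["format"] = "text"
    # quote_via=quote encodes spaces as %20; the default (quote_plus) would use '+'.
    return f"{BASE_URL}/api/agent/resolve?{urlencode(params, quote_via=quote)}"


url = build_resolve_url("Use Wayback Machine Scraper for an agent workflow")
text_url = build_resolve_url(
    "Use Wayback Machine Scraper for an agent workflow", text_format=True
)
print(url)
print(text_url)
```

Passing `quote_via=quote` keeps the output identical to the percent-encoded paths on this page, which makes the generated URL easy to compare against the listing.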
Copy prompt
Task: Use Wayback Machine Scraper in this workspace.
Resolve first: https://www.openagentskill.com/api/agent/resolve?task=Use%20Wayback%20Machine%20Scraper%20for%20an%20agent%20workflow&agent=codex&max_risk=medium
Review install handoff: https://www.openagentskill.com/api/skills/sangaline-wayback-machine-scraper/install
Install command: npx skills add sangaline/wayback-machine-scraper
Before running it, summarize audit warnings, required permissions, and the fallback skill if install is risky.
Agent handoff
Give an agent the install path, not another directory page.
Use the public install endpoint to fetch the command, safety checklist, target prompts, and canonical links for this skill.
Install handoff
/api/skills/sangaline-wayback-machine-scraper/install
LLM text format
/api/skills/sangaline-wayback-machine-scraper/install?format=text
Find alternatives
/api/skills/search?q=Wayback%20Machine%20Scraper&limit=3
Agent prompt
Use Wayback Machine Scraper for this task. Review https://www.openagentskill.com/api/skills/sangaline-wayback-machine-scraper/install, then install with: npx skills add sangaline/wayback-machine-scraper
Registry metadata
Agent-readable profile for automatic skill selection.
This page exposes the same decision, trust, audit, use-case, and install signals through the Registry API, so agents can rank this skill without scraping the UI.
Manifest
/api/registry/manifest/sangaline-wayback-machine-scraper
LLM text
/api/registry/manifest/sangaline-wayback-machine-scraper?format=text
Install alias
/api/registry/install/sangaline-wayback-machine-scraper
Recommend
/api/registry/recommend?task=Use%20Wayback%20Machine%20Scraper%20in%20an%20agent%20workflow&limit=3
Agent fit
Web scraping
Platforms
Python, Web Automation, Claude Code, Browser agents
Audit report
Needs review · 63/100
Review install readiness, maintenance, trust, quality, and metadata warnings before adding this skill to an agent workflow.
Agent decision cockpit
Needs validation for Web scraping
Do a manual repository review before adding this to an agent workflow.
Role in stack
Needs validation
Primary fit
Web scraping
Trust label
Needs manual review
Install path
Command ready
Use when
- Web scraping workflows
- Claude Code teams
- builders willing to evaluate younger projects
Evidence
- install command or GitHub repo available
- 53/100 quality profile
- 5 OpenAgentSkill engagement events
Review first
- Repository looks stale
Implementation path
1. Install it in a sandbox agent and run one web scraping task end to end.
2. Compare output quality, latency, and failure behavior against at least one alternative.
3. Promote it into production only after reviewing repository permissions, license, and maintenance signals.
Trust profile
Sandbox only
Useful candidate with missing or mixed trust signals. Keep it in an isolated workspace until the outcome loop proves task fit.
- GitHub adoption: 480 GitHub stars (INFO)
- Stars/forks activity: 480 stars, 81 forks; issue activity unavailable in current metadata (INFO)
- Recent maintenance: 2y since push (FIX)
- License clarity: ISC (PASS)
Good signals
- AI review approved
- Install path is available
- Repository evidence is available
- Install command has no obvious high-risk pattern
- Outcome loop is ready but needs first real agent run
Review before install
- Repository looks stale
- Quality score needs review
- Permission surface needs review: shell or command execution, filesystem or document access
- Recent maintenance: 2y since push
- Permission surface: shell or command execution, filesystem or document access
- No real agent outcome reports yet
- Human review required before unattended installation
Recommended action
Run only in a sandbox and compare close alternatives before using it for real work.
Quality profile
Candidate for agent workflows; needs review
Inspect the repository carefully before adding it to an agent workflow.
Workflow fit
Use this skill in these scenarios
Collect structured data
Web scraping
I need my agent to scrape websites and extract structured data from pages.
Operate web apps
Browser automation
I need my agent to control a browser, fill forms, and verify web app workflows.
Build and ship code
Coding agents
I need a coding agent that can understand a repository, edit code, and review pull requests.
Stack fit
Add it to a complete workflow
Scrape, clean, and reuse web data
Web data pipeline
A practical stack for agents that crawl public pages, extract clean content, normalize data, and hand it to downstream research or RAG workflows.
Turn skills into distribution
Content growth agent
A stack for turning newly indexed skills into SEO briefs, social drafts, comparison pages, and reusable publishing workflows.
Operate and verify web apps
Browser QA agent
A stack for agents that navigate products, fill forms, take screenshots, and verify real user flows across web applications.
Alternative shortlist
Compare before you install
Similar skills in this category, ranked with the same readiness and quality signals.
Crawl4AI
Open-source LLM-friendly web crawler and scraper
Scrapling
Adaptive web scraping for agent data collection
Scrapy
Scrapy, a fast high-level web crawling & scraping framework for Python.
EasySpider
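A visual no-code web crawler/spider. EasySpider (易采集) is visual software for browser automation testing, data collection, and web crawling that lets you design and run crawler tasks graphically without writing code. Also known as ServiceWrapper, an intelligent service-wrapping system for web applications.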
Overview
A command-line utility and Scrapy middleware for scraping time series data from Archive.org's Wayback Machine.
Imported by the skill-only GitHub discovery pipeline because it matches agent skill, automation, domain workflow, RAG, document-processing, data, finance, security, or developer-tool signals. Protocol-server projects are excluded from automated imports.
Technical Details
- Version: 1.0.0
- License: ISC
- Last Updated: 6/21/2026
- Published: 6/21/2026
Decision snapshot
Needs validation
install command or GitHub repo available
Audit snapshot
Install review
Install and adoption review
- Security: 83/100
- Maintenance: 20/100
- Install: 92/100
Agent-proven evidence
Outcome reports after resolve, review, install, and one narrow run.
- Success rate: n/a
- Recent failure: n/a
- Outcomes: 0
- Output quality: n/a
- Failed: 0
- Not relevant: 0
- Installs: 0
- Risk blocked: 0
- Setup needed: 0
- Production: 0
No agent outcome data yet. The first agent run can report success, setup needs, risk blocks, failure, or not-relevant through /api/agent/outcome.
Install
Add to agent workflow
Free and open source. Review the audit before production use.
Listing source
Community indexed
This listing was indexed from public sources and is not marked official until a maintainer claim is approved.
- Creator
- sangaline
- Indexed by
- OpenAgentSkill community index
Attribution links to the public repository or creator profile. Creators can claim the listing to update ownership signals.
Author
sangaline
@sangaline
Health Signals
- GitHub stars: 480
- Quality score: 37/100
- Last GitHub push: Feb 23, 2024
- Framework hints: 2
- OpenAgentSkill views: 3
- Install copies: 0
- Outbound clicks: 0
Trust & Safety
Sandbox only
- GitHub adoption: 480 GitHub stars (INFO)
- Stars/forks activity: 480 stars, 81 forks; issue activity unavailable in current metadata (INFO)
- Recent maintenance: 2y since push (FIX)
- License clarity: ISC (PASS)
- README/SKILL.md completeness: Metadata includes enough usage and workflow context (PASS)
- Dependency/runtime risk: network or browser surface (PASS)
Related Skills
Crawl4AI
Open-source LLM-friendly web crawler and scraper
70.8K stars · 31.0K installs
Scrapling
Adaptive web scraping for agent data collection
67.8K stars · 0 installs
Scrapy
Scrapy, a fast high-level web crawling & scraping framework for Python.
62.5K stars · 0 installs
EasySpider
A visual no-code web crawler/spider. EasySpider (易采集) is visual software for browser automation testing, data collection, and web crawling that lets you design and run crawler tasks graphically without writing code. Also known as ServiceWrapper, an intelligent service-wrapping system for web applications.
44.1K stars · 0 installs