Extractor
Use LLMs to robustly extract web data
Supply asset profile
Data, BI, and analytics
CSV, SQL, notebooks, dashboards, data pipelines, BI, ETL, and spreadsheet analysis.
Scenario
Web scraping
I need my agent to scrape websites and extract structured data from pages.
Agent fit
Claude Code + CLI + Codex
Codex, Claude Code, Cursor, CLI, or custom agents.
Install
Ready
npx skills add lightfeed/extractor
Maintenance
fresh
10d since push
Risk
Safe to try
Quality score needs review
GitHub quality
318
82/100 quality · 84/100 trust
Review notes
Quality score needs review
Agent adoption scorecard
Trust, audit, and install readiness at a glance
These scores combine public repository metadata, OpenAgentSkill review signals, maintenance freshness, and install readiness. They are a shortlist signal, not a replacement for human review.
Quality
Strong
Solid option that is likely worth shortlisting for production workflows.
Trust
Strong shortlist
Good trust signals with a few areas worth checking before rollout.
Audit
Safe to try
Install readiness, security metadata, maintenance, and adoption risk.
Trust Score v2
Human review before install
Test in a sandbox workflow and compare its install path with close alternatives.
Stars
318 GitHub stars
Maintenance
10d since push
License
Apache-2.0
Install
npx skills add lightfeed/extractor
Risk summary
Low metadata risk
- Quality score needs review
Install readiness
Install path available
- Install path is available
- Repository evidence is available
- License is declared
- 10d since push
Agent safety v2
72/100 · Review before install
Good audit and safety signals with no high-risk permission hints in public metadata.
Review the audit page, then allow agent install in a sandboxed workflow.
medium
Network access
Skill likely fetches remote pages, APIs, repositories, or external services.
medium
Filesystem access
Skill may read or write project files, documents, generated artifacts, or local workspace state.
- Quality score needs review
Install targets
Install this skill in your agent workflow
Copy the registry command or an agent-specific install prompt for Codex, Claude Code, and Cursor.
OpenAgentSkill CLI
Use the registry command when your workflow supports the OpenAgentSkill installer.
$ npx skills add lightfeed/extractor
Agent resolve plan
Let an agent verify fit before installing.
The Resolve API returns the selected skill, alternatives, safety policy, audit notes, install target, and copy-paste prompt an agent can follow without scraping this page.
Resolve JSON
/api/agent/resolve?task=Use%20Extractor%20for%20an%20agent%20workflow&agent=codex&max_risk=medium
Resolve text
/api/agent/resolve?task=Use%20Extractor%20for%20an%20agent%20workflow&agent=codex&max_risk=medium&format=text
Install handoff
/api/skills/lightfeed-extractor/install
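The Resolve URLs above can be assembled from their parts. The following is a minimal sketch, assuming the API is served from the same host as the copy prompt on this page; `buildResolveUrl`, `ResolveOptions`, and the `"low"` and `"high"` risk values are hypothetical names introduced for illustration (only `max_risk=medium` appears in the listing).

```typescript
// Host taken from the copy prompt on this page; treat it as an assumption.
const RESOLVE_ORIGIN = "https://www.openagentskill.com";

// Only "medium" is shown in the listing; "low" and "high" are assumed values.
type MaxRisk = "low" | "medium" | "high";

interface ResolveOptions {
  task: string;     // free-text task description
  agent: string;    // e.g. "codex"
  maxRisk: MaxRisk; // sent as the max_risk query parameter
  format?: "text";  // omit for the JSON variant
}

// Hypothetical helper, not part of any published SDK.
function buildResolveUrl(opts: ResolveOptions): string {
  const params: Array<[string, string]> = [
    ["task", opts.task],
    ["agent", opts.agent],
    ["max_risk", opts.maxRisk],
  ];
  if (opts.format) {
    params.push(["format", opts.format]);
  }
  // encodeURIComponent encodes spaces as %20, matching the URLs listed above.
  const query = params
    .map(([key, value]) => `${key}=${encodeURIComponent(value)}`)
    .join("&");
  return `${RESOLVE_ORIGIN}/api/agent/resolve?${query}`;
}
```

Calling `buildResolveUrl({ task: "Use Extractor for an agent workflow", agent: "codex", maxRisk: "medium" })` reproduces the Resolve JSON path shown above; adding `format: "text"` gives the text variant.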
Agent should check
- Task fit and alternatives from Resolve API.
- Audit score, trust score, and safety policy warnings.
- Install target compatibility for Codex, Claude Code, Cursor, or CLI.
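The three checks above could be turned into a small gate that an agent runs before installing. This is only a sketch: the listing does not document the Resolve response schema, so `SkillCheck` is a hypothetical summary the agent would fill in itself, and the `low < medium < high` ordering is an assumption.

```typescript
type Risk = "low" | "medium" | "high";

// Assumed ordering of risk levels; the listing only shows "medium" in use.
const RISK_ORDER: Record<Risk, number> = { low: 0, medium: 1, high: 2 };

// Hypothetical summary assembled by the agent; NOT the real Resolve response shape.
interface SkillCheck {
  taskFit: boolean;         // did the resolved skill match the task?
  risk: Risk;               // risk level reported for the skill
  warnings: string[];       // audit, trust, or safety policy warnings
  installTargets: string[]; // targets the skill can be installed into
}

interface InstallDecision {
  install: boolean;
  blockers: string[];
  warnings: string[];
}

function decideInstall(check: SkillCheck, agent: string, maxRisk: Risk): InstallDecision {
  const blockers: string[] = [];
  if (!check.taskFit) {
    blockers.push("task fit not confirmed");
  }
  if (RISK_ORDER[check.risk] > RISK_ORDER[maxRisk]) {
    blockers.push(`risk ${check.risk} exceeds ${maxRisk}`);
  }
  if (check.installTargets.indexOf(agent) === -1) {
    blockers.push(`no install target for ${agent}`);
  }
  // Warnings never block on their own; they are returned so the agent can summarize them.
  return { install: blockers.length === 0, blockers, warnings: check.warnings.slice() };
}
```

Keeping warnings separate from blockers lets the agent report items such as "Quality score needs review" without refusing the install outright.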
Copy prompt
Task: Use Extractor in this workspace.
Resolve first: https://www.openagentskill.com/api/agent/resolve?task=Use%20Extractor%20for%20an%20agent%20workflow&agent=codex&max_risk=medium
Review install handoff: https://www.openagentskill.com/api/skills/lightfeed-extractor/install
Install command: npx skills add lightfeed/extractor
Before running it, summarize audit warnings, required permissions, and the fallback skill if install is risky.
Agent handoff
Give an agent the install path, not another directory page.
Use the public install endpoint to fetch the command, safety checklist, target prompts, and canonical links for this skill.
Install handoff
/api/skills/lightfeed-extractor/install
LLM text format
/api/skills/lightfeed-extractor/install?format=text
Find alternatives
/api/skills/search?q=Extractor&limit=3
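An agent could request the text handoff roughly as follows. This is a sketch under stated assumptions: the host is taken from the agent prompt on this page, the response body is treated as opaque text because its format is not documented here, and `installHandoffUrl`, `alternativesUrl`, `fetchInstallHandoffText`, and `TextFetcher` are hypothetical names.

```typescript
// Host taken from the prompts on this page; treat it as an assumption.
const HANDOFF_ORIGIN = "https://www.openagentskill.com";

// Minimal fetch-like signature so a stub can be passed in during testing.
type TextFetcher = (url: string) => Promise<{ ok: boolean; status: number; text(): Promise<string> }>;

function installHandoffUrl(slug: string, asText: boolean): string {
  const base = `${HANDOFF_ORIGIN}/api/skills/${encodeURIComponent(slug)}/install`;
  return asText ? `${base}?format=text` : base;
}

function alternativesUrl(query: string, limit: number): string {
  return `${HANDOFF_ORIGIN}/api/skills/search?q=${encodeURIComponent(query)}&limit=${limit}`;
}

// Returns the raw handoff text; no response schema is assumed.
async function fetchInstallHandoffText(slug: string, fetcher: TextFetcher): Promise<string> {
  const res = await fetcher(installHandoffUrl(slug, true));
  if (!res.ok) {
    throw new Error(`install handoff request failed with status ${res.status}`);
  }
  return res.text();
}
```

In a runtime with a global `fetch` (for example Node 18 or later), `fetchInstallHandoffText("lightfeed-extractor", fetch)` would request the LLM text format listed above.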
Agent prompt
Use Extractor for this task. Review https://www.openagentskill.com/api/skills/lightfeed-extractor/install, then install with: npx skills add lightfeed/extractor
Registry metadata
Agent-readable profile for automatic skill selection.
This page exposes the same decision, trust, audit, use-case, and install signals through the Registry API, so agents can rank this skill without scraping the UI.
Manifest
/api/registry/manifest/lightfeed-extractor
LLM text
/api/registry/manifest/lightfeed-extractor?format=text
Install alias
/api/registry/install/lightfeed-extractor
Recommend
/api/registry/recommend?task=Use%20Extractor%20in%20an%20agent%20workflow&limit=3
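The four registry paths above follow a pattern that can be derived from the skill slug. The sketch below keeps them as relative paths, exactly as listed; `registryLinks` and `RegistryLinks` are hypothetical names, and the default `limit` of 3 simply mirrors the Recommend path shown above.

```typescript
interface RegistryLinks {
  manifest: string;
  manifestText: string;
  install: string;
  recommend: string;
}

// Hypothetical helper that mirrors the paths listed above; it makes no network calls.
function registryLinks(slug: string, task: string, limit: number = 3): RegistryLinks {
  const s = encodeURIComponent(slug);
  return {
    manifest: `/api/registry/manifest/${s}`,
    manifestText: `/api/registry/manifest/${s}?format=text`,
    install: `/api/registry/install/${s}`,
    recommend: `/api/registry/recommend?task=${encodeURIComponent(task)}&limit=${limit}`,
  };
}
```

For example, `registryLinks("lightfeed-extractor", "Use Extractor in an agent workflow")` yields the Manifest, LLM text, Install alias, and Recommend paths for this listing.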
Agent fit
Web scraping
Platforms
TypeScript, Data Pipeline, Claude Code
Audit report
Safe to try · 88/100
Review install readiness, maintenance, trust, quality, and metadata warnings before adding this skill to an agent workflow.
Agent decision cockpit
Companion skill for Web scraping
Shortlist this skill and compare it with close alternatives before production adoption.
Role in stack
Companion skill
Primary fit
Web scraping
Trust label
Strong shortlist
Install path
Command ready
Use when
- Web scraping workflows
- Claude Code teams
- Builders willing to evaluate younger projects
Evidence
- Recent repository activity
- Install command or GitHub repo available
- 82/100 quality profile
Review first
- No OpenAgentSkill engagement data yet
Implementation path
1. Install it in a sandbox agent and run one web scraping task end to end.
2. Compare output quality, latency, and failure behavior against at least one alternative.
3. Promote it into production only after reviewing repository permissions, license, and maintenance signals.
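The comparison step could be run with a small harness like the one below. It is a sketch only: the task functions are placeholders supplied by the caller, nothing about Extractor's own API is assumed, and `runTrial`, `compareSkills`, and `TrialResult` are hypothetical names.

```typescript
interface TrialResult {
  name: string;
  ok: boolean;     // false when the task threw
  ms: number;      // wall-clock duration in milliseconds
  output?: string; // present on success, for manual quality review
  error?: string;  // present on failure
}

// Runs one task, capturing latency and failure instead of letting errors propagate.
async function runTrial(name: string, task: () => Promise<string>): Promise<TrialResult> {
  const start = Date.now();
  try {
    const output = await task();
    return { name, ok: true, ms: Date.now() - start, output };
  } catch (err) {
    const message = err instanceof Error ? err.message : String(err);
    return { name, ok: false, ms: Date.now() - start, error: message };
  }
}

// Runs candidates one after another so their timings do not interfere.
async function compareSkills(candidates: Record<string, () => Promise<string>>): Promise<TrialResult[]> {
  const results: TrialResult[] = [];
  for (const name of Object.keys(candidates)) {
    results.push(await runTrial(name, candidates[name]));
  }
  return results;
}
```

Each entry in `candidates` would wrap the same scraping task for one skill; output quality still needs a human or a separate scoring step, since the harness only records what each task returned.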
Trust profile
Strong shortlist
Good trust signals with a few areas worth checking before rollout.
GitHub adoption
INFO · 318 GitHub stars
Recent maintenance
PASS · 10d since push
License clarity
PASS · Apache-2.0
README/SKILL.md completeness
INFO · Public metadata needs stronger README/SKILL.md context
Good signals
- AI review approved
- Install path is available
- Repository evidence is available
- Recently maintained repository
Review before install
- Quality score needs review
Recommended action
Test in a sandbox workflow and compare its install path with close alternatives.
Quality profile
Strong candidate for agent workflows
Solid option that is likely worth shortlisting for production workflows.
Workflow fit
Use this skill in these scenarios
Collect structured data
Web scraping
I need my agent to scrape websites and extract structured data from pages.
Parse messy files
Document processing
I need my agent to read PDFs, extract tables, and turn documents into structured data.
Build and ship code
Coding agents
I need a coding agent that can understand a repository, edit code, and review pull requests.
Stack fit
Add it to a complete workflow
Scrape, clean, and reuse web data
Web data pipeline
A practical stack for agents that crawl public pages, extract clean content, normalize data, and hand it to downstream research or RAG workflows.
Turn skills into distribution
Content growth agent
A stack for turning newly indexed skills into SEO briefs, social drafts, comparison pages, and reusable publishing workflows.
Inspect, patch, and verify code
Coding review agent
A stack for software agents that inspect repositories, review pull requests, generate tests, and turn findings into shippable patches.
Alternative shortlist
Compare before you install
Similar skills in this category, ranked with the same readiness and quality signals.
D3
Bring data to life with SVG, Canvas and HTML. :bar_chart::chart_with_upwards_trend::tada:
Grafana
The open and composable observability and data visualization platform. Visualize metrics, logs, and traces from multiple sources like Prometheus, Loki, Elasticsearch, InfluxDB, Postgres and many more.
Superset
Apache Superset is a Data Visualization and Data Exploration Platform
Echarts
Apache ECharts is a powerful, interactive charting and data visualization library for browser
Overview
Use LLMs to robustly extract web data
Imported by the skill-only GitHub discovery pipeline because it matches agent skill, automation, domain workflow, RAG, document-processing, data, finance, security, or developer-tool signals. Protocol-server projects are excluded from automated imports.
Technical Details
- Version: 1.0.0
- License: Apache-2.0
- Last Updated: 6/16/2026
- Published: 6/16/2026
Decision snapshot
Companion skill
recent repository activity
Audit Snapshot
Install and adoption review
- Security: 94/100
- Maintenance: 100/100
- Install: 92/100
Listing source
Community indexed
This listing was indexed from public sources and is not marked official until a maintainer claim is approved.
- Creator: lightfeed
- Source: lightfeed/extractor
- Indexed by: OpenAgentSkill community index
Attribution links to the public repository or creator profile. Creators can claim the listing to update ownership signals.
Claim this skill
Owner claim
Claim this skill listing
This community indexed listing is attributed to lightfeed but is not marked official yet. Claim it to add a verified owner signal and make future launch, install, and audit updates easier to trust.
README badge
Add this badge to your GitHub README to show the listing, trust score, and install handoff.
[](https://www.openagentskill.com/skills/lightfeed-extractor)
Author
lightfeed
@lightfeed
Health Signals
- GitHub stars: 318
- Quality score: 51/100
- Last GitHub push: Jun 6, 2026
- Framework hints: 2
- OpenAgentSkill views: 0
- Install copies: 0
- Outbound clicks: 0
Trust & Safety
Strong shortlist
- GitHub adoption · 318 GitHub stars · INFO
- Recent maintenance · 10d since push · PASS
- License clarity · Apache-2.0 · PASS
- README/SKILL.md completeness · Public metadata needs stronger README/SKILL.md context · INFO
- Dependency risk · No major dependency risk hints in public metadata · PASS
- Install availability · npx skills add lightfeed/extractor · PASS
Related Skills
D3
Bring data to life with SVG, Canvas and HTML. :bar_chart::chart_with_upwards_trend::tada:
113.1K stars · 0 installs
Grafana
The open and composable observability and data visualization platform. Visualize metrics, logs, and traces from multiple sources like Prometheus, Loki, Elasticsearch, InfluxDB, Postgres and many more.
74.4K stars · 0 installs
Superset
Apache Superset is a Data Visualization and Data Exploration Platform
73.3K stars · 0 installs
Echarts
Apache ECharts is a powerful, interactive charting and data visualization library for browser
66.6K stars · 0 installs