Decision filters

Choose skills by scenario, quality, and trust signals.

5 skills matching "extractor"

Best blend of quality, stars, freshness, and agent usage

1

PaddleOCR

VERIFIEDEXCELLENT · 100

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

$ npx skills add PaddlePaddle/PaddleOCR
78.4K stars77 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
pythonrag
by PaddlePaddleQuick view
2

Article Extractor

VERIFIEDEXCELLENT · 100

To extract main article from given URL with Node.js

$ npx skills add extractus/article-extractor
1.9K stars65 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
javascriptcrawler
by extractusQuick view
3

NewPipeExtractor

VERIFIEDEXCELLENT · 100

NewPipe's core library for extracting data from streaming sites

$ npx skills add TeamNewPipe/NewPipeExtractor
1.9K stars65 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
javacrawler
by TeamNewPipeQuick view
4

News Please

VERIFIEDEXCELLENT · 99

news-please - an integrated web crawler and information extractor for news that just works

$ npx skills add fhamborg/news-please
2.5K stars62 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
pythoncrawler
by fhamborgQuick view
5

Trafilatura

VERIFIEDEXCELLENT · 91

Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML

$ npx skills add adbar/trafilatura
6.0K stars57 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
pythonweb-automation
by adbarQuick view