Decision filters

Choose skills by scenario, quality, and trust signals.

15 skills matching "parse"

Best blend of quality, stars, freshness, and agent usage

1

Opendataloader Pdf

VERIFIEDEXCELLENT · 100

PDF Parser for AI-ready data. Automate PDF accessibility. Open-source.

$ npx skills add opendataloader-project/opendataloader-pdf
21.5K stars74 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
javarag
by opendataloader-projectQuick view
2

Crawlee Python

VERIFIEDEXCELLENT · 100

Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Parsel, BeautifulSoup, Playwright, and raw HTTP. Both headful and headless mode. With proxy rotation.

$ npx skills add apify/crawlee-python
9.1K stars69 qualityClaude Code + OpenAI Agents
High-confidence pick with strong adoption and healthy maintenance signals.
pythonweb-automation
by apifyQuick view
3

Skills

VERIFIEDEXCELLENT · 100

Trail of Bits Claude Code skills for security research, vulnerability detection, and audit workflows

$ npx skills add trailofbits/skills
5.4K stars69 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
by trailofbitsQuick view
4

AutoRAG

VERIFIEDEXCELLENT · 100

AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automation

$ npx skills add Marker-Inc-Korea/AutoRAG
4.8K stars67 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
pythonrag
by Marker-Inc-KoreaQuick view
5

Infinity

VERIFIEDEXCELLENT · 100

The AI-native database built for LLM applications, providing incredibly fast hybrid search of dense vector, sparse vector, tensor (multi-vector), and full-text.

$ npx skills add infiniflow/infinity
4.5K stars67 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
c++rag
by infiniflowQuick view
6

Article Extractor

VERIFIEDEXCELLENT · 100

To extract main article from given URL with Node.js

$ npx skills add extractus/article-extractor
1.9K stars65 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
javascriptcrawler
by extractusQuick view
7

Core

VERIFIEDEXCELLENT · 100

A modern PDF library for TypeScript. Parse, modify, and generate PDFs with a clean, intuitive API.

$ npx skills add LibPDF-js/core
1.7K stars64 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
typescriptpdf
by LibPDF-jsQuick view
8

Selectolax

VERIFIEDEXCELLENT · 100

Python binding to Modest and Lexbor engines. Fast HTML5 parser with CSS selectors for Python.

$ npx skills add rushter/selectolax
1.6K stars64 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
cythonweb-automation
by rushterQuick view
9

Phpdoc Parser

VERIFIEDEXCELLENT · 100

Next-gen phpDoc parser with support for intersection types and generics

$ npx skills add phpstan/phpdoc-parser
1.5K stars64 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
phpstatic-analysis
by phpstanQuick view
10

Php Svg Lib

VERIFIEDEXCELLENT · 87

SVG file parsing / rendering library

$ npx skills add dompdf/php-svg-lib
1.4K stars57 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
phppdf
by dompdfQuick view
11

Parse Video

EXCELLENT · 87

Golang短视频去水印:抖音,皮皮虾,火山,微视,最右,快手,全民小视频,皮皮搞笑,西瓜视频,虎牙,梨视频,acfun,好看视频...

$ npx skills add wujunwei928/parse-video
921 stars54 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
gocrawler
by wujunwei928Quick view
12

Skrape.It

EXCELLENT · 87

A Kotlin-based testing/scraping/parsing library providing the ability to analyze and extract data from HTML (server & client-side rendered). It places particular emphasis on ease of use and a high level of readability by providing an intuitive DSL. It aims to be a testing lib, but can also be used to scrape websites in a convenient fashion.

$ npx skills add skrapeit/skrape.it
871 stars54 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
kotlincrawler
by skrapeitQuick view
13

Dedoc

EXCELLENT · 86

Dedoc is a library (service) for automate documents parsing and bringing to a uniform format. It automatically extracts content, logical structure, tables, and meta information from textual electronic documents. (Parse document; Document content extraction; Logical structure extraction; PDF parser; Scanned document parser; DOCX parser; HTML parser

$ npx skills add ispras/dedoc
704 stars54 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
pythonocr
by isprasQuick view
14

Parser Avito

STRONG · 80

Avito Parser —бесплатный парсер для автоматического мониторинга новых объявлений Avito и\или выгрузки объявлений в файл

$ npx skills add Duff89/parser_avito
616 stars53 qualityClaude Code + Browser agents
Solid option that is likely worth shortlisting for production workflows.
pythonplaywright
by Duff89Quick view
15

Docstrange

VERIFIEDEXCELLENT · 85

Extract and convert data from any document, images, pdfs, word doc, ppt or URL into multiple formats (Markdown, JSON, CSV, HTML) with intelligent structured data extraction and advanced OCR.

$ npx skills add NanoNets/docstrange
1.5K stars53 qualityClaude Code
High-confidence pick with strong adoption and healthy maintenance signals.
pythonocr
by NanoNetsQuick view