Html To Markdown
High performance and CommonMark compliant HTML to Markdown converter. Maintained by the Kreuzberg team. Kreuzberg is a fast, polyglot document intelligence engine with a Rust core. It extracts structured data from 56+ document formats using streaming parsers and built-in OCR.
Install with one command
$ npx skills add kreuzberg-dev/html-to-markdownDecision summary
Production-ready for Document processing
Use this as a leading candidate, then validate the README and install path in your own agent stack.
Best for
- Document processing workflows
- Claude Code teams
- teams that value GitHub adoption signals
Not ideal for
- teams that need a vendor-supported SLA
- high-compliance environments without internal security review
Risk notes
- No OpenAgentSkill engagement data yet
Quality profile
Excellent candidate for agent workflows
High-confidence pick with strong adoption and healthy maintenance signals.
Workflow fit
Use this skill in these scenarios
Parse messy files
Document processing
I need my agent to read PDFs, extract tables, and turn documents into structured data.
Search private knowledge
RAG and knowledge
I need my agent to build a RAG workflow over documents and retrieve reliable context.
Operate web apps
Browser automation
I need my agent to control a browser, fill forms, and verify web app workflows.
Stack fit
Add it to a complete workflow
Ingest, retrieve, and cite
RAG knowledge base
A stack for document-heavy agents that ingest files, create searchable knowledge, retrieve relevant context, and answer with grounded sources.
Scrape, clean, and reuse web data
Web data pipeline
A practical stack for agents that crawl public pages, extract clean content, normalize data, and hand it to downstream research or RAG workflows.
Operate and verify web apps
Browser QA agent
A stack for agents that navigate products, fill forms, take screenshots, and verify real user flows across web applications.
Overview
High performance and CommonMark compliant HTML to Markdown converter. Maintained by the Kreuzberg team. Kreuzberg is a fast, polyglot document intelligence engine with a Rust core. It extracts structured data from 56+ document formats using streaming parsers and built-in OCR.
Imported by the skill-only GitHub discovery pipeline because it matches agent skill, automation, RAG, or developer-tool signals. Protocol-server projects are excluded from automated imports.
Platform Compatibility
Technical Details
- Version
- 1.0.0
- License
- MIT
- Last Updated
- 5/31/2026
- Published
- 5/25/2026
Frameworks & Tools
Claim this skill
Project owners can request ownership review. Approved claims unlock a stronger trust signal.
Author
kreuzberg-dev
@kreuzberg-dev
Tags
Platform Fit
Health Signals
- GitHub stars
- 743
- Quality score
- 54/100
- Last GitHub push
- May 30, 2026
- Framework hints
- 2
- OpenAgentSkill views
- 0
- Install copies
- 0
- Outbound clicks
- 0
Community Signal
Share whether this skill looks useful for your agent workflow. Aggregated feedback improves rankings over time.
Trust & Safety
- —Open source (public GitHub repo)
- —AI static analysis passed
- —License: MIT
Related Skills
MarkItDown
Convert documents into Markdown for agent-readable context
132.6K stars · 0 installsAwesome Llm Apps
100+ AI Agent & RAG apps you can actually run — clone, customize, ship.
112.3K stars · 0 installsRAGFlow
Build document intelligence and RAG workflows for agents
81.6K stars · 0 installsPaddleOCR
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
78.9K stars · 0 installs