76 skills matching "language"
Best blend of quality, stars, freshness, and agent usage
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
$ npx skills add PaddlePaddle/PaddleOCR

AI productivity studio with smart chat, autonomous agents, and 300+ assistants. Unified access to frontier LLMs
$ npx skills add CherryHQ/cherry-studio

Give your AI agent a web browser
$ npx skills add browser-use/browser-use

[EMNLP2025] "LightRAG: Simple and Fast Retrieval-Augmented Generation"
$ npx skills add HKUDS/LightRAG

🐙 Guides, papers, lessons, notebooks and resources for prompt engineering, context engineering, RAG, and AI Agents.
$ npx skills add dair-ai/Prompt-Engineering-Guide

Open-source AI orchestration framework for building context-engineered, production-ready LLM applications. Design modular pipelines and agent workflows with explicit control over retrieval, routing, memory, and generation. Built for scalable agents, RAG, multimodal applications, semantic search, and conversational systems.
$ npx skills add deepset-ai/haystack

Build browser agents with natural language actions
$ npx skills add browserbase/stagehand

DeepTutor -- Agent-native, Open-sourced Personalized Tutoring. https://deeptutor.info/.
$ npx skills add HKUDS/DeepTutor

Control web interfaces with natural language agents
$ npx skills add alibaba/page-agent

A generative speech model for daily dialogue.
$ npx skills add 2noise/ChatTTS

💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows
$ npx skills add neuml/txtai

Automate your mobile devices with natural language commands - an LLM-agnostic mobile Agent 🤖
$ npx skills add droidrun/mobilerun

⚙️🦀 Build modular and scalable LLM Applications in Rust
$ npx skills add 0xPlaygrounds/rig

🧊 Open source LLM observability platform. One line of code to monitor, evaluate, and experiment. YC W23 🍓
$ npx skills add Helicone/helicone

Harness LLMs with Multi-Agent Programming
$ npx skills add langroid/langroid

[Three Years of Interviews, Five Years of Mock Exams] An interview handbook for AIGC/LLM/AI Agent algorithm engineers. Covers practical interview and written-test experience and core knowledge across the AI industry, including AIGC, LLMs, AI Agents, traditional deep learning, autonomous driving, machine learning, computer vision, natural language processing, reinforcement learning, big data mining, embodied intelligence, the metaverse, and AGI.
$ npx skills add WeThinkIn/AIGC-Interview-Book

🚀 EvoAgentX: Building a Self-Evolving Ecosystem of AI Agents
$ npx skills add EvoAgentX/EvoAgentX

AI Search & RAG Without Moving Your Data. Get instant answers from your company's knowledge across 100+ apps while keeping data secure. Deploy in minutes, not months.
$ npx skills add swirlai/swirl-search

The Python Code Tutorials
$ npx skills add x4nth055/pythoncode-tutorials

Awesome-GraphRAG: A curated list of resources (surveys, papers, benchmarks, and open-source projects) on graph-based retrieval-augmented generation.
$ npx skills add DEEP-PolyU/Awesome-GraphRAG

Awesome resources for in-context learning and prompt engineering: Mastery of the LLMs such as ChatGPT, GPT-3, and FlanT5, with up-to-date and cutting-edge updates.
$ npx skills add EgoAlpha/prompt-in-context-learning

NestJS Helper + AI Chatbot Development
$ npx skills add samchon/nestia

📰 Must-read papers and blogs on LLM-based Long Context Modeling 🔥
$ npx skills add Xnhyacinth/Awesome-LLM-Long-Context-Modeling

Distributed web crawler admin platform for spider management regardless of languages and frameworks.
$ npx skills add crawlab-team/crawlab

Eko (Eko Keeps Operating) - Build Production-ready Agentic Workflow with Natural Language - eko.fellou.ai
$ npx skills add FellouAI/eko

A community-driven collection of RAG (Retrieval-Augmented Generation) frameworks, projects, and resources. Contribute and explore the evolving RAG ecosystem.
$ npx skills add Andrew-Jang/RAGHub

Interactive architecture diagrams for codebases
$ npx skills add CodeBoarding/CodeBoarding

🤖 Learn for free how to build an end-to-end production-ready LLM & RAG system using LLMOps best practices: ~ source code + 12 hands-on lessons
$ npx skills add decodingai-magazine/llm-twin-course

AI agent framework for plan-first development workflows with approval-based execution. Multi-language support (TypeScript, Python, Go, Rust) with automatic testing, code review, and validation built for OpenCode
$ npx skills add darrenhinde/OpenAgentsControl

Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.
$ npx skills add NVIDIA/GenerativeAIExamples

OctoTools: An agentic framework with extensible tools for complex reasoning
$ npx skills add octotools/octotools

Yomitoku is an AI-powered document image analysis package designed specifically for the Japanese language.
$ npx skills add kotaro-kinoshita/yomitoku

AgentQL is a suite of tools for connecting your AI to the web. Featuring a query language and Playwright integrations for interacting with elements and extracting data quickly, precisely, and at scale. Includes REST API, Python and JavaScript SDKs, browser debugger.
$ npx skills add tinyfish-io/agentql

A self-hosted file conversion server & share tool that supports 445 file formats in 13 languages.
$ npx skills add zelon88/HRConvert2

Observal is an Observability and Evaluation platform for human-in-the-loop agents
$ npx skills add BlazeUp-AI/Observal

SonarSource Static Analyzer for JavaScript and TypeScript
$ npx skills add SonarSource/SonarJS

Mouseover Translate Any Language At Once - Chrome Extension: PDF Translator, EBOOK, EPUB, OCR, TTS, NETFLIX, YOUTUBE DUAL SUBTITLES, GOOGLE DOCS, AI, VIEWER, GMAIL, WRITING, IMAGE, DUAL SUBS, MANGA, HOVER, DICTIONARY, WEBTOON, EDGE, JAPANESE, ENGLISH
$ npx skills add ttop32/MouseTooltipTranslator

☕ SonarSource Static Analyzer for Java Code Quality and Security
$ npx skills add SonarSource/sonar-java

A browser-based desktop where an AI agent operates every app through natural language.
$ npx skills add MiniMax-AI/OpenRoom

[KDD'2026] "VideoRAG: Chat with Your Videos"
$ npx skills add HKUDS/VideoRAG

An artifact of fully-specified annotations to power static-analysis checks, beginning with nullness analysis.
$ npx skills add jspecify/jspecify

Crawl a website starting from a URL, find relevant pages, and extract data – all guided by your natural language prompt.
$ npx skills add oxylabs/ai-crawler-py

Python Type Checker / Language Server
$ npx skills add zubanls/zuban

SPEC-First Agentic Development Kit for Claude Code: 24 AI agents + 52 skills with TDD/DDD quality gates, 16-language projects, 4-language docs. Go CLI, zero deps.
$ npx skills add modu-ai/moai-adk

Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflows - all through natural language commands.
$ npx skills add codeaashu/claude-code

OCR engine for all the languages
$ npx skills add mittagessen/kraken

Get clean data from tricky documents, powered by vision-language models ⚡
$ npx skills add emcf/thepipe

A CLI text-to-speech tool using the Kokoro model, supporting multiple languages, voices (with blending), and various input formats including EPUB books and PDF documents.
$ npx skills add nazdridoy/kokoro-tts

Pocket Flow: Codebase to Tutorial
$ npx skills add The-Pocket/PocketFlow-Tutorial-Codebase-Knowledge

AI Map is an AI-powered website mapping tool by Oxylabs AI Studio that uses natural language prompts to intelligently discover and extract relevant URLs from any website.
$ npx skills add oxylabs/ai-map-py

AI Browser Agent is an advanced Browser AI tool developed by Oxylabs AI Studio that automates real user browsing tasks using natural language instructions.
$ npx skills add oxylabs/browser-agent-py

Structured data gathering from any website using AI-powered scraper, crawler, and browser automation. Scraping and crawling with natural language prompts. Equip your LLM agents with fresh data. AI Studio Python SDK for intelligent web data gathering.
$ npx skills add oxylabs/oxylabs-ai-studio-py

Hands-on Ollama: deploy large models on a CPU. Read online: https://datawhalechina.github.io/handy-ollama/
$ npx skills add datawhalechina/handy-ollama

🥂 Gracefully face hCaptcha challenges with a multimodal large language model.
$ npx skills add QIN2DIM/hcaptcha-challenger

SoTA production-ready AI retrieval system. Agentic Retrieval-Augmented Generation (RAG) with a RESTful API.
$ npx skills add SciPhi-AI/R2R

A high-quality PDF to Markdown tool based on large language model visual recognition.
$ npx skills add MarkPDFdown/markpdfdown

Intelligent proxy pool for Humans™ to extract content from the internet and build your own Large Language Models in this new AI era
$ npx skills add MikeChongCan/scylla

A repo listing papers related to LLM-based agents
$ npx skills add AGI-Edgerunners/LLM-Agents-Papers

A local, offline (after initial setup), portable OCR tool that can process images and PDF files using DeepSeek-OCR AI (running directly on your machine).
$ npx skills add th1nhhdk/local_ai_ocr

A collection of awesome web crawlers and spiders in different languages
$ npx skills add BruceDone/awesome-crawler

[ACL2026] "MiniRAG: Making RAG Simpler with Small and Open-Sourced Language Models"
$ npx skills add HKUDS/MiniRAG

A lightweight terminal coding assistant with Claude Code-like workflow, tool loop, and TUI architecture, built for learning and experimentation. Multi-language support: TypeScript, Python, and Rust implementations available now.
$ npx skills add LiuMengxuan04/MiniCode

Build ChatGPT over your data, all with natural language
$ npx skills add run-llama/rags

The only fully local production-grade Super SDK that provides a simple, unified, and powerful interface for calling more than 200 LLMs.
$ npx skills add adaline/gateway

AI-powered UI generation and publishing low-code platform, built on TailwindCSS, enabling rapid drag-and-drop visual creation of modern responsive UIs, dynamic customizable components, multi-theme, and multi-language web applications.
$ npx skills add biaogebusy/web-builder

bkit Vibecoding Kit - PDCA methodology + Claude Code mastery for AI-native development
$ npx skills add popup-studio-ai/bkit-claude-code

Generate interactive call graphs for various languages
$ npx skills add chanhx/crabviz

Translate scientific papers in LaTeX, especially arXiv papers
$ npx skills add SUSYUSTC/MathTranslate

Interact with your SQL database: natural language to SQL using LLMs
$ npx skills add Dataherald/dataherald

[EMNLP-2024] Build multimodal language agents for fast prototyping and production
$ npx skills add om-ai-lab/OmAgent

The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). It allows users to chat with LLM models, execute structured function calls, and get structured output. It also works with models not fine-tuned for JSON output and function calls.
$ npx skills add Maximilian-Winter/llama-cpp-agent

AIRecon is an autonomous cybersecurity agent that combines a self-hosted Large Language Model (Ollama) with a Kali Linux Docker sandbox and a Textual TUI. It is designed to automate security assessments, penetration testing, and bug bounty reconnaissance, without any API keys or cloud dependency.
$ npx skills add pikpikcu/airecon

Awesome papers involving LLMs in Social Science.
$ npx skills add ValueByte-AI/Awesome-LLM-in-Social-Science

[ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling
$ npx skills add SqueezeAILab/LLMCompiler

Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages.
$ npx skills add yobix-ai/extractous

The official implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval
$ npx skills add parthsarthi03/raptor