Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
$ npx skills add PaddlePaddle/PaddleOCRDecision filters
23 skills matching "image"
Best blend of quality, stars, freshness, and agent usage
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
$ npx skills add PaddlePaddle/PaddleOCRAI coding assistant skill (Claude Code, Codex, OpenCode, Cursor, Gemini CLI, and more). Turn any folder of code, SQL schemas, R scripts, shell scripts, docs, papers, images, or videos into a queryable knowledge graph. App code + database schema + infrastructure in one graph.
$ npx skills add safishamsi/graphifyAI generates natively editable PPTX from any document — real PowerPoint shapes with native animations, not images · by Hugo He
$ npx skills add hugohe3/ppt-masterThe container platform tailored for Kubernetes multi-cloud, datacenter, and edge management ⎈ 🖥 ☁️
$ npx skills add kubesphere/kubesphereA repository of skills designed to enhance daily work efficiency with Claude Code.
$ npx skills add JimLiu/baoyu-skillsYour AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (gpt, claude, gemini, llama, qwen, mistral). Get started - free.
$ npx skills add khoj-ai/khojAI-agent Skill for generating polished HTML slide decks: editorial magazine and Swiss layouts, image prompts, social covers, and a WebGL/low-power presentation runtime.
$ npx skills add op7418/guizang-ppt-skillPM Skills Marketplace: 100+ agentic skills, commands, and plugins — from discovery to strategy, execution, launch, and growth.
$ npx skills add phuryn/pm-skillsUltra-high-performance, secure, all-in-one acceleration engine for developer resources
$ npx skills add xixu-me/xgetCollection of AI-related utilities. Welcome to submit pull requests /收藏AI相关的实用工具,欢迎提交pull requests
$ npx skills add ikaijua/Awesome-AIToolsAI suite powered by state-of-the-art models and providing advanced AI/AGI functions. Includes AI personas, AGI functions, world-class Beam multi-model chats, text-to-image, voice, response streaming, code highlighting and execution, PDF import, presets for developers, much more. Deploy on-prem or in the cloud.
$ npx skills add enricoros/big-AGIConardLi's open-source Skills collection, featuring web design, knowledge retrieval, image generation, and more.
$ npx skills add ConardLi/garden-skillsAI Agent 驱动的开源视频生成工作台 — 小说→角色/场景/道具设计→剧本→分镜图→视频,跨镜头角色与场景一致 | Open-source AI video workspace powered by AI Agents, Nano Banana 2 & Veo 3.1 / Grok / Seedance / OpenAI
$ npx skills add ArcReel/ArcReel🏳️🌈 Media downloader from any sites, including Twitter, Reddit, Instagram, BlueSky, TikTok, Threads, Facebook, OnlyFans, YouTube, Pinterest, PornHub, XHamster, XVIDEOS, ThisVid etc.
$ npx skills add AAndyProgram/SCrawlerAI skill for OpenClaw & Claude Code — recommend from 10000+ Nano Banana Pro (Gemini) image prompts. Smart search by use case, content remix, sample images.
$ npx skills add YouMind-OpenLab/nano-banana-pro-prompts-recommend-skillThe process of extracting product data from Amazon using Python, including titles, ratings, prices, images, and descriptions.
$ npx skills add oxylabs/how-to-scrape-amazon-product-dataWebAI2API: 基于 Camoufox 的网页 AI 转 API 工具,支持 LMArena/Gemini等,多窗口并发与账号隔离。 | Web AI to OpenAI API via Camoufox. Supports LMArena/Gemini and more, multi-window concurrency & account isolation.
$ npx skills add foxhui/WebAI2APIMLX Studio - Home of JANG_Q - Image Gen/Edit + Chat/Code All in one - + OpenClaw (Anthropic API)
$ npx skills add jjang-ai/mlxstudioOpen‑WebUI Tools is a modular toolkit designed to extend and enrich your Open WebUI instance, turning it into a powerful AI workstation. With a suite of over 15 specialized tools, function pipelines, and filters, this project supports academic research, agentic autonomy, multimodal creativity, workflows, and more
$ npx skills add Haervwe/open-webui-toolsSelf-healing infrastructure for AI agent payments. 90.3% auto-recovery.
$ npx skills add adrianhihi/helixControl Figma from the command line. Full read/write access for AI agents — create shapes, text, components, set styles, export images. 100+ commands.
$ npx skills add dannote/figma-useGoogle, Naver multiprocess image web crawler (Selenium)
$ npx skills add YoongiKim/AutoCrawlerGoogle Search Results via SERP API pip Python Package
$ npx skills add serpapi/google-search-results-python