Image Text Localization Recognition

STRONG · 74
Community indexed

A general list of resources to image text localization and recognition 场景文本位置感知与识别的论文资源与实现合集 シーンテキストの位置認識と識別のための論文リソースの要約

Downloads 0
Stars 959
Version 1.0.0
Quality 51/100 · Needs review
Trust 74/100 · Strong shortlist
Audit 63/100 · Needs review

Supply asset profile

Research and knowledge work

Deep research, source comparison, literature review, RAG, knowledge search, and reports.

Browse track

Scenario

Document processing

I need my agent to read PDFs, extract tables, and turn documents into structured data.

Agent fit

Claude Code + CLI + Codex

Codex, Claude Code, Cursor, CLI, or custom agents.

Install

Ready

npx skills add whitelok/image-text-localization-recognition

Maintenance

stale

3y since push

Risk

Needs review

License is unclear

GitHub quality

959

51/100 quality · 74/100 trust

Coverage tags

ResearchDocument processingdocument-processingocrdocuments

Review notes

License is unclear · Repository appears stale

Agent adoption scorecard

Trust, audit, and install readiness at a glance

These scores combine public repository metadata, OpenAgentSkill review signals, maintenance freshness, and install readiness. They are a shortlist signal, not a replacement for human review.

Quality

Needs review
51

Inspect the repository carefully before adding it to an agent workflow.

Trust

Strong shortlist
74

Good trust signals with a few areas worth checking before rollout.

Audit

Needs review
63

Install readiness, security metadata, maintenance, and adoption risk.

Trust Score v3

Human review before install

Test in a sandbox workflow and compare its install path with close alternatives.

OCRCodexClaude CodeCursorOpenAgentSkill CLI

Stars

959 GitHub stars

Repo activity

959 stars, 232 forks

Maintenance

3y since push

License

Unknown

Install

npx skills add whitelok/image-text-localization-recognition

Install safety

standard package or runtime install path

Permission surface

filesystem or document access

Docs

Strong README/SKILL.md context

Risk summary

Review before production

  • License is unclear
  • Repository looks stale
  • Quality score needs review
  • Recent maintenance: 3y since push

Install readiness

Install path available

  • Install path is available
  • Repository evidence is available
  • License is unclear
  • 3y since push

Agent-readable metadata

Machine-readable decision data for this skill.

Use this block or the embedded JSON to decide whether an agent should install this skill, choose an alternative, or ask for human review first.

Open JSON

Suited tasks

  • Document processing workflows
  • Claude Code teams
  • teams that value GitHub adoption signals
  • Read uploaded files
  • Extract structured fields

Suited agents

OCRCodexClaude CodeCursorOpenAgentSkill CLICLI

Trust and risk

Trust score
74/100
Risk level
Needs review
Auto install
review

Install command

npx skills add whitelok/image-text-localization-recognition

Do not use when

  • teams that require actively maintained dependencies
  • production agents without a repository review
  • Repository looks stale
  • No OpenAgentSkill engagement data yet
  • License is unclear

Agent safety v2

47/100 · Avoid automatic install

Experimentalreview

Sparse or mixed signals. Useful for discovery, but not for autonomous installation.

Test manually in an isolated workspace and compare against safer alternatives.

Resolve via API

medium

Network access

Skill likely fetches remote pages, APIs, repositories, or external services.

medium

Filesystem access

Skill may read or write project files, documents, generated artifacts, or local workspace state.

  • License is unclear

Install targets

Install this skill in your agent workflow

Copy the registry command or an agent-specific install prompt for Codex, Claude Code, and Cursor.

skill install

OpenAgentSkill CLI

Use the registry command when your workflow supports the OpenAgentSkill installer.

$ npx skills add whitelok/image-text-localization-recognition

Agent resolve plan

Let an agent verify fit before installing.

The Resolve API returns the selected skill, alternatives, safety policy, audit notes, install target, and copy-paste prompt an agent can follow without scraping this page.

Open text plan

Agent should check

  • Task fit and alternatives from Resolve API.
  • Audit score, trust score, and safety policy warnings.
  • Install target compatibility for Codex, Claude Code, Cursor, or CLI.

Copy prompt

Task: Use Image Text Localization Recognition in this workspace.
Resolve first: https://www.openagentskill.com/api/agent/resolve?task=Use%20Image%20Text%20Localization%20Recognition%20for%20an%20agent%20workflow&agent=codex&max_risk=medium
Review install handoff: https://www.openagentskill.com/api/skills/whitelok-image-text-localization-recognition/install
Install command: npx skills add whitelok/image-text-localization-recognition
Before running it, summarize audit warnings, required permissions, and the fallback skill if install is risky.

Agent handoff

Give an agent the install path, not another directory page.

Use the public install endpoint to fetch the command, safety checklist, target prompts, and canonical links for this skill.

Open install API

Agent prompt

Use Image Text Localization Recognition for this task. Review https://www.openagentskill.com/api/skills/whitelok-image-text-localization-recognition/install, then install with: npx skills add whitelok/image-text-localization-recognition

Registry metadata

Agent-readable profile for automatic skill selection.

This page exposes the same decision, trust, audit, use-case, and install signals through the Registry API, so agents can rank this skill without scraping the UI.

Open manifest

Agent fit

53/100

Document processing

Platforms

OCR, Claude Code

Audit report

Needs review · 63/100

Review install readiness, maintenance, trust, quality, and metadata warnings before adding this skill to an agent workflow.

View audit report

Agent decision cockpit

Needs validation for Document processing

Do a manual repository review before adding this to an agent workflow.

53
Readiness
Review
Stage

Role in stack

Needs validation

Primary fit

Document processing

Trust label

Needs manual review

Install path

Command ready

Use when

  • Document processing workflows
  • Claude Code teams
  • teams that value GitHub adoption signals

Evidence

  • 959 GitHub stars
  • install command or GitHub repo available
  • 51/100 quality profile

Review first

  • Repository looks stale
  • No OpenAgentSkill engagement data yet

Implementation path

  1. 1Install it in a sandbox agent and run one Document processing task end to end.
  2. 2Compare output quality, latency, and failure behavior against at least one alternative.
  3. 3Promote it into production only after reviewing repository permissions, license, and maintenance signals.

Trust profile

Strong shortlist

Good trust signals with a few areas worth checking before rollout.

74
Trust score

GitHub adoption

INFO

959 GitHub stars

Stars/forks activity

INFO

959 stars, 232 forks; issue activity unavailable in current metadata

Recent maintenance

FIX

3y since push

License clarity

CHECK

Unknown

Good signals

  • AI review approved
  • Install path is available
  • Repository evidence is available
  • Meaningful GitHub adoption signal
  • Install command has no obvious high-risk pattern

Review before install

  • License is unclear
  • Repository looks stale
  • Quality score needs review
  • Recent maintenance: 3y since push
  • License clarity: Unknown

Recommended action

Test in a sandbox workflow and compare its install path with close alternatives.

Quality profile

Needs review candidate for agent workflows

Inspect the repository carefully before adding it to an agent workflow.

51
GitHub stars
959
Freshness
3y ago
Install ready
Yes
License
Unknown
Check before install: Repository looks stale

Workflow fit

Use this skill in these scenarios

Stack fit

Add it to a complete workflow

Alternative shortlist

Compare before you install

Similar skills in this category, ranked with the same readiness and quality signals.

Compare all

Overview

A general list of resources to image text localization and recognition 场景文本位置感知与识别的论文资源与实现合集 シーンテキストの位置認識と識別のための論文リソースの要約

Imported by the skill-only GitHub discovery pipeline because it matches agent skill, automation, domain workflow, RAG, document-processing, data, finance, security, or developer-tool signals. Protocol-server projects are excluded from automated imports.

Platform Compatibility

ocrFULL

Technical Details

Version
1.0.0
License
Unknown
Last Updated
6/14/2026
Published
6/12/2026

Frameworks & Tools

OCR

Decision snapshot

Needs validation

53
Ready
Review
Stage

959 GitHub stars

Audit Snapshot

Install and adoption review

63
Needs review
Security
82/100
Maintenance
20/100
Install
92/100
Open full audit

Growth loop

Share this skill

X

Scenario-led draft for Image Text Localization Recognition, with the OpenAgentSkill Update theme and canonical URL.

OpenAgentSkill Update
Today: Image Text Localization Recognition

Use it when your agent needs to turn docs, data, or knowl...

959 stars - document-processing
Link: https://www.openagentskill.com/skills/whitelok-image-text-localization-recognition?ref=x
#AIAgents #OpenAgentSkill
Open X draft
Optional reply with install command
Link for Image Text Localization Recognition:
https://www.openagentskill.com/skills/whitelok-image-text-localization-recognition?ref=x

Install: npx skills add whitelok/image-text-localization-recognition

Listing source

Community indexed

Claimable

This listing was indexed from public sources and is not marked official until a maintainer claim is approved.

Creator
whitelok
Indexed by
OpenAgentSkill community index

Attribution links to the public repository or creator profile. Creators can claim the listing to update ownership signals.

Claim this skill

Owner claim

Claim this skill listing

This community indexed listing is attributed to whitelok but is not marked official yet. Claim it to add a verified owner signal and make future launch, install, and audit updates easier to trust.

README badge

Add this badge to your GitHub README to show the listing, trust score, and install handoff.

[![OpenAgentSkill](https://www.openagentskill.com/api/badge/whitelok-image-text-localization-recognition)](https://www.openagentskill.com/skills/whitelok-image-text-localization-recognition)

Author

W

whitelok

@whitelok

Platform Fit

Health Signals

GitHub stars
959
Quality score
40/100
Last GitHub push
Sep 17, 2023
Framework hints
1
OpenAgentSkill views
0
Install copies
0
Outbound clicks
0

Community Signal

Share whether this skill looks useful for your agent workflow. Aggregated feedback improves rankings over time.

Trust & Safety

Strong shortlist

74
  • GitHub adoption959 GitHub starsINFO
  • Stars/forks activity959 stars, 232 forks; issue activity unavailable in current metadataINFO
  • Recent maintenance3y since pushFIX
  • License clarityUnknownCHECK
  • README/SKILL.md completenessMetadata includes enough usage and workflow contextPASS
  • Dependency/runtime riskno major dependency risk hints in public metadataPASS