Overview
- Skill Key
- bobholamovic/paddleocr-text-recognition
- Author
- bobholamovic
- Source Repo
- openclaw/skills
- Version
- -
- Source Path
- skills/bobholamovic/paddleocr-text-recognition
- Latest Commit SHA
- 849634e1caa16472b813d888d43bba67e6c74020
Use this skill when users need to extract text from images, PDFs, or documents. Supports URLs and local files. Returns structured JSON containing recognized text.
Stars
0
Installs
0
Status
ACTIVE
Visibility
PUBLIC
直接复制以下提示词,发送给你的 AI 助手即可完成安装。
请先检查是否已安装 SkillHub 商店,若未安装,请根据 https://skillhub-1388575217.cos.ap-guangzhou.myqcloud.com/install/skillhub.md 安装 SkillHub 商店,然后安装 paddleocr-text-recognition 技能。 若已安装,则直接安装 paddleocr-text-recognition 技能。
# PaddleOCR Text Recognition Skill ## When to Use This Skill Invoke this skill in the following situations: - Extract text from images (screenshots, photos, scans) - Extract text from PDFs or document images - Extract text and positions from structured documents (invoices, receipts, forms, tables) - Extract text from URLs or local files that point to images/PDFs Do not use this skill in the following situations: - Plain text files that can be read directly with the Read tool - Code files or markdown documents - Tasks that do not involve image-to-text conversion ## How to Use This Skill **⛔ MANDATORY RESTRICTIONS - DO NOT VIOLATE ⛔** 1. **ONLY use PaddleOCR Text Recognition API** - Execute the script `python scripts/ocr_caller.py` 2. **NEVER read images directly** - Do NOT read images yourself 3. **NEVER offer alternatives** - Do NOT suggest "I can try to read it" or similar 4. **IF API fails** - Display the error message and STOP immediately 5. **NO fallback methods** - Do NOT attempt OCR any other way If the script execution fails (API not configured, network error, etc.): - Show the error message to the user - Do NOT offer to help using your vision capabilities - Do NOT ask "Would you like me to try reading it?" - Simply stop and wait for user to fix the configuration ### Basic Workflow 1. **Identify the input source**: - User provides URL: Use the `--file-url` parameter - User provides local file path: Use the `--file-path` parameter - User uploads image: Save it first, then use `--file-path` **Input type note**: - Supported file types depend on the model and endpoint configuration. - Follow the official endpoint/API documentation for the exact supported formats. 2. **Execute OCR**: ```bash python scripts/ocr_caller.py --file-url "URL provided by user" --pretty ``` Or for local files: ```bash python scripts/ocr_caller.py --file-path "file path" --pretty ``` **Default behavior: save raw JSON to a temp file**:...
openstockdata
OpenClaw Skill for stock data analysis
edholofy
University for AI agents. 92 courses, 4400+ scenarios, any model via OpenRouter. Auto-training loops generate per-model SKILL.md documents. Works with Claude Code, OpenClaw, Cursor, Windsurf. No fine-tuning required.
human-pages-ai
Search and hire real humans for tasks — photography, delivery, research, and more
lethehades
macOS WPS Office workflow helper skill for safer document preparation, conversion, export, and compatibility guidance
zseven-w
Reusable skill templates for OpenClaw AI agents. Templates for API integration, data processing, web scraping, CLI tools, and file processing.
capt-marbles
Generative Engine Optimization (GEO) for AI search visibility. Optimize content to appear in ChatGPT, Perplexity, Claude, and Google AI Overviews. Use when optimizing websites, pages, or content for LLM discoverability and citation.