Overview
- Skill Key
- ankylala/document-parser
- Author
- ankylala
- Source Repo
- openclaw/skills
- Version
- -
- Source Path
- skills/ankylala/document-parser
- Latest Commit SHA
- e757401241c1a9393c56161dd2e6e7eda06937b5
Stars
0
Installs
0
Status
ACTIVE
Visibility
PUBLIC
直接复制以下提示词,发送给你的 AI 助手即可完成安装。
请先检查是否已安装 SkillHub 商店,若未安装,请根据 https://skillhub-1388575217.cos.ap-guangzhou.myqcloud.com/install/skillhub.md 安装 SkillHub 商店,然后安装 Document Parser 技能。 若已安装,则直接安装 Document Parser 技能。
# document-parser
高精度文档解析技能,从 PDF、图片、Word 文档中提取结构化数据。
## 用途
- 解析 PDF、图片 (JPG/PNG)、Word 文档
- 版面分析与结构提取
- 表格识别(输出 HTML/Markdown)
- OCR 文字识别
- 印章检测
- 目录提取
## 命令
### 解析文档
```
document-parser parse <文件路径> [选项]
```
示例:
```
document-parser parse C:\docs\report.pdf
document-parser parse C:\docs\scan.jpg --layout --table
document-parser parse C:\docs\contract.docx --output markdown
```
### 查询任务状态
```
document-parser status <任务 ID>
```
## 参数说明
| 参数 | 说明 | 示例 |
|------|------|------|
| 文件路径 | PDF/图片/Word 文件路径 | `C:\docs\report.pdf` |
| --layout | 启用版面分析 | `--layout` |
| --table | 启用表格识别 | `--table` |
| --seal | 启用印章检测 | `--seal` |
| --output | 输出格式 (json/markdown/both) | `--output markdown` |
| --pages | 页码范围 | `--pages 1-5,8,10-12` |
## 配置
### 方式一:环境变量
```
DOCUMENT_PARSER_API_KEY=your_api_key
DOCUMENT_PARSER_BASE_URL=http://47.111.146.164:8088/taidp/v1/idp/general_parse
```
### 方式二:配置文件
在技能目录创建 `config.json`:
```json
{
"api_key": "your_api_key",
"base_url": "http://47.111.146.164:8088/taidp/v1/idp/general_parse"
}
```
## 输出格式
返回结构化 JSON 包含:
- **pages**: 解析后的页面数组
- **elements**: 版面元素(文本、表格、图片等)
- **markdown**: Markdown 格式文本
- **data**: 数据统计摘要
## 依赖
- requests
- python-docx (Word 支持)
- Pillow (图片处理)
## 错误码
| 错误码 | 消息 | 说明 |
|--------|------|------|
| 10000 | Success | 识别成功 |
| 10001 | Missing parameter | 参数缺失 |
| 10002 | Invalid parameter | 非法参数 |
| 10003 | Invalid file | 文件格式非法 |
| 10004 | Failed to recognize | 识别失败 |
| 10005 | Internal error | 内部错误 |
# Document Parser > OpenClaw Skill - High-precision document parsing Extract structured data from PDF, images, and Word documents. ## Features - ✅ **Multi-format Support**: PDF, Images (JPG/PNG), Word documents - ✅ **Layout Analysis**: Automatically detect and structure document elements - ✅ **Table Recognition**: Extract tables with HTML and Markdown outputs - ✅ **OCR Support**: Recognize text in scanned documents and images - ✅ **Seal Detection**: Detect stamps and seals in documents - ✅ **TOC Extraction**: Extract table of contents from documents - ✅ **Cross-page Merge**: Automatically merge content across pages ## Quick Start ### Installation ```bash # Install via ClawHub openclaw skills install document-parser # Or manual installation (local development) cd E:\skills\document-parser pip install -r requirements.txt ``` ### Configuration **Option 1: Environment Variables (Recommended)** ```bash # Windows PowerShell $env:DOCUMENT_PARSER_API_KEY="your_api_key" setx DOCUMENT_PARSER_API_KEY "your_api_key" # Optional: Custom API endpoint $env:DOCUMENT_PARSER_BASE_URL="http://your-server:8088/taidp/v1/idp/general_parse" ``` **Option 2: Configuration File** ```bash cd E:\skills\document-parser copy config.example.json config.json # Edit config.json with your API Key ``` ### Usage #### Parse a Document ```bash # Basic parsing document-parser parse "C:\docs\report.pdf" # Enable layout analysis and table recognition document-parser parse "C:\docs\report.pdf" --layout --table # Specify output format document-parser parse "C:\docs\scan.jpg" --output markdown # Specify page range document-parser parse "C:\docs\book.pdf" --pages 1-5,10-15 ``` #### Query Task Status ```bash document-parser status <task_id> ``` ## Parameters | Parameter | Type | Required | Description | |-----------|------|----------|-------------| | file | string | Yes | PDF/Image/Word file path | | --layout | flag | No | Enable layout analysis | | --table | flag | No | Enable table r...
edholofy
University for AI agents. 92 courses, 4400+ scenarios, any model via OpenRouter. Auto-training loops generate per-model SKILL.md documents. Works with Claude Code, OpenClaw, Cursor, Windsurf. No fine-tuning required.
openstockdata
OpenClaw Skill for stock data analysis
lethehades
macOS WPS Office workflow helper skill for safer document preparation, conversion, export, and compatibility guidance
capt-marbles
Generative Engine Optimization (GEO) for AI search visibility. Optimize content to appear in ChatGPT, Perplexity, Claude, and Google AI Overviews. Use when optimizing websites, pages, or content for LLM discoverability and citation.
capt-marbles
Control PhantomBuster automation agents via API. List agents, launch automations, get output/results, check status, and abort running agents. Use when the user needs to run LinkedIn scraping, Twitter automation, lead generation phantoms, or any PhantomBuster workflow.
capt-marbles
Web scraping and crawling with Firecrawl API. Fetch webpage content as markdown, take screenshots, extract structured data, search the web, and crawl documentation sites. Use when the user needs to scrape a URL, get current web info, capture a screenshot, extract specific data from pages, or crawl docs for a framework/library.