TopRank Skills


paper-parse

Parse academic PDF papers into markdown with figure extraction.

Stars: 0
Installs: 0
Status: ACTIVE
Visibility: PUBLIC

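Installation

Copy the prompt below and send it to your AI assistant to complete the installation.

First check whether the SkillHub store is installed. If it is not, install it by following https://skillhub-1388575217.cos.ap-guangzhou.myqcloud.com/install/skillhub.md, then install the paper-parse skill. If it is already installed, install the paper-parse skill directly.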

Overview

Skill Key: chen-li-17/paper-parse-figures
Author: chen-li-17
Source Repo: openclaw/skills
Version: -
Source Path: skills/chen-li-17/paper-parse-figures
Latest Commit SHA: 9bcaec611d650ce3a55e0103fdbf97818cc8943a

Extracted Content

SKILL.md excerpt

# Paper Parse

Parse academic PDF papers into structured markdown with figure extraction using PyMuPDF.

## Usage

```bash
uv run {baseDir}/scripts/parse_paper.py --pdf /path/to/paper.pdf [--output-dir ./output]
```
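When the parser needs to be called from another Python program rather than from a shell, the same command line can be assembled as an argument list. The sketch below is only an illustration: `build_command` and `run_parser` are hypothetical helper names, and the only flags used are the `--pdf` and `--output-dir` options shown above.

```python
import os
import subprocess


def build_command(script, pdf, output_dir=None):
    """Assemble the `uv run` invocation as an argument list (hypothetical helper)."""
    # Expand a leading "~" ourselves, because no shell is involved.
    cmd = ["uv", "run", str(script), "--pdf", os.path.expanduser(str(pdf))]
    if output_dir is not None:
        # --output-dir is optional in the usage line, so add it only when given.
        cmd += ["--output-dir", str(output_dir)]
    return cmd


def run_parser(script, pdf, output_dir=None):
    """Run the parser and raise CalledProcessError on a non-zero exit code."""
    return subprocess.run(build_command(script, pdf, output_dir), check=True)
```

Passing a list instead of a single string avoids shell quoting problems with paths that contain spaces.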

## Output

The tool generates:

- `{paper_name}_content.md` - Full paper content in markdown
- `{paper_name}_parsed.json` - Structured metadata including:
  - Paper title
  - Number of pages
  - Extracted figures with captions and paths
- `cover_title_authors.png` - First-page snapshot focused on title + authors region
- `figures/` - Directory containing high-resolution figure screenshots
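The exact field names inside `{paper_name}_parsed.json` are not documented here, so a cautious first step is to load the file and inspect its top-level keys before relying on any of them. A minimal sketch, where `load_parsed` and `top_level_keys` are hypothetical helper names:

```python
import json
from pathlib import Path


def load_parsed(json_path):
    """Load the metadata JSON written by the parser and return it as Python data."""
    with Path(json_path).open(encoding="utf-8") as fh:
        return json.load(fh)


def top_level_keys(parsed):
    """Return the sorted top-level keys, or an empty list if the root is not an object."""
    # The schema is not specified in this excerpt, so nothing is assumed about key names.
    return sorted(parsed) if isinstance(parsed, dict) else []
```

For example, `top_level_keys(load_parsed("./parsed/my-paper_parsed.json"))` lists whatever keys the file actually contains.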

## Example

```bash
uv run scripts/parse_paper.py --pdf ~/papers/my-paper.pdf --output-dir ./parsed
```

Output structure:
```
./parsed/
├── my-paper_content.md
├── my-paper_parsed.json
└── figures/
    ├── figure_1.png
    ├── figure_2.png
    └── ...
```
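After a run, it can be useful to confirm that the entries shown in the tree above were actually produced. The following sketch checks only those three entries; `expected_outputs` and `missing_outputs` are hypothetical helper names, not part of the skill.

```python
from pathlib import Path


def expected_outputs(output_dir, paper_name):
    """Paths shown in the example tree for a given paper name (hypothetical helper)."""
    out = Path(output_dir)
    return [
        out / f"{paper_name}_content.md",
        out / f"{paper_name}_parsed.json",
        out / "figures",
    ]


def missing_outputs(output_dir, paper_name):
    """Return the expected paths that do not exist in the output directory."""
    return [p for p in expected_outputs(output_dir, paper_name) if not p.exists()]
```

An empty list from `missing_outputs("./parsed", "my-paper")` means all three entries are present.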

## Dependencies

- PyMuPDF (fitz) - PDF parsing and rendering
- pymupdf4llm - Markdown conversion

uv installs both automatically from the inline script metadata in `parse_paper.py`.
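Inline script metadata is a comment block at the top of a Python script, in the format defined by PEP 723, which `uv run` reads to build an environment. The header of `parse_paper.py` is not reproduced in this excerpt, so the fragment below is only a sketch of what such a block might look like: the `requires-python` bound is an assumption, and the real script may pin specific versions.

```python
# /// script
# requires-python = ">=3.10"   # assumed bound, not taken from the source
# dependencies = [
#     "pymupdf",       # PyMuPDF (imported as fitz): PDF parsing and rendering
#     "pymupdf4llm",   # Markdown conversion
# ]
# ///
```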
