paper-parse

Parse academic PDF papers into markdown with figure extraction.

View Source SKILL.md

Stars

Installs

Status

ACTIVE

Visibility

PUBLIC

安装方式

直接复制以下提示词，发送给你的 AI 助手即可完成安装。

请先检查是否已安装 SkillHub 商店，若未安装，请根据 https://skillhub-1388575217.cos.ap-guangzhou.myqcloud.com/install/skillhub.md 安装 SkillHub 商店，然后安装 paper-parse 技能。若已安装，则直接安装 paper-parse 技能。

Overview

Skill Key: chen-li-17/paper-parse-figures
Author: chen-li-17
Source Repo: openclaw/skills
Version: -
Source Path: skills/chen-li-17/paper-parse-figures
Latest Commit SHA: 9bcaec611d650ce3a55e0103fdbf97818cc8943a

Extracted Content

SKILL.md excerpt

# Paper Parse

Parse academic PDF papers into structured markdown with figure extraction using PyMuPDF.

## Usage

```bash
uv run {baseDir}/scripts/parse_paper.py --pdf /path/to/paper.pdf [--output-dir ./output]
```

## Output

The tool generates:

- `{paper_name}_content.md` - Full paper content in markdown
- `{paper_name}_parsed.json` - Structured metadata including:
  - Paper title
  - Number of pages
  - Extracted figures with captions and paths
- `cover_title_authors.png` - First-page snapshot focused on title + authors region
- `figures/` - Directory containing high-resolution figure screenshots

## Example

```bash
uv run scripts/parse_paper.py --pdf ~/papers/my-paper.pdf --output-dir ./parsed
```

Output structure:
```
./parsed/
├── my-paper_content.md
├── my-paper_parsed.json
└── figures/
    ├── figure_1.png
    ├── figure_2.png
    └── ...
```

## Dependencies

- PyMuPDF (fitz) - PDF parsing and rendering
- pymupdf4llm - Markdown conversion

These are automatically managed by uv via the inline script metadata.

Related Claw Skills

openstockdata

stock-data-skill

★ 4

OpenClaw Skill for stock data analysis

edholofy

dojo.md

★ 4

University for AI agents. 92 courses, 4400+ scenarios, any model via OpenRouter. Auto-training loops generate per-model SKILL.md documents. Works with Claude Code, OpenClaw, Cursor, Windsurf. No fine-tuning required.

lethehades

wps-macos-helper

★ 1

macOS WPS Office workflow helper skill for safer document preparation, conversion, export, and compatibility guidance

capt-marbles

geo-optimization

★ 1

Generative Engine Optimization (GEO) for AI search visibility. Optimize content to appear in ChatGPT, Perplexity, Claude, and Google AI Overviews. Use when optimizing websites, pages, or content for LLM discoverability and citation.

camopel

finviz-crawler

★ 0

Continuous financial news crawler for finviz.com with SQLite storage, article extraction, and query tool. Use when monitoring financial markets, building news digests, or needing a local financial news database. Runs as a background daemon or systemd service.

camopel

ddgs-search

★ 0

Free multi-engine web search via ddgs CLI (DuckDuckGo, Google, Bing, Brave, Yandex, Yahoo, Wikipedia) + arXiv API search. No API keys required. Use when user needs web search, research paper discovery, or when other skills need a search backend. Drop-in replacement for web-search-plus.

Analysis Signals

Dependencies

gh uv