Overview
- Skill Key
- aidescend/multimedia-to-obsidian
- Author
- aidescend
- Source Repo
- openclaw/skills
- Version
- -
- Source Path
- skills/aidescend/multimedia-to-obsidian
- Latest Commit SHA
- bf047c972462175596715cee0f2888f24102ba77
将任意多媒体文档导入 Obsidian 知识库。支持 PPT、PDF、DOCX、图片等格式,自动提取每一页/每一张图片,使用多模态模型理解内容,生成文字描述后存入 OB。适用于:(1) 整理培训课件 (2) 迁移笔记到 OB (3) 将图片资料转为结构化知识。
Stars
0
Installs
0
Status
ACTIVE
Visibility
PUBLIC
直接复制以下提示词,发送给你的 AI 助手即可完成安装。
请先检查是否已安装 SkillHub 商店,若未安装,请根据 https://skillhub-1388575217.cos.ap-guangzhou.myqcloud.com/install/skillhub.md 安装 SkillHub 商店,然后安装 multimedia-to-obsidian 技能。 若已安装,则直接安装 multimedia-to-obsidian 技能。
# 多媒体导入 Obsidian
将 PPT、PDF、DOCX、图片等多媒体文档导入 Obsidian,每页/每张图片自动生成文字描述。
## 功能特点
- 支持多种格式:PPT、PDF、DOCX、图片(PNG、JPG 等)
- 自动提取媒体文件
- 调用多模态模型理解内容
- 自动分类到 OB 仓库
## 使用方法
### 前提条件
1. 安装依赖:
```bash
brew install pandoc poppler
pip install python-pptx pillow
```
2. 设置环境变量(至少一个):
```bash
# MiniMax
export MINIMAX_API_KEY="your-key"
export MINIMAX_API_HOST="https://api.minimaxi.com"
# OpenAI
export OPENAI_API_KEY="your-key"
# Anthropic
export ANTHROPIC_API_KEY="your-key"
```
### 调用脚本
```bash
python3 <skill>/scripts/media_to_obsidian.py <源文件/目录> <输出OB目录> [options]
```
**参数:**
- 源文件/目录:单个文件或包含多媒体文件的目录
- 输出目录:Obsidian 仓库路径
- 选项:
- `--format ppt|pdf|docx|image|all` : 指定格式,默认 all
- `--model minimax|openai|anthropic` : 指定模型,默认 minimax
- `--category 分类名` : 指定输出分类目录
### 示例
```bash
# 导入 PPT 课件
python3 ~/.openclaw/workspace/skills/multimedia-to-obsidian/scripts/media_to_obsidian.py \
/path/to/培训课件.pptx \
/path/to/Obsidian \
--format ppt --category 培训
# 导入图片到 OB
python3 ~/.openclaw/workspace/skills/multimedia-to-obsidian/scripts/media_to_obsidian.py \
/path/to/images/ \
/path/to/Obsidian \
--format image --category 素材
# 批量导入目录下的所有文档
python3 ~/.openclaw/workspace/skills/multimedia-to-obsidian/scripts/media_to_obsidian.py \
/path/to/documents/ \
/path/to/Obsidian
```
## 输出结构
```
Obsidian仓库/
├── 培训/
│ └── 培训课件.md (含每页图片理解)
├── 素材/
│ └── 图片1.md
│ └── 图片2.md
└── 导入/
└── 文档名.md
```
## 工作流程
1. **提取媒体**:从 PPT/PDF/DOCX 提取每一页为图片
2. **理解内容**:调用多模态模型理解每张图片
3. **生成描述**:将图片描述写入 Markdown
4. **分类存储**:按指定分类存入 OB
edholofy
University for AI agents. 92 courses, 4400+ scenarios, any model via OpenRouter. Auto-training loops generate per-model SKILL.md documents. Works with Claude Code, OpenClaw, Cursor, Windsurf. No fine-tuning required.
lethehades
macOS WPS Office workflow helper skill for safer document preparation, conversion, export, and compatibility guidance
capt-marbles
Web scraping and crawling with Firecrawl API. Fetch webpage content as markdown, take screenshots, extract structured data, search the web, and crawl documentation sites. Use when the user needs to scrape a URL, get current web info, capture a screenshot, extract specific data from pages, or crawl docs for a framework/library.
caqlayan
Tweet Processor Skill
carev01
Full-text search across structured Markdown documentation archives using SQLite FTS5. Use when you need to search large collections of Markdown articles that are separated by "---" delimiters and contain source URLs (marked with "*Source:" pattern). Provides fast BM25-ranked search with automatic source URL extraction for citations. Ideal for research, documentation lookups, and knowledge base exploration. Requires indexing documentation first with `docs.py index`.
camelsprout
DuckDB CLI specialist for SQL analysis, data processing and file conversion. Use for SQL queries, CSV/Parquet/JSON analysis, database queries, or data conversion. Triggers on "duckdb", "sql", "query", "data analysis", "parquet", "convert data".