Home / Claw Skills / Document / docx-to-html

Official OpenClaw rules 36%

docx-to-html

Use this skill whenever the user has a DOCX file (.docx) and wants to convert, read, view, extract content from, or process it in any way — including summarization, displaying in a browser, extracting tables or lists, or feeding into AI pipelines. Always use this skill for any task involving .docx files, even if the request seems simple. Triggers include: 'convert docx', 'open word file', 'read word document', 'extract tables from docx', or any mention of a .docx filename.

View Source SKILL.md

Stars

Installs

Status

ACTIVE

Visibility

PUBLIC

安装方式

直接复制以下提示词，发送给你的 AI 助手即可完成安装。

请先检查是否已安装 SkillHub 商店，若未安装，请根据 https://skillhub-1388575217.cos.ap-guangzhou.myqcloud.com/install/skillhub.md 安装 SkillHub 商店，然后安装 docx-to-html 技能。若已安装，则直接安装 docx-to-html 技能。

Overview

Skill Key: bibekyess/docx-to-html
Author: bibekyess
Source Repo: openclaw/skills
Version: -
Source Path: skills/bibekyess/docx-to-html
Latest Commit SHA: 46a10c5172911a1fb182b922d361ff752b782afe

Extracted Content

SKILL.md excerpt

# DOCX to HTML Converter

This skill provides a straightforward method to convert Microsoft Word (.docx) documents into clean, semantic HTML, making them suitable for various web-based and AI-driven applications.

## Compatibility

- **Python 3** (for the conversion wrapper)
- **Node.js** with `mammoth` installed (core conversion engine)

To install Node.js dependencies, run once from the `scripts/` directory:
```bash
npm install
```

## Use Cases

- **Browser-Based Viewing**: Convert DOCX documents for display in web browsers without requiring Microsoft Word.
- **AI-Ready Content**: Prepare DOCX content for LLMs for tasks like summarization, Q&A, and semantic search.
- **Web Integration**: Integrate Word document content into web applications, CMS, or online editors.
- **Data Extraction**: Extract structured data (tables, lists, headings) from DOCX files for automated reporting and analysis.
- **Search and Indexing**: Enable full-text and vector search by converting DOCX content into easily indexable HTML.

## Workflow

1. **Locate DOCX File**: Identify the path to the `.docx` file to convert.

2. **Run Conversion Script**: Execute the Python wrapper from the skill's `scripts/` directory:
```bash
python3 <skill-dir>/scripts/convert.py <input_path.docx> <output_path.html>
```
Replace `<skill-dir>` with the actual path where this skill is installed.

3. **Verify Output**: Open the generated `.html` file in a browser and check:
- Headings (`<h1>`, `<h2>`, etc.) appear at the correct hierarchy levels
- Tables render with the expected rows and columns
- Lists appear as bullet or numbered items (not plain text)
- Bold, italic, and inline formatting are preserved
- Images are visible (embedded as base64 by default)

4. **Process HTML**: Use the resulting HTML for further tasks like summarization, indexing, or display.

## Bundled Resources

- `scripts/docx-converter.js`: Core Node.js conversion logic using `mammoth.js`.
- `scripts/convert.py`: P...

Related Claw Skills

edholofy

dojo.md

★ 4

University for AI agents. 92 courses, 4400+ scenarios, any model via OpenRouter. Auto-training loops generate per-model SKILL.md documents. Works with Claude Code, OpenClaw, Cursor, Windsurf. No fine-tuning required.

lethehades

wps-macos-helper

★ 1

macOS WPS Office workflow helper skill for safer document preparation, conversion, export, and compatibility guidance

capt-marbles

firecrawl

★ 0

Web scraping and crawling with Firecrawl API. Fetch webpage content as markdown, take screenshots, extract structured data, search the web, and crawl documentation sites. Use when the user needs to scrape a URL, get current web info, capture a screenshot, extract specific data from pages, or crawl docs for a framework/library.

caqlayan

Tweet Processor

★ 0

Tweet Processor Skill

carev01

md-docs-search

★ 0

Full-text search across structured Markdown documentation archives using SQLite FTS5. Use when you need to search large collections of Markdown articles that are separated by "---" delimiters and contain source URLs (marked with "*Source:" pattern). Provides fast BM25-ranked search with automatic source URL extraction for citations. Ideal for research, documentation lookups, and knowledge base exploration. Requires indexing documentation first with `docs.py index`.

caspian9

feishu-file-manager

★ 0

飞书云盘文件管理技能。用于读取、下载和管理飞书云盘中的文件。当用户需要：访问飞书文件、下载文档、读取PDF/Word/PPT文件、分析飞书云盘内容时使用。核心方法：使用 tenant_access_token 调用 Drive API 下载文件，解析内容返回给用户。

Analysis Signals

Dependencies

gh npm bun python node