Overview
- Skill Key
- bibekyess/docx-to-html
- Author
- bibekyess
- Source Repo
- openclaw/skills
- Version
- -
- Source Path
- skills/bibekyess/docx-to-html
- Latest Commit SHA
- 46a10c5172911a1fb182b922d361ff752b782afe
Use this skill whenever the user has a DOCX file (.docx) and wants to convert, read, view, extract content from, or process it in any way — including summarization, displaying in a browser, extracting tables or lists, or feeding into AI pipelines. Always use this skill for any task involving .docx files, even if the request seems simple. Triggers include: 'convert docx', 'open word file', 'read word document', 'extract tables from docx', or any mention of a .docx filename.
Stars
0
Installs
0
Status
ACTIVE
Visibility
PUBLIC
直接复制以下提示词,发送给你的 AI 助手即可完成安装。
请先检查是否已安装 SkillHub 商店,若未安装,请根据 https://skillhub-1388575217.cos.ap-guangzhou.myqcloud.com/install/skillhub.md 安装 SkillHub 商店,然后安装 docx-to-html 技能。 若已安装,则直接安装 docx-to-html 技能。
# DOCX to HTML Converter This skill provides a straightforward method to convert Microsoft Word (.docx) documents into clean, semantic HTML, making them suitable for various web-based and AI-driven applications. ## Compatibility - **Python 3** (for the conversion wrapper) - **Node.js** with `mammoth` installed (core conversion engine) To install Node.js dependencies, run once from the `scripts/` directory: ```bash npm install ``` ## Use Cases - **Browser-Based Viewing**: Convert DOCX documents for display in web browsers without requiring Microsoft Word. - **AI-Ready Content**: Prepare DOCX content for LLMs for tasks like summarization, Q&A, and semantic search. - **Web Integration**: Integrate Word document content into web applications, CMS, or online editors. - **Data Extraction**: Extract structured data (tables, lists, headings) from DOCX files for automated reporting and analysis. - **Search and Indexing**: Enable full-text and vector search by converting DOCX content into easily indexable HTML. ## Workflow 1. **Locate DOCX File**: Identify the path to the `.docx` file to convert. 2. **Run Conversion Script**: Execute the Python wrapper from the skill's `scripts/` directory: ```bash python3 <skill-dir>/scripts/convert.py <input_path.docx> <output_path.html> ``` Replace `<skill-dir>` with the actual path where this skill is installed. 3. **Verify Output**: Open the generated `.html` file in a browser and check: - Headings (`<h1>`, `<h2>`, etc.) appear at the correct hierarchy levels - Tables render with the expected rows and columns - Lists appear as bullet or numbered items (not plain text) - Bold, italic, and inline formatting are preserved - Images are visible (embedded as base64 by default) 4. **Process HTML**: Use the resulting HTML for further tasks like summarization, indexing, or display. ## Bundled Resources - `scripts/docx-converter.js`: Core Node.js conversion logic using `mammoth.js`. - `scripts/convert.py`: P...
edholofy
University for AI agents. 92 courses, 4400+ scenarios, any model via OpenRouter. Auto-training loops generate per-model SKILL.md documents. Works with Claude Code, OpenClaw, Cursor, Windsurf. No fine-tuning required.
lethehades
macOS WPS Office workflow helper skill for safer document preparation, conversion, export, and compatibility guidance
capt-marbles
Web scraping and crawling with Firecrawl API. Fetch webpage content as markdown, take screenshots, extract structured data, search the web, and crawl documentation sites. Use when the user needs to scrape a URL, get current web info, capture a screenshot, extract specific data from pages, or crawl docs for a framework/library.
caqlayan
Tweet Processor Skill
carev01
Full-text search across structured Markdown documentation archives using SQLite FTS5. Use when you need to search large collections of Markdown articles that are separated by "---" delimiters and contain source URLs (marked with "*Source:" pattern). Provides fast BM25-ranked search with automatic source URL extraction for citations. Ideal for research, documentation lookups, and knowledge base exploration. Requires indexing documentation first with `docs.py index`.
caspian9
飞书云盘文件管理技能。用于读取、下载和管理飞书云盘中的文件。 当用户需要:访问飞书文件、下载文档、读取PDF/Word/PPT文件、分析飞书云盘内容时使用。 核心方法:使用 tenant_access_token 调用 Drive API 下载文件,解析内容返回给用户。