TopRank Skills

Official OpenClaw rules 54%

rxnim

Parse chemical reaction images into machine-readable data (reactants, products, conditions) using the RxnIM multimodal LLM. Supports web API (Hugging Face Spaces) and local inference.

Stars

0

Installs

0

Status

ACTIVE

Visibility

PUBLIC

安装方式

直接复制以下提示词,发送给你的 AI 助手即可完成安装。

请先检查是否已安装 SkillHub 商店,若未安装,请根据 https://skillhub-1388575217.cos.ap-guangzhou.myqcloud.com/install/skillhub.md 安装 SkillHub 商店,然后安装 rxnim 技能。 若已安装,则直接安装 rxnim 技能。

Overview

Skill Key
fqiangliu/rxn-im
Author
fqiangliu
Source Repo
openclaw/skills
Version
-
Source Path
skills/fqiangliu/rxn-im
Latest Commit SHA
d574d5e154b2c47c3cf204537fe1d62b6c36e745

Extracted Content

SKILL.md excerpt

# RxnIM Skill

Extract structured reaction data (SMILES, conditions) from chemical reaction images using the RxnIM model. RxnIM is a multimodal large language model designed for chemical reaction image parsing, achieving 84%-92% soft match F1 score on various benchmarks. It performs three tasks: reaction extraction, condition OCR, and role identification.

## Features

- **Image input**: PNG, JPG, etc.
- **Output**: JSON with reactants, products, conditions (reagents, solvents, temperature, yield).
- **Two modes**:
  1. **Web API**: Calls the Hugging Face Spaces demo (no local model needed).
  2. **Local inference**: Runs the full model locally (requires GPU and ~14 GB disk space).
- **Tasks supported**:
  - Reaction extraction (SMILES of reactants and products)
  - Condition OCR (text extraction and role labeling)
  - Role identification (reagent, solvent, temperature, yield)

## Background

RxnIM (Reaction Image Multimodal large language model) is the first multimodal LLM specifically designed to parse chemical reaction images into machine‑readable reaction data. It aligns task instructions with image features and uses an LLM‑based decoder to predict reaction components and conditions. The model is trained on a large‑scale synthetic dataset (Pistachio) and real ACS publications.

**Key capabilities**:
- Extracts SMILES of reactants and products with high accuracy.
- Interprets textual conditions (reagents, solvents, temperature, yield) and assigns roles.
- Outputs structured JSON or formatted reaction strings.
Performance: 84%–92% soft match F1 score on multiple test sets, outperforming previous methods.



##Quick Start
**Web API Mode (default)**
node scripts/rxnim.js --image /path/to/reaction.png

**Local Mode**
First, download the model checkpoint (see RxnIM repository(https://github.com/CYF2000127/RxnIM)) and set the environment variable RXNIM_MODEL_PATH.
export RXNIM_MODEL_PATH=/path/to/RxnIM-7b
node scripts/rxnim.js --image...

Related Claw Skills