image-understanding

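Understand image content using Zhipu AI's free multimodal API, GLM-4V-Flash. Use this skill when the user needs to understand or describe an image, or identify the objects in it.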

Status: ACTIVE
Visibility: PUBLIC

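Installation

Copy the prompt below and send it to your AI assistant to complete the installation.

First check whether the SkillHub store is installed. If it is not, install it by following https://skillhub-1388575217.cos.ap-guangzhou.myqcloud.com/install/skillhub.md, then install the image-understanding skill. If it is already installed, install the image-understanding skill directly.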

Overview

Skill Key: andyzwp/image-read
Author: andyzwp
Source Repo: openclaw/skills
Version: -
Source Path: skills/andyzwp/image-read
Latest Commit SHA: a2ee2029946e0d33710676110de26d3d0caff6ed

Extracted Content

SKILL.md excerpt

# Image Understanding Skill

This skill interprets image content using Zhipu AI's free multimodal API, GLM-4V-Flash.

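## When to use

Use this skill when the user needs to understand the content of an image, for example:
- "What is in this image?"
- "Describe this picture"
- "What does this cell image show?"
- "Analyze the content of this image"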

## Prerequisites

The user needs to:
1. Register an account at https://bigmodel.cn/
2. Obtain an API key at https://bigmodel.cn/console/apikeys
3. Provide the API key through the `ZHIPU_API_KEY` environment variable
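
As a minimal sketch of step 3, the key can be read from the environment and checked before any API call is made. The helper name `load_api_key` and its optional `env` parameter are hypothetical and only serve this illustration.

```python
import os


def load_api_key(env=None):
    """Return the key stored in ZHIPU_API_KEY, or raise a clear error.

    `env` defaults to os.environ; passing a dict makes the helper easy to test.
    """
    env = os.environ if env is None else env
    key = env.get("ZHIPU_API_KEY", "").strip()
    if not key:
        raise RuntimeError(
            "ZHIPU_API_KEY is not set; create a key at "
            "https://bigmodel.cn/console/apikeys"
        )
    return key
```

Failing early with an explicit message tends to be easier to diagnose than an authentication error returned by the API later.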

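## Usage

### Option 1: Use the bundled script

The skill provides a `scripts/analyze_image.py` script that can be called directly:

```bash
python scripts/analyze_image.py <image_path> "<question>"
```

Arguments:
- `<image_path>`: path to the image file (JPG recommended)
- `<question>`: the question to ask, e.g. "What is in this image?"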
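
The contents of `scripts/analyze_image.py` are not shown in this excerpt. Purely as an illustration, a command-line entry point that takes the same two positional arguments could parse them as follows; the function name `parse_args` and the help strings are assumptions, not the script's actual code.

```python
import argparse


def parse_args(argv=None):
    """Parse the two positional arguments: an image path and a question."""
    parser = argparse.ArgumentParser(description="Ask a question about an image.")
    parser.add_argument("image_path", help="path to the image file (JPG recommended)")
    parser.add_argument("question", help="question to ask about the image")
    # argv=None makes argparse fall back to sys.argv[1:].
    return parser.parse_args(argv)


# Example: simulate the command line from the usage above.
args = parse_args(["cell.jpg", "What is in this image?"])
print(args.image_path, "|", args.question)
```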

### Option 2: Call the API manually

If the script is unavailable, call the Zhipu API directly from Python:

```python
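import os

from zhipuai import ZhipuAI

# Read the key from the ZHIPU_API_KEY environment variable.
client = ZhipuAI(api_key=os.environ["ZHIPU_API_KEY"])

response = client.chat.completions.create(
    model="glm-4v-flash",  # the free model this skill targets
    messages=[
        {
            "role": "user",
            "content": [
                {"type": "text", "text": "What is in this image? Please describe it in detail."},
                # Replace the placeholder with an image URL or a Base64-encoded image.
                {"type": "image_url", "image_url": {"url": "IMAGE_URL_OR_BASE64"}}
            ]
        }
    ]
)

# Print the model's description of the image.
print(response.choices[0].message.content)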
```
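
For a local file, the Base64 route can be sketched as below. The helper `build_messages` is hypothetical, and the sketch assumes the API accepts the raw Base64 string in the `url` field, as the placeholder in the example above suggests. Whether a `data:` URI prefix is required is not stated here, so check the API reference before relying on it.

```python
import base64


def build_messages(image_bytes, question):
    """Build a single-turn `messages` list with a question and a Base64 image."""
    # Assumption: the raw Base64 string goes directly into the "url" field.
    encoded = base64.b64encode(image_bytes).decode("ascii")
    return [
        {
            "role": "user",
            "content": [
                {"type": "text", "text": question},
                {"type": "image_url", "image_url": {"url": encoded}},
            ],
        }
    ]
```

The result can be passed as `messages=` to `client.chat.completions.create`, with the bytes obtained from `open(path, "rb").read()`.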

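## Output format

Returns a detailed description of the image, including:
- Main objects or people in the image
- Scene and background
- Visual characteristics such as color and layout
- Text, if any
- Possible meaning or inferences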

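## Notes

- GLM-4V-Flash is free to use, but call frequency is rate-limited
- Images can be supplied as a URL or as Base64-encoded data
- Images up to 1024x1024 are supported best
- JPG is recommended; PNG may have compatibility issues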
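
Because calls are rate-limited, a caller may want to retry with exponential backoff. The sketch below is one generic way to do that. The helper `call_with_retry` is hypothetical, and since the exception the SDK raises on a rate limit is not specified here, it catches `Exception` broadly as a stated assumption.

```python
import time


def call_with_retry(func, max_attempts=4, base_delay=1.0, sleep=time.sleep):
    """Call func(); on failure wait base_delay * 2**attempt seconds, then retry.

    Assumption: any exception is treated as retryable. Narrow the except
    clause once the SDK's rate-limit error type is known.
    """
    for attempt in range(max_attempts):
        try:
            return func()
        except Exception:
            if attempt == max_attempts - 1:
                raise  # out of attempts: surface the last error
            sleep(base_delay * 2 ** attempt)
```

A request can then be wrapped as `call_with_retry(lambda: client.chat.completions.create(...))`.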

Analysis Signals

Dependencies

python