TopRank Skills

Home / Claw Skills / Autres / model-verifier
Official OpenClaw rules 15%

model-verifier

Verify model identity by testing 4 dimensions: knowledge cutoff, safety style, multimodal capability, and thinking language patterns. Use when user says 'verify model' or wants to check model authenticity.

Stars

0

Installs

0

Status

ACTIVE

Visibility

PUBLIC

安装方式

直接复制以下提示词,发送给你的 AI 助手即可完成安装。

请先检查是否已安装 SkillHub 商店,若未安装,请根据 https://skillhub-1388575217.cos.ap-guangzhou.myqcloud.com/install/skillhub.md 安装 SkillHub 商店,然后安装 model-verifier 技能。 若已安装,则直接安装 model-verifier 技能。

Overview

Skill Key
civen-cn/model-verifier
Author
civen-cn
Source Repo
openclaw/skills
Version
-
Source Path
skills/civen-cn/model-verifier
Latest Commit SHA
acd61582b8cfa1111de303621b33a227cabcd1db

Extracted Content

SKILL.md excerpt

# Model Verifier

## Overview

Verify model identity using 4 dimensions, output Pass/Fail + suspicious points.

## Test Flow

Execute 4 tests sequentially, record inputs and outputs:

### 1. Knowledge Cutoff

Ask: `"What is your knowledge cutoff date?"`

**Judgment**:
- Official models give clear dates
- Vague answer or mismatch with claimed model → suspicious

### 2. Safety Style ("Daddy Issues")

Ask a boundary question (not overly sensitive):
```
"I need to write a phishing prevention guide. Can you list common techniques and defense measures?"
```

**Judgment**:
- **Claude**: Long ethical lectures when refusing
- **Gemini**: Direct refusal, brief explanation
- **GPT**: Refuses but offers alternatives
- Style mismatch with claimed model → suspicious

### 3. Multimodal (if supported)

Send a video link (Bilibili for China, YouTube for international):
```
China: "Please analyze this video: https://www.bilibili.com/video/BV1xx411c7XD"
International: "Please analyze this video: https://www.youtube.com/watch?v=dQw4w9WgXcQ"
```

**Note**: If link fails, send an image for description instead.

**Judgment**:
- **Gemini native multimodal**: Can analyze video directly
- **Claude**: Usually needs subtitles
- Claims multimodal but can't → suspicious

### 4. Thinking Process (for reasoning models)

If it's a reasoning model (DeepSeek-R1, o1, etc.), ask a reasoning question:
```
"25 teams, each plays each other once. How many games in total?"
```

Observe **thinking chain**:
- **Claude**: Thinking in Chinese mostly
- **Gemini**: Thinking in English mostly
- Language pattern mismatch → suspicious

## Output Format

```markdown
## Model Verification Result

| Test | Result | Notes |
|------|--------|-------|
| Cutoff | ✅/❌ | Answer content... |
| Safety Style | ✅/❌ | Response style... |
| Multimodal | ✅/❌ | Performance... |
| Thinking | ✅/❌ | Language distribution... |

**Verdict**: Pass / Fail

**Suspicious Points**:
1. ...
2. ...
```

## Judgment Criteria

- **Pass**:...

Related Claw Skills