pronunciation-coach

Pronunciation coaching with real voice analysis using Azure Speech Services. Analyzes audio files for phoneme-level accuracy, fluency, prosody, and intonation scores.

Stars: 0
Installs: 0
Status: ACTIVE
Visibility: PUBLIC

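Installation

Copy the prompt below and send it to your AI assistant to complete the installation.

First check whether the SkillHub store is installed. If it is not, install it by following https://skillhub-1388575217.cos.ap-guangzhou.myqcloud.com/install/skillhub.md, then install the pronunciation-coach skill. If it is already installed, install the pronunciation-coach skill directly.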

Overview

Skill Key: crazybuffon/pronunciation-coach
Author: crazybuffon
Source Repo: openclaw/skills
Version: -
Source Path: skills/crazybuffon/pronunciation-coach
Latest Commit SHA: e7a63c207ae6baa0b743d0710f7808f9a858e012

Extracted Content

SKILL.md excerpt

# Pronunciation Coach

Analyze spoken English pronunciation using Azure Speech Services and provide actionable coaching feedback.

**Privacy Note**: This skill reads local voice messages from `~/.openclaw/media/inbound/` and transmits them to Microsoft Azure Speech Services for processing.

## Prerequisites

- **Azure Speech API Key**: Set `AZURE_SPEECH_KEY` env var
- **Azure Speech Region**: Set `AZURE_SPEECH_REGION` env var (e.g., `southeastasia`)
- **ffmpeg**: Required for audio format conversion (must be on PATH)
- **Node.js**: Required for report generation

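A quick preflight check can catch a missing key or tool before any audio is processed. The sketch below is not part of the skill; `missing_prereqs` is a hypothetical helper that only checks the four prerequisites listed above.

```bash
# Hypothetical helper (not shipped with the skill): prints one line per
# missing prerequisite and returns non-zero if anything is missing.
missing_prereqs() {
  local missing=0 tool
  if [ -z "${AZURE_SPEECH_KEY:-}" ]; then
    echo "missing env var: AZURE_SPEECH_KEY"
    missing=1
  fi
  if [ -z "${AZURE_SPEECH_REGION:-}" ]; then
    echo "missing env var: AZURE_SPEECH_REGION"
    missing=1
  fi
  for tool in ffmpeg node; do
    # command -v succeeds only if the tool can be found on PATH
    if ! command -v "$tool" >/dev/null 2>&1; then
      echo "missing command: $tool"
      missing=1
    fi
  done
  return "$missing"
}

# Usage (commented out so that sourcing this snippet has no side effects):
# missing_prereqs || exit 1
```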
## Workflow

### 1. Receive Audio

Voice messages from Telegram are stored in `~/.openclaw/media/inbound/`. Find the latest `.ogg` file matching the message timestamp.

```bash
ls -lt ~/.openclaw/media/inbound/*.ogg | head -5
```
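When the path is needed in a script rather than read by eye, a small function can return only the newest file. This is a sketch under the assumption that the newest `.ogg` in the directory is the message of interest; `latest_ogg` is a hypothetical name, and matching against a specific message timestamp is left out.

```bash
# Hypothetical helper: print the most recently modified .ogg file in a
# directory (default: the inbound media folder). Returns non-zero if none exist.
latest_ogg() {
  local dir=${1:-"$HOME/.openclaw/media/inbound"} newest= f
  for f in "$dir"/*.ogg; do
    [ -e "$f" ] || continue   # the glob stays literal when nothing matches
    if [ -z "$newest" ] || [ "$f" -nt "$newest" ]; then
      newest=$f
    fi
  done
  [ -n "$newest" ] && printf '%s\n' "$newest"
}

# Usage (commented out so that sourcing this snippet has no side effects):
# audio_file=$(latest_ogg) || echo "no voice messages found" >&2
```

Comparing modification times with `-nt` avoids parsing `ls` output, which can break on unusual file names.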

### 2. Run Assessment

```bash
scripts/pronunciation-assess.sh <audio_file> "<reference_text>"
```

- `audio_file`: Path to the voice message (ogg/wav/mp3/m4a)
- `reference_text`: What the speaker intended to say (from transcript)
- The script automatically converts the input to 16 kHz mono WAV (via ffmpeg) before assessment
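For reference, the conversion step could look roughly like the following. The script's actual ffmpeg flags are not shown in this excerpt, so this is an assumption about one reasonable way to produce 16 kHz mono 16-bit PCM; `wav_path_for` and `to_wav16k` are hypothetical names.

```bash
# Hypothetical helper: derive an output path next to the input file.
# Assumes the file name has an extension; the ".16k.wav" suffix avoids
# overwriting an input that is already a .wav file.
wav_path_for() {
  printf '%s\n' "${1%.*}.16k.wav"
}

# Sketch of the conversion; the real script may use different options.
to_wav16k() {
  local src=$1 out
  out=$(wav_path_for "$src")
  # -ar 16000: 16 kHz sample rate, -ac 1: mono, pcm_s16le: 16-bit PCM
  ffmpeg -hide_banner -loglevel error -y -i "$src" \
    -ar 16000 -ac 1 -c:a pcm_s16le "$out" && printf '%s\n' "$out"
}

# Usage (commented out so that sourcing this snippet has no side effects):
# wav_file=$(to_wav16k ~/.openclaw/media/inbound/voice.ogg)
```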

### 3. Generate Report

Pipe the JSON output into the report generator:

```bash
scripts/pronunciation-assess.sh audio.ogg "reference text" | node scripts/pronunciation-report.js
```

The report includes:
- Overall scores (Pronunciation, Accuracy, Fluency, Prosody, Completeness)
- Word-by-word breakdown with per-phoneme scores
- Problem sounds highlighted
- Verdict with actionable next steps
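In a plain pipe, the report generator still runs if the assessment step fails and emits nothing. One possible workaround, sketched below, is to write the JSON to a file first and only then generate the report. `assess_to_file` and the `ASSESS_SCRIPT` override are hypothetical, and the sketch assumes the report script reads JSON from standard input, as the pipe above suggests.

```bash
# Hypothetical wrapper: keep the raw JSON so the report can be regenerated
# later without calling Azure again. ASSESS_SCRIPT can be overridden.
ASSESS_SCRIPT=${ASSESS_SCRIPT:-scripts/pronunciation-assess.sh}

assess_to_file() {
  local audio=$1 text=$2 out=$3
  "$ASSESS_SCRIPT" "$audio" "$text" > "$out" || return 1
  # Treat an empty file as a failure so the report step never sees blank input.
  [ -s "$out" ] || { echo "assessment produced no output" >&2; return 1; }
}

# Usage (commented out so that sourcing this snippet has no side effects):
# assess_to_file audio.ogg "reference text" result.json &&
#   node scripts/pronunciation-report.js < result.json
```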

### 4. Provide Coaching

After generating the report:

1. **Send the text report** to the user (scores + word breakdown)
2. **Identify top 3 problem sounds** from the phoneme scores
3. **Explain each problem** — what the correct sound is and how to produce it
   - See `references/phoneme-guide.md` for phoneme descriptions and fixes
4. **Send a voice message** (via TTS) demonstrating the correct pronunciation of problem words
5. **Assign practice** — give the user specific sentences to re-record focusing on wea...
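Picking the top 3 problem sounds (step 2 above) amounts to averaging the scores per phoneme and taking the three lowest. The sketch below assumes the phoneme scores have already been flattened into `<phoneme> <score>` lines; that intermediate format and the name `top_problem_sounds` are illustrative assumptions, not the script's actual output.

```bash
# Hypothetical helper: reads "<phoneme> <score>" lines on stdin (an assumed
# intermediate format, not the assessment script's JSON) and prints the
# three phonemes with the lowest average score, worst first.
top_problem_sounds() {
  awk '{ sum[$1] += $2; n[$1]++ }
       END { for (p in sum) printf "%s %.1f\n", p, sum[p] / n[p] }' |
    LC_ALL=C sort -k2,2n -k1,1 |
    head -n 3
}

# Usage (commented out so that sourcing this snippet has no side effects):
# top_problem_sounds < phoneme-scores.txt
```

Averaging per phoneme keeps a single badly scored word from dominating the list; ties are broken alphabetically so the output is stable.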
