Overview
- Skill Key: crazybuffon/pronunciation-coach
- Author: crazybuffon
- Source Repo: openclaw/skills
- Version: -
- Source Path: skills/crazybuffon/pronunciation-coach
- Latest Commit SHA: e7a63c207ae6baa0b743d0710f7808f9a858e012
Pronunciation coaching with real voice analysis using Azure Speech Services. Analyzes audio files for phoneme-level accuracy, fluency, prosody, and intonation scores.
Stars: 0
Installs: 0
Status: ACTIVE
Visibility: PUBLIC
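Copy the prompt below and send it to your AI assistant to complete the installation.
First check whether the SkillHub store is installed. If it is not, install it by following https://skillhub-1388575217.cos.ap-guangzhou.myqcloud.com/install/skillhub.md, then install the pronunciation-coach skill. If it is already installed, install the pronunciation-coach skill directly.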
# Pronunciation Coach

Analyze spoken English pronunciation using Azure Speech Services and provide actionable coaching feedback.

**Privacy Note**: This skill reads local voice messages from `~/.openclaw/media/inbound/` and transmits them to Microsoft Azure Speech Services for processing.

## Prerequisites

- **Azure Speech API Key**: Set `AZURE_SPEECH_KEY` env var
- **Azure Speech Region**: Set `AZURE_SPEECH_REGION` env var (e.g., `southeastasia`)
- **ffmpeg**: Required for audio format conversion (must be on PATH)
- **Node.js**: Required for report generation

## Workflow

### 1. Receive Audio

Voice messages from Telegram are stored in `~/.openclaw/media/inbound/`. Find the latest `.ogg` file matching the message timestamp.

```bash
ls -lt ~/.openclaw/media/inbound/*.ogg | head -5
```

### 2. Run Assessment

```bash
scripts/pronunciation-assess.sh <audio_file> "<reference_text>"
```

- `audio_file`: Path to the voice message (ogg/wav/mp3/m4a)
- `reference_text`: What the speaker intended to say (from transcript)
- The script auto-converts any format to WAV 16kHz mono

### 3. Generate Report

Pipe the JSON output into the report generator:

```bash
scripts/pronunciation-assess.sh audio.ogg "reference text" | node scripts/pronunciation-report.js
```

The report includes:

- Overall scores (Pronunciation, Accuracy, Fluency, Prosody, Completeness)
- Word-by-word breakdown with per-phoneme scores
- Problem sounds highlighted
- Verdict with actionable next steps

### 4. Provide Coaching

After generating the report:

1. **Send the text report** to the user (scores + word breakdown)
2. **Identify top 3 problem sounds** from the phoneme scores
3. **Explain each problem**: what the correct sound is and how to produce it
   - See `references/phoneme-guide.md` for phoneme descriptions and fixes
4. **Send a voice message** (via TTS) demonstrating the correct pronunciation of problem words
5. **Assign practice**: give the user specific sentences to re-record focusing on wea...
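
The "identify top 3 problem sounds" step in the coaching workflow could be sketched in Node.js roughly as follows. This is a minimal illustration, not the skill's actual implementation: it assumes the assessment JSON follows the Azure Speech detailed-result layout (`NBest[0].Words[].Phonemes[].PronunciationAssessment.AccuracyScore`), and the output of `scripts/pronunciation-assess.sh` may be shaped differently. The function name `topProblemPhonemes` and the sample data are hypothetical.

```javascript
// Sketch: rank the lowest-scoring phonemes in a pronunciation assessment result.
// Assumption: the input follows the Azure Speech detailed-result layout.
// The real output of scripts/pronunciation-assess.sh may differ.

function topProblemPhonemes(result, limit = 3) {
  const words = result?.NBest?.[0]?.Words ?? [];
  const stats = new Map(); // phoneme -> { total, count, words }

  for (const word of words) {
    for (const p of word.Phonemes ?? []) {
      const score = p.PronunciationAssessment?.AccuracyScore;
      if (typeof score !== "number") continue; // skip phonemes without a score

      const entry = stats.get(p.Phoneme) ?? { total: 0, count: 0, words: new Set() };
      entry.total += score;
      entry.count += 1;
      entry.words.add(word.Word);
      stats.set(p.Phoneme, entry);
    }
  }

  // Average each phoneme's accuracy, then return the lowest averages first.
  return [...stats.entries()]
    .map(([phoneme, s]) => ({
      phoneme,
      average: s.total / s.count,
      words: [...s.words],
    }))
    .sort((a, b) => a.average - b.average)
    .slice(0, limit);
}

// Hypothetical sample data, made up for illustration only.
const sample = {
  NBest: [{
    Words: [
      { Word: "three", Phonemes: [
        { Phoneme: "th", PronunciationAssessment: { AccuracyScore: 40 } },
        { Phoneme: "r", PronunciationAssessment: { AccuracyScore: 90 } },
        { Phoneme: "iy", PronunciationAssessment: { AccuracyScore: 95 } },
      ] },
      { Word: "think", Phonemes: [
        { Phoneme: "th", PronunciationAssessment: { AccuracyScore: 60 } },
        { Phoneme: "ih", PronunciationAssessment: { AccuracyScore: 85 } },
        { Phoneme: "ng", PronunciationAssessment: { AccuracyScore: 70 } },
        { Phoneme: "k", PronunciationAssessment: { AccuracyScore: 98 } },
      ] },
    ],
  }],
};

for (const item of topProblemPhonemes(sample)) {
  console.log(`${item.phoneme}: ${item.average.toFixed(1)} (in: ${item.words.join(", ")})`);
}
```

Averaging per phoneme across all words, rather than picking the single worst occurrence, is a design choice in this sketch: it favors sounds that are consistently weak over one-off slips. The words collected for each phoneme can then feed the explanation and practice steps.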
capt-marbles
Task Router
capncoconut
Register, communicate, and earn on the x402hub AI agent marketplace. Use when an agent needs to register on x402hub, browse or claim bounties, submit deliverables, send messages to other agents via x402 Relay, check marketplace stats, or manage agent credentials. Triggers on x402hub, agent marketplace, bounty, relay messaging, agent-to-agent communication, or USDC earning.
capevace
Real-time event bus for AI agents. Publish, subscribe, and share live signals across a network of agents with Unix-style simplicity.
captchasco
OpenClaw integration guidance for CAPTCHAS Agent API, including OpenResponses tool schemas and plugin tool registration.
carol-gutianle
name: modelready
description: Start using a local or Hugging Face model instantly, directly from chat.
metadata: {"openclaw":{"requires":{"bins": "bash", "curl" }, "env": "URL" }}
canbirlik
Controls Wiz smart bulbs (turn on/off, RGB colors, disco mode) via local WiFi.