TopRank Skills

Home / Claw Skills / API 集成 / volcengine-ai-audio-tts
Official OpenClaw rules 36%

volcengine-ai-audio-tts

Text-to-speech generation on Volcengine (ByteDance) speech services. Use when users need narration, multi-language speech output, voice selection, or TTS troubleshooting. Supports online one-shot HTTP API (openspeech.bytedance.com).

Stars

0

Installs

0

Status

ACTIVE

Visibility

PUBLIC

安装方式

直接复制以下提示词,发送给你的 AI 助手即可完成安装。

请先检查是否已安装 SkillHub 商店,若未安装,请根据 https://skillhub-1388575217.cos.ap-guangzhou.myqcloud.com/install/skillhub.md 安装 SkillHub 商店,然后安装 volcengine-ai-audio-tts 技能。 若已安装,则直接安装 volcengine-ai-audio-tts 技能。

Overview

Skill Key
day253/day253-volcengine-ai-audio-tts
Author
day253
Source Repo
openclaw/skills
Version
-
Source Path
skills/day253/day253-volcengine-ai-audio-tts
Latest Commit SHA
dc72894d786e7824da85ed88012fd60fcd826332

Extracted Content

SKILL.md excerpt

Category: provider

# Volcengine 语音合成 TTS

## Validation

```bash
mkdir -p output/volcengine-ai-audio-tts
python -m py_compile skills/ai/audio/volcengine-ai-audio-tts/scripts/generate_tts.py && echo "py_compile_ok" > output/volcengine-ai-audio-tts/validate.txt
```

Pass criteria: command exits 0 and `output/volcengine-ai-audio-tts/validate.txt` is generated.

## Output And Evidence

- Save generated audio files, request payloads, and response metadata to `output/volcengine-ai-audio-tts/`.
- Keep one validation log per execution.

## Prerequisites

- Python 3.8+. No extra SDK required (uses `requests` and stdlib).
- Install: `pip install requests`
- Set environment variables (from Volcengine 豆包语音控制台):
  - `VOLCENGINE_TTS_APP_ID` — 应用 ID
  - `VOLCENGINE_TTS_TOKEN` — 应用 Token(用于 Authorization: Bearer;${token})
  - `VOLCENGINE_TTS_CLUSTER` — 业务集群,如 `volcano_tts`(标准音色)

Optional: use `.env` in repo root or script directory; script will load them.

## Normalized interface (tts.generate)

### Request
- `text` (string, required) — 合成文本,UTF-8,单次建议 ≤1024 字节
- `voice_type` (string, required) — 音色,见 [发音人参数列表](https://www.volcengine.com/docs/6561/79824),如 `BV700_streaming`
- `encoding` (string, optional) — 编码格式:`pcm` | `wav` | `mp3` | `ogg_opus`,默认 `mp3`
- `rate` (int, optional) — 采样率 8000/16000/24000,默认 24000
- `speed_ratio` (float, optional) — 语速 [0.2, 3],默认 1.0
- `volume_ratio` (float, optional) — 音量 [0.1, 3],默认 1.0
- `pitch_ratio` (float, optional) — 音高 [0.1, 3],默认 1.0
- `language` (string, optional) — 语言,如 `cn`

### Response
- `audio_path` (string) — 本地保存的音频文件路径
- `sample_rate` (int)
- `format` (string)
- `duration_ms` (string, when returned by API)
- `code` (int) — 3000 表示成功

## Quick start (Python script)

```bash
# 使用内联 JSON 请求
python skills/ai/audio/volcengine-ai-audio-tts/scripts/generate_tts.py \
  --request '{"text":"你好,这是一段测试语音。","voice_type":"BV...

Related Claw Skills