Overview
- Skill Key
- day253/day253-volcengine-ai-audio-tts
- Author
- day253
- Source Repo
- openclaw/skills
- Version
- -
- Source Path
- skills/day253/day253-volcengine-ai-audio-tts
- Latest Commit SHA
- dc72894d786e7824da85ed88012fd60fcd826332
Text-to-speech generation on Volcengine (ByteDance) speech services. Use when users need narration, multi-language speech output, voice selection, or TTS troubleshooting. Supports online one-shot HTTP API (openspeech.bytedance.com).
Stars
0
Installs
0
Status
ACTIVE
Visibility
PUBLIC
直接复制以下提示词,发送给你的 AI 助手即可完成安装。
请先检查是否已安装 SkillHub 商店,若未安装,请根据 https://skillhub-1388575217.cos.ap-guangzhou.myqcloud.com/install/skillhub.md 安装 SkillHub 商店,然后安装 volcengine-ai-audio-tts 技能。 若已安装,则直接安装 volcengine-ai-audio-tts 技能。
Category: provider
# Volcengine 语音合成 TTS
## Validation
```bash
mkdir -p output/volcengine-ai-audio-tts
python -m py_compile skills/ai/audio/volcengine-ai-audio-tts/scripts/generate_tts.py && echo "py_compile_ok" > output/volcengine-ai-audio-tts/validate.txt
```
Pass criteria: command exits 0 and `output/volcengine-ai-audio-tts/validate.txt` is generated.
## Output And Evidence
- Save generated audio files, request payloads, and response metadata to `output/volcengine-ai-audio-tts/`.
- Keep one validation log per execution.
## Prerequisites
- Python 3.8+. No extra SDK required (uses `requests` and stdlib).
- Install: `pip install requests`
- Set environment variables (from Volcengine 豆包语音控制台):
- `VOLCENGINE_TTS_APP_ID` — 应用 ID
- `VOLCENGINE_TTS_TOKEN` — 应用 Token(用于 Authorization: Bearer;${token})
- `VOLCENGINE_TTS_CLUSTER` — 业务集群,如 `volcano_tts`(标准音色)
Optional: use `.env` in repo root or script directory; script will load them.
## Normalized interface (tts.generate)
### Request
- `text` (string, required) — 合成文本,UTF-8,单次建议 ≤1024 字节
- `voice_type` (string, required) — 音色,见 [发音人参数列表](https://www.volcengine.com/docs/6561/79824),如 `BV700_streaming`
- `encoding` (string, optional) — 编码格式:`pcm` | `wav` | `mp3` | `ogg_opus`,默认 `mp3`
- `rate` (int, optional) — 采样率 8000/16000/24000,默认 24000
- `speed_ratio` (float, optional) — 语速 [0.2, 3],默认 1.0
- `volume_ratio` (float, optional) — 音量 [0.1, 3],默认 1.0
- `pitch_ratio` (float, optional) — 音高 [0.1, 3],默认 1.0
- `language` (string, optional) — 语言,如 `cn`
### Response
- `audio_path` (string) — 本地保存的音频文件路径
- `sample_rate` (int)
- `format` (string)
- `duration_ms` (string, when returned by API)
- `code` (int) — 3000 表示成功
## Quick start (Python script)
```bash
# 使用内联 JSON 请求
python skills/ai/audio/volcengine-ai-audio-tts/scripts/generate_tts.py \
--request '{"text":"你好,这是一段测试语音。","voice_type":"BV...
human-pages-ai
Search and hire real humans for tasks — photography, delivery, research, and more
zseven-w
Reusable skill templates for OpenClaw AI agents. Templates for API integration, data processing, web scraping, CLI tools, and file processing.
capt-marbles
Attio CRM integration for managing companies, people, deals, notes, tasks, and custom objects. Use when working with Attio CRM data, searching contacts, managing sales pipelines, adding notes to records, creating tasks, or syncing prospect information.
capt-marbles
Web scraping and crawling with Firecrawl API. Fetch webpage content as markdown, take screenshots, extract structured data, search the web, and crawl documentation sites. Use when the user needs to scrape a URL, get current web info, capture a screenshot, extract specific data from pages, or crawl docs for a framework/library.
caqlayan
Tweet Processor Skill
carlosarturoleon
Connect to Windsor.ai MCP for natural language access to 325+ data sources including Facebook Ads, GA4, HubSpot, Shopify, and more.