TopRank Skills

Home / Claw Skills / API 集成 / alicloud-ai-audio-tts
Official OpenClaw rules 36%

alicloud-ai-audio-tts

Generate human-like speech audio with Model Studio DashScope Qwen TTS models (qwen3-tts-flash, qwen3-tts-instruct-flash). Use when converting text to speech, producing voice lines for short drama/news videos, or documenting TTS request/response fields for DashScope.

Stars

0

Installs

0

Status

ACTIVE

Visibility

PUBLIC

安装方式

直接复制以下提示词,发送给你的 AI 助手即可完成安装。

请先检查是否已安装 SkillHub 商店,若未安装,请根据 https://skillhub-1388575217.cos.ap-guangzhou.myqcloud.com/install/skillhub.md 安装 SkillHub 商店,然后安装 alicloud-ai-audio-tts 技能。 若已安装,则直接安装 alicloud-ai-audio-tts 技能。

Overview

Skill Key
cinience/alicloud-ai-audio-tts
Author
cinience
Source Repo
openclaw/skills
Version
1.0.0
Source Path
skills/cinience/alicloud-ai-audio-tts
Latest Commit SHA
c31d86e13bdc6753a986649e1c5a7694a7158768

Extracted Content

SKILL.md excerpt

Category: provider

# Model Studio Qwen TTS

## Validation

```bash
mkdir -p output/alicloud-ai-audio-tts
python -m py_compile skills/ai/audio/alicloud-ai-audio-tts/scripts/generate_tts.py && echo "py_compile_ok" > output/alicloud-ai-audio-tts/validate.txt
```

Pass criteria: command exits 0 and `output/alicloud-ai-audio-tts/validate.txt` is generated.

## Output And Evidence

- Save generated audio links, sample audio files, and request payloads to `output/alicloud-ai-audio-tts/`.
- Keep one validation log per execution.

## Critical model names

Use one of the recommended models:
- `qwen3-tts-flash`
- `qwen3-tts-instruct-flash`
- `qwen3-tts-instruct-flash-2026-01-26`

## Prerequisites

- Install SDK (recommended in a venv to avoid PEP 668 limits):

```bash
python3 -m venv .venv
. .venv/bin/activate
python -m pip install dashscope
```
- Set `DASHSCOPE_API_KEY` in your environment, or add `dashscope_api_key` to `~/.alibabacloud/credentials` (env takes precedence).

## Normalized interface (tts.generate)

### Request
- `text` (string, required)
- `voice` (string, required)
- `language_type` (string, optional; default `Auto`)
- `instruction` (string, optional; recommended for instruct models)
- `stream` (bool, optional; default false)

### Response
- `audio_url` (string, when stream=false)
- `audio_base64_pcm` (string, when stream=true)
- `sample_rate` (int, 24000)
- `format` (string, wav or pcm depending on mode)

## Quick start (Python + DashScope SDK)

```python
import os
import dashscope

# Prefer env var for auth: export DASHSCOPE_API_KEY=...
# Or use ~/.alibabacloud/credentials with dashscope_api_key under [default].
# Beijing region; for Singapore use: https://dashscope-intl.aliyuncs.com/api/v1
dashscope.base_http_api_url = "https://dashscope.aliyuncs.com/api/v1"

text = "Hello, this is a short voice line."
response = dashscope.MultiModalConversation.call(
    model="qwen3-tts-instruct-flash",
    api_key=os.getenv("DASHSCOPE_API_KEY"),
    text=text,
    voice...

Related Claw Skills