zhipu-tts

Overview

Skill Key: franklu0819-lang/zhipu-tts
Author: franklu0819-lang
Source Repo: openclaw/skills
Version: -
Source Path: skills/franklu0819-lang/zhipu-tts
Latest Commit SHA: fd1f6b179bf9217aa9ff187a4a11d6ba1be395e8

Extracted Content

SKILL.md excerpt

# Zhipu AI Text-to-Speech

Convert Chinese text to natural-sounding speech using Zhipu AI's GLM-TTS model.

## Setup

**1. Get your API Key:**
Create a key in the [Zhipu AI Console](https://bigmodel.cn/usercenter/proj-mgmt/apikeys).

**2. Set it in your environment:**
```bash
export ZHIPU_API_KEY="your-key-here"
```
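
A small guard can catch a missing key before the script is run. The following is a minimal sketch: `require_api_key` is a hypothetical helper name and is not part of the skill's scripts.

```bash
# Hypothetical helper (not shipped with the skill): fail early when the key is absent.
require_api_key() {
  # ${ZHIPU_API_KEY:-} expands to an empty string when the variable is unset,
  # so the test also works under `set -u`.
  if [ -z "${ZHIPU_API_KEY:-}" ]; then
    echo "ZHIPU_API_KEY is not set; export it before running the script" >&2
    return 1
  fi
}
```

Calling `require_api_key || exit 1` at the top of a wrapper script would stop it before any conversion is attempted.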

## Available Voices

### System Voices (Pre-built)

- **tongtong** (彤彤) - Default voice, balanced tone
- **chuichui** (锤锤) - Male voice, deeper tone
- **xiaochen** (小陈) - Young professional voice
- **jam** - 动动动物圈 Jam voice
- **kazi** - 动动动物圈 Kazi voice
- **douji** - 动动动物圈 Douji voice
- **luodo** - 动动动物圈 Luodo voice

## Usage

### Basic Text-to-Speech

Convert text to speech with the default settings (tongtong voice, speed 1.0, WAV format):

```bash
bash scripts/text_to_speech.sh "你好，今天天气怎么样"
```

### Advanced Options

Pass the voice, speed, format, and output filename as positional arguments after the text, in that order:

```bash
bash scripts/text_to_speech.sh "欢迎使用智能语音服务" xiaochen 1.2 wav greeting.wav
```

**Parameters:**
- `text` (required): Chinese text to convert (max 1024 characters)
- `voice` (optional): tongtong (default), chuichui, xiaochen, jam, kazi, douji, luodo
- `speed` (optional): Speech speed from 0.5 to 2.0 (default: 1.0)
- `output_format` (optional): wav (default), pcm
- `output_file` (optional): Output filename (default: output.{format})
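
The limits listed above can be checked locally before calling the script. This is a sketch under stated assumptions: `validate_tts_args` is a hypothetical helper, and the bundled script may already perform some or all of these checks itself.

```bash
# Hypothetical pre-flight check for the documented parameter limits.
validate_tts_args() {
  local text=$1 voice=${2:-tongtong} speed=${3:-1.0} format=${4:-wav}

  # ${#text} counts characters only under a UTF-8 locale; under LC_ALL=C it counts bytes.
  if [ -z "$text" ] || [ "${#text}" -gt 1024 ]; then
    echo "text must be 1 to 1024 characters" >&2
    return 1
  fi

  case "$voice" in
    tongtong|chuichui|xiaochen|jam|kazi|douji|luodo) ;;
    *) echo "unknown voice: $voice" >&2; return 1 ;;
  esac

  # Shell arithmetic is integer-only, so the decimal range check is delegated to awk.
  if ! awk -v s="$speed" 'BEGIN { exit !((s ~ /^[0-9]+(\.[0-9]+)?$/) && (s + 0 >= 0.5) && (s + 0 <= 2.0)) }'; then
    echo "speed must be between 0.5 and 2.0" >&2
    return 1
  fi

  case "$format" in
    wav|pcm) ;;
    *) echo "unsupported format: $format" >&2; return 1 ;;
  esac
}
```

For example, `validate_tts_args "欢迎使用智能语音服务" xiaochen 1.2 wav` succeeds silently, while a speed of `2.5` prints an error and returns a non-zero status.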

## Voice Selection Guide

**Choose tongtong (default) for:**
- General purpose narration
- Professional presentations
- Balanced tone requirements

**Choose chuichui for:**
- Content that calls for a male voice
- Deeper, authoritative tone
- Documentary or formal content

**Choose xiaochen for:**
- Young, energetic tone
- Modern, casual content
- Friendly assistant vibe

**Choose jam/kazi/douji/luodo for:**
- Entertainment content
- Character voices
- Creative projects
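
The guide above could be encoded as a small lookup, for instance when a wrapper script chooses a voice from a content tag. This is only a sketch: `pick_voice` and the tag names are hypothetical, and `jam` is an arbitrary stand-in for the four character voices.

```bash
# Hypothetical mapping from a content tag to one of the documented voices.
pick_voice() {
  case "$1" in
    narration|presentation)  echo tongtong ;;
    documentary|formal)      echo chuichui ;;
    casual|assistant)        echo xiaochen ;;
    character|entertainment) echo jam ;;       # arbitrary pick among jam/kazi/douji/luodo
    *)                       echo tongtong ;;  # fall back to the documented default
  esac
}
```

A call such as `bash scripts/text_to_speech.sh "$text" "$(pick_voice formal)"` would then select chuichui.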

## Speed Control

**Recommended speeds:**
- **0.8-1.0**: Clear, professional narration
- **1.0-1.2**: Natural conversational pace (default: 1.0)
- **1.2-1.5**: Energ...

README excerpt

# Zhipu AI TTS Skill

Text-to-speech conversion using Zhipu AI (BigModel) GLM-TTS model. Convert Chinese text to natural-sounding speech with multiple voice options.

## Features

- 🎙️ **Multiple Voices**: 7 different voice personas (tongtong, chuichui, xiaochen, jam, kazi, douji, luodo)
- ⚡ **Speed Control**: Adjustable speech speed from 0.5x to 2.0x
- 🎵 **Multiple Formats**: WAV and PCM output formats
- 🇨🇳 **Chinese Language**: Optimized for Mandarin Chinese synthesis
- 📝 **Long Text Support**: Up to 1024 characters per request
- 🔊 **High Quality**: 24 kHz (24000 Hz) sampling rate

## Requirements

- `jq` - JSON processor
- `ZHIPU_API_KEY` environment variable
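
A check along these lines can confirm that required commands are on the `PATH` before the script runs. `check_commands` is a hypothetical helper, not part of the skill; only `jq` is named as a requirement here, so no other tools are assumed.

```bash
# Hypothetical helper: report every command from the argument list that is not installed.
check_commands() {
  local cmd missing=0
  for cmd in "$@"; do
    if ! command -v "$cmd" >/dev/null 2>&1; then
      echo "missing required command: $cmd" >&2
      missing=1
    fi
  done
  # Return 0 when everything was found, 1 otherwise.
  return "$missing"
}
```

Running `check_commands jq || exit 1` before the conversion reports a missing `jq` by name.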

## Quick Start

```bash
# Install dependencies (if needed)
sudo apt-get install jq

# Set your API key
export ZHIPU_API_KEY="your-key-here"

# Convert text to speech (default settings)
bash scripts/text_to_speech.sh "你好，今天天气怎么样"

# With custom voice and speed
bash scripts/text_to_speech.sh "欢迎使用智能语音服务" xiaochen 1.2 wav greeting.wav
```

## Available Voices

- **tongtong** (彤彤) - Default voice, balanced tone
- **chuichui** (锤锤) - Male voice, deeper tone
- **xiaochen** (小陈) - Young professional voice
- **jam** - 动动动物圈 Jam voice
- **kazi** - 动动动物圈 Kazi voice
- **douji** - 动动动物圈 Douji voice
- **luodo** - 动动动物圈 Luodo voice
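
To compare the voices side by side, one option is to generate the same sentence once per voice. The sketch below only prints the command lines so they can be reviewed first; `voice_sample_commands` and the `sample_<voice>.wav` filenames are hypothetical choices, not something the skill provides.

```bash
# Hypothetical helper: print one text_to_speech.sh invocation per documented voice.
voice_sample_commands() {
  local text=$1 voice
  for voice in tongtong chuichui xiaochen jam kazi douji luodo; do
    # %q shell-quotes the text so the printed line can be pasted or piped to bash safely.
    printf 'bash scripts/text_to_speech.sh %q %s 1.0 wav %s\n' \
      "$text" "$voice" "sample_${voice}.wav"
  done
}
```

Piping the output to `bash` from the skill's root directory would run all seven conversions in sequence.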

## Use Cases

- 📚 Audiobook creation
- 🎮 Game character voices
- 📢 Announcement systems
- 🤖 Virtual assistants
- 🎬 Video dubbing
- 📻 Radio content generation

## Parameters

- `text` (required): Chinese text to convert (max 1024 characters)
- `voice` (optional): Voice persona (default: tongtong)
- `speed` (optional): Speech speed 0.5-2.0 (default: 1.0)
- `output_format` (optional): wav or pcm (default: wav)
- `output_file` (optional): Output filename (default: output.{format})
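
Text longer than the 1024-character limit would presumably need to be split across several requests. One rough approach is to slice it into fixed-size pieces, as sketched below. `split_text` is a hypothetical helper; it cuts at character counts rather than sentence boundaries, and it assumes a UTF-8 locale so that bash counts characters rather than bytes.

```bash
# Hypothetical helper: print the text in pieces of at most `max` characters, one per line.
split_text() {
  local text=$1 max=${2:-1024} offset=0
  while [ "$offset" -lt "${#text}" ]; do
    # Bash substring expansion: `max` characters starting at `offset`.
    printf '%s\n' "${text:offset:max}"
    offset=$((offset + max))
  done
}
```

Each output line can then be passed to `scripts/text_to_speech.sh` with its own output filename. Input that already contains newlines would need a different delimiter, since the sketch emits one chunk per line.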

## Examples

```bash
# Professional greeting
bash scripts/text_to_speech.sh "您好，感谢致电智能客服" tongtong 1.0 wav greeting.wav

# Energetic announce...
```
