Overview
- Skill Key
- franklu0819-lang/zhipu-tts
- Author
- franklu0819-lang
- Source Repo
- openclaw/skills
- Version
- -
- Source Path
- skills/franklu0819-lang/zhipu-tts
- Latest Commit SHA
- fd1f6b179bf9217aa9ff187a4a11d6ba1be395e8
Text-to-speech conversion using Zhipu AI (BigModel) GLM-TTS model. Use when you need to convert text to audio files with various voice options. Supports Chinese text synthesis with multiple voice personas, speed control, and output formats.
Stars
0
Installs
0
Status
ACTIVE
Visibility
PUBLIC
直接复制以下提示词,发送给你的 AI 助手即可完成安装。
请先检查是否已安装 SkillHub 商店,若未安装,请根据 https://skillhub-1388575217.cos.ap-guangzhou.myqcloud.com/install/skillhub.md 安装 SkillHub 商店,然后安装 zhipu-tts 技能。 若已安装,则直接安装 zhipu-tts 技能。
# Zhipu AI Text-to-Speech
Convert Chinese text to natural-sounding speech using Zhipu AI's GLM-TTS model.
## Setup
**1. Get your API Key:**
Get a key from [Zhipu AI Console](https://bigmodel.cn/usercenter/proj-mgmt/apikeys)
**2. Set it in your environment:**
```bash
export ZHIPU_API_KEY="your-key-here"
```
## Available Voices
### System Voices (Pre-built)
- **tongtong** (彤彤) - Default voice, balanced tone
- **chuichui** (锤锤) - Male voice, deeper tone
- **xiaochen** (小陈) - Young professional voice
- **jam** - 动动动物圈 Jam voice
- **kazi** - 动动动物圈 Kazi voice
- **douji** - 动动动物圈 Douji voice
- **luodo** - 动动动物圈 Luodo voice
## Usage
### Basic Text-to-Speech
Convert text to speech with default settings (tongtong voice, normal speed, WAV format):
```bash
bash scripts/text_to_speech.sh "你好,今天天气怎么样"
```
### Advanced Options
Specify voice, speed, format, and output filename:
```bash
bash scripts/text_to_speech.sh "欢迎使用智能语音服务" xiaochen 1.2 wav greeting.wav
```
**Parameters:**
- `text` (required): Chinese text to convert (max 1024 characters)
- `voice` (optional): tongtong (default), chuichui, xiaochen, jam, kazi, douji, luodo
- `speed` (optional): Speech speed from 0.5 to 2.0 (default: 1.0)
- `output_format` (optional): wav (default), pcm
- `output_file` (optional): Output filename (default: output.{format})
## Voice Selection Guide
**Choose tongtong (default) for:**
- General purpose narration
- Professional presentations
- Balanced tone requirements
**Choose chuichui for:**
- Male voice needed
- Deeper, authoritative tone
- Documentary or formal content
**Choose xiaochen for:**
- Young, energetic tone
- Modern, casual content
- Friendly assistant vibe
**Choose jam/kazi/douji/luodo for:**
- Entertainment content
- Character voices
- Creative projects
## Speed Control
**Recommended speeds:**
- **0.8-1.0**: Clear, professional narration
- **1.0-1.2**: Natural conversational pace (default: 1.0)
- **1.2-1.5**: Energ...
# Zhipu AI TTS Skill
Text-to-speech conversion using Zhipu AI (BigModel) GLM-TTS model. Convert Chinese text to natural-sounding speech with multiple voice options.
## Features
- 🎙️ **Multiple Voices**: 7 different voice personas (tongtong, chuichui, xiaochen, jam, kazi, douji, luodo)
- ⚡ **Speed Control**: Adjustable speech speed from 0.5x to 2.0x
- 🎵 **Multiple Formats**: WAV and PCM output formats
- 🇨🇳 **Chinese Language**: Optimized for Mandarin Chinese synthesis
- 📝 **Long Text Support**: Up to 1024 characters per request
- 🔊 **High Quality**: 24000 Hz sampling rate for optimal audio quality
## Requirements
- `jq` - JSON processor
- `ZHIPU_API_KEY` environment variable
## Quick Start
```bash
# Install dependencies (if needed)
sudo apt-get install jq
# Set your API key
export ZHIPU_API_KEY="your-key-here"
# Convert text to speech (default settings)
bash scripts/text_to_speech.sh "你好,今天天气怎么样"
# With custom voice and speed
bash scripts/text_to_speech.sh "欢迎使用智能语音服务" xiaochen 1.2 wav greeting.wav
```
## Available Voices
- **tongtong** (彤彤) - Default balanced tone
- **chuichui** (锤锤) - Male voice, deeper tone
- **xiaochen** (小陈) - Young professional voice
- **jam** - 动动动物圈 Jam voice
- **kazi** - 动动动物圈 Kazi voice
- **douji** - 动动动物圈 Douji voice
- **luodo** - 动动动物圈 Luodo voice
## Use Cases
- 📚 Audiobook creation
- 🎮 Game character voices
- 📢 Announcement systems
- 🤖 Virtual assistants
- 🎬 Video dubbing
- 📻 Radio content generation
## Parameters
- `text` (required): Chinese text to convert (max 1024 characters)
- `voice` (optional): Voice persona (default: tongtong)
- `speed` (optional): Speech speed 0.5-2.0 (default: 1.0)
- `output_format` (optional): wav or pcm (default: wav)
- `output_file` (optional): Output filename (default: output.{format})
## Examples
```bash
# Professional greeting
bash scripts/text_to_speech.sh "您好,感谢致电智能客服" tongtong 1.0 wav greeting.wav
# Energetic announce...
chrislyonshfx
Free Sports Betting Predictions and Picks for NBA and NHL: high win rate daily moneyline winners from Otterline's AI consensus model. Confidence-tiered (Elite -> Strong). No API key needed.
clawdeploy
Free casino gaming platform for OpenClaw agents. Agents register with 1000 free chips and play blackjack, poker, roulette, slots, dice, and baccarat against each other. Use when the user mentions "casino", "gamble", "blackjack", "poker", "slots", "roulette", "dice", "chips", "leaderboard", or wants agents to play games. No real money involved.
cryptorabea
Use when building game systems, implementing Unity/Unreal features, or optimizing game performance. Invoke for Unity, Unreal, game patterns, ECS, physics, networking, performance optimization.
contrario
Nobel-level strategic business intelligence with game theory, Monte Carlo simulations, market analysis, and implementation roadmaps.
cosformula
Export and sync Douban (豆瓣) book/movie/music/game collections to local CSV files (Obsidian-compatible). Use when the user wants to export their Douban reading/watching/listening/gaming history, set up incremental sync via RSS, or manage their Douban data locally.
byeolbit
Fetch and summarize the latest video game news from major gaming outlets (IGN, Kotaku, GameSpot, Polygon, Eurogamer, Rock Paper Shotgun, VG247, Gematsu, PlayStation Blog). Use when the user invokes /gamer-news, asks for gaming news, 게임 뉴스, 게임 소식, latest game announcements, or recent news about games, consoles, or the gaming industry.