TopRank Skills

Home / Claw Skills / 其他 / qwen3-tts-mlx
Official OpenClaw rules 15%

qwen3-tts-mlx

Local Qwen3-TTS speech synthesis on Apple Silicon via MLX. Use for offline narration, audiobooks, video voiceovers, and multilingual TTS.

Stars

0

Installs

0

Status

ACTIVE

Visibility

PUBLIC

安装方式

直接复制以下提示词,发送给你的 AI 助手即可完成安装。

请先检查是否已安装 SkillHub 商店,若未安装,请根据 https://skillhub-1388575217.cos.ap-guangzhou.myqcloud.com/install/skillhub.md 安装 SkillHub 商店,然后安装 qwen3-tts-mlx 技能。 若已安装,则直接安装 qwen3-tts-mlx 技能。

Overview

Skill Key
h1bomb/qwen3-tts-mlx
Author
h1bomb
Source Repo
openclaw/skills
Version
-
Source Path
skills/h1bomb/qwen3-tts-mlx
Latest Commit SHA
526718511b2cf95c1d924da250e989c86af0a164

Extracted Content

SKILL.md excerpt

# Qwen3-TTS MLX

Run Qwen3-TTS locally on Apple Silicon (M1/M2/M3/M4) using MLX. Supports 11 languages, 9 built-in voices, voice cloning, and voice design from text descriptions.

## When to Use

- Generate speech fully offline on a Mac
- Produce narration, audiobooks, podcasts, or video voiceovers
- Create multilingual TTS with controllable style and emotion
- Clone any voice from a short audio sample
- Design custom voices from text descriptions

## Quick Start

### Install

```bash
pip install mlx-audio
brew install ffmpeg
```

### Basic Usage

```bash
python scripts/run_tts.py custom-voice \
  --text "Hello, welcome to local text to speech." \
  --voice Ryan \
  --output output.wav
```

### With Style Control

```bash
python scripts/run_tts.py custom-voice \
  --text "Breaking news: local AI model achieves human-level speech." \
  --voice Uncle_Fu \
  --instruct "news anchor tone, calm and authoritative" \
  --output news.wav
```

## Model Variants

| Variant | Model | Size | Memory | Use Case |
|---------|-------|------|--------|----------|
| CustomVoice | `mlx-community/Qwen3-TTS-12Hz-0.6B-CustomVoice-4bit` | ~1GB | ~4GB | Built-in voices + style control (recommended) |
| VoiceDesign | `mlx-community/Qwen3-TTS-12Hz-1.7B-VoiceDesign-5bit` | ~2GB | ~5GB | Create voices from text descriptions |
| Base | `mlx-community/Qwen3-TTS-12Hz-0.6B-Base-4bit` | ~1GB | ~4GB | Voice cloning from reference audio |

## Supported Languages

| Language | Code | Notes |
|----------|------|-------|
| Auto-detect | `auto` | Default, detects from text |
| Chinese | `Chinese` | Mandarin |
| English | `English` | |
| Japanese | `Japanese` | |
| Korean | `Korean` | |
| French | `French` | |
| German | `German` | |
| Spanish | `Spanish` | |
| Portuguese | `Portuguese` | |
| Italian | `Italian` | |
| Russian | `Russian` | |

## Built-in Voices

| Voice | Language | Character |
|-------|----------|-----------|
| Vivian | Chinese | Female, bright, young |
| Serena | Chinese | Female, gentle...

Related Claw Skills