TopRank Skills

Home / Claw Skills / API 集成 / alicloud-ai-audio-asr
Official OpenClaw rules 36%

alicloud-ai-audio-asr

Transcribe non-realtime speech with Alibaba Cloud Model Studio Qwen ASR models (`qwen3-asr-flash`, `qwen-audio-asr`, `qwen3-asr-flash-filetrans`). Use when converting recorded audio files to text, generating transcripts with timestamps, or documenting DashScope/OpenAI-compatible ASR request and response fields.

Stars

0

Installs

0

Status

ACTIVE

Visibility

PUBLIC

安装方式

直接复制以下提示词,发送给你的 AI 助手即可完成安装。

请先检查是否已安装 SkillHub 商店,若未安装,请根据 https://skillhub-1388575217.cos.ap-guangzhou.myqcloud.com/install/skillhub.md 安装 SkillHub 商店,然后安装 alicloud-ai-audio-asr 技能。 若已安装,则直接安装 alicloud-ai-audio-asr 技能。

Overview

Skill Key
cinience/alicloud-ai-audio-asr
Author
cinience
Source Repo
openclaw/skills
Version
1.0.0
Source Path
skills/cinience/alicloud-ai-audio-asr
Latest Commit SHA
bc5cf882e7b2dba8871ff3c6a0789d49f2a5b955

Extracted Content

SKILL.md excerpt

Category: provider

# Model Studio Qwen ASR (Non-Realtime)

## Validation

```bash
mkdir -p output/alicloud-ai-audio-asr
python -m py_compile skills/ai/audio/alicloud-ai-audio-asr/scripts/transcribe_audio.py && echo "py_compile_ok" > output/alicloud-ai-audio-asr/validate.txt
```

Pass criteria: command exits 0 and `output/alicloud-ai-audio-asr/validate.txt` is generated.

## Output And Evidence

- Store transcripts and API responses under `output/alicloud-ai-audio-asr/`.
- Keep one command log or sample response per run.

Use Qwen ASR for recorded audio transcription (non-realtime), including short audio sync calls and long audio async jobs.

## Critical model names

Use one of these exact model strings:
- `qwen3-asr-flash`
- `qwen-audio-asr`
- `qwen3-asr-flash-filetrans`

Selection guidance:
- Use `qwen3-asr-flash` or `qwen-audio-asr` for short/normal recordings (sync).
- Use `qwen3-asr-flash-filetrans` for long-file transcription (async task workflow).

## Prerequisites

- Install SDK dependencies (script uses Python stdlib only):

```bash
python3 -m venv .venv
. .venv/bin/activate
```

- Set `DASHSCOPE_API_KEY` in environment, or add `dashscope_api_key` to `~/.alibabacloud/credentials`.

## Normalized interface (asr.transcribe)

### Request
- `audio` (string, required): public URL or local file path.
- `model` (string, optional): default `qwen3-asr-flash`.
- `language_hints` (array<string>, optional): e.g. `zh`, `en`.
- `sample_rate` (number, optional)
- `vocabulary_id` (string, optional)
- `disfluency_removal_enabled` (bool, optional)
- `timestamp_granularities` (array<string>, optional): e.g. `sentence`.
- `async` (bool, optional): default false for sync models, true for `qwen3-asr-flash-filetrans`.

### Response
- `text` (string): normalized transcript text.
- `task_id` (string, optional): present for async submission.
- `status` (string): `SUCCEEDED` or submission status.
- `raw` (object): original API response.

## Quick start (official HTTP API)

Sync trans...

Related Claw Skills