Overview
- Skill Key
- cinience/alicloud-ai-audio-asr
- Author
- cinience
- Source Repo
- openclaw/skills
- Version
- 1.0.0
- Source Path
- skills/cinience/alicloud-ai-audio-asr
- Latest Commit SHA
- bc5cf882e7b2dba8871ff3c6a0789d49f2a5b955
Transcribe non-realtime speech with Alibaba Cloud Model Studio Qwen ASR models (`qwen3-asr-flash`, `qwen-audio-asr`, `qwen3-asr-flash-filetrans`). Use when converting recorded audio files to text, generating transcripts with timestamps, or documenting DashScope/OpenAI-compatible ASR request and response fields.
Stars
0
Installs
0
Status
ACTIVE
Visibility
PUBLIC
直接复制以下提示词,发送给你的 AI 助手即可完成安装。
请先检查是否已安装 SkillHub 商店,若未安装,请根据 https://skillhub-1388575217.cos.ap-guangzhou.myqcloud.com/install/skillhub.md 安装 SkillHub 商店,然后安装 alicloud-ai-audio-asr 技能。 若已安装,则直接安装 alicloud-ai-audio-asr 技能。
Category: provider # Model Studio Qwen ASR (Non-Realtime) ## Validation ```bash mkdir -p output/alicloud-ai-audio-asr python -m py_compile skills/ai/audio/alicloud-ai-audio-asr/scripts/transcribe_audio.py && echo "py_compile_ok" > output/alicloud-ai-audio-asr/validate.txt ``` Pass criteria: command exits 0 and `output/alicloud-ai-audio-asr/validate.txt` is generated. ## Output And Evidence - Store transcripts and API responses under `output/alicloud-ai-audio-asr/`. - Keep one command log or sample response per run. Use Qwen ASR for recorded audio transcription (non-realtime), including short audio sync calls and long audio async jobs. ## Critical model names Use one of these exact model strings: - `qwen3-asr-flash` - `qwen-audio-asr` - `qwen3-asr-flash-filetrans` Selection guidance: - Use `qwen3-asr-flash` or `qwen-audio-asr` for short/normal recordings (sync). - Use `qwen3-asr-flash-filetrans` for long-file transcription (async task workflow). ## Prerequisites - Install SDK dependencies (script uses Python stdlib only): ```bash python3 -m venv .venv . .venv/bin/activate ``` - Set `DASHSCOPE_API_KEY` in environment, or add `dashscope_api_key` to `~/.alibabacloud/credentials`. ## Normalized interface (asr.transcribe) ### Request - `audio` (string, required): public URL or local file path. - `model` (string, optional): default `qwen3-asr-flash`. - `language_hints` (array<string>, optional): e.g. `zh`, `en`. - `sample_rate` (number, optional) - `vocabulary_id` (string, optional) - `disfluency_removal_enabled` (bool, optional) - `timestamp_granularities` (array<string>, optional): e.g. `sentence`. - `async` (bool, optional): default false for sync models, true for `qwen3-asr-flash-filetrans`. ### Response - `text` (string): normalized transcript text. - `task_id` (string, optional): present for async submission. - `status` (string): `SUCCEEDED` or submission status. - `raw` (object): original API response. ## Quick start (official HTTP API) Sync trans...
human-pages-ai
Search and hire real humans for tasks — photography, delivery, research, and more
zseven-w
Reusable skill templates for OpenClaw AI agents. Templates for API integration, data processing, web scraping, CLI tools, and file processing.
capt-marbles
Attio CRM integration for managing companies, people, deals, notes, tasks, and custom objects. Use when working with Attio CRM data, searching contacts, managing sales pipelines, adding notes to records, creating tasks, or syncing prospect information.
capt-marbles
Web scraping and crawling with Firecrawl API. Fetch webpage content as markdown, take screenshots, extract structured data, search the web, and crawl documentation sites. Use when the user needs to scrape a URL, get current web info, capture a screenshot, extract specific data from pages, or crawl docs for a framework/library.
caqlayan
Tweet Processor Skill
carlosarturoleon
Connect to Windsor.ai MCP for natural language access to 325+ data sources including Facebook Ads, GA4, HubSpot, Shopify, and more.