Overview
- Skill Key: davideuler/human-avatar
- Author: davideuler
- Source Repo: openclaw/skills
- Version: -
- Source Path: skills/davideuler/human-avatar
- Latest Commit SHA: 417d2128ead97a4ab2231097c89a1d033420d4f2
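Generate talking-head videos with the Alibaba Cloud DashScope/LingMou (灵眸) APIs. Three modes are supported: EMO (a portrait plus audio drives a talking-head video, in a two-step flow), AA/Animate Anyone (full-body animation), and LingMou (template-based digital-human talking-head video). This skill is triggered when the user needs to produce a talking-head video, a digital-human video, an EMO/AA face animation, or a VideoRetalk character replacement.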
- Stars: 0
- Installs: 0
- Status: ACTIVE
- Visibility: PUBLIC
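Copy the prompt below and send it to your AI assistant to complete the installation.
First check whether the SkillHub store is installed. If it is not, install it by following https://skillhub-1388575217.cos.ap-guangzhou.myqcloud.com/install/skillhub.md and then install the human-avatar skill. If it is already installed, install the human-avatar skill directly.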
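# Human Avatar: Alibaba Cloud talking-head video generation
## Modes
| Mode | API | Auth | Region | Description |
|------|------|------|--------|------|
| **EMO** | DashScope | `DASHSCOPE_API_KEY` | **cn-beijing** | Portrait + audio → talking-head video; requires a detect step first |
| **AA** (Animate Anyone) | DashScope | `DASHSCOPE_API_KEY` | **cn-beijing** | Portrait + motion video → full-body animation |
| **LingMou** (灵眸) | Standalone product SDK | AK/SK | **cn-beijing** | Template-based digital-human talking-head video |
| **VideoRetalk** (video character replacement) | DashScope | `DASHSCOPE_API_KEY` | **cn-beijing** | Replaces the character in a video |
> ⚠️ **The region is fixed to cn-beijing.** The API key must be activated in the Beijing region and cannot be mixed with a Singapore key.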
## Prerequisites
```bash
pip install requests dashscope oss2
# LingMou additionally requires:
pip install alibabacloud-lingmou20250527 alibabacloud-tea-openapi
```
Environment variables:
```bash
export DASHSCOPE_API_KEY=sk-xxxx # DashScope API key (from the Bailian console)
export ALIBABA_CLOUD_ACCESS_KEY_ID=xxx # used by LingMou
export ALIBABA_CLOUD_ACCESS_KEY_SECRET=xxx
export OSS_BUCKET=xxx # used for uploading local files
export OSS_ENDPOINT=oss-cn-beijing.aliyuncs.com
```
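Before running any of the scripts, it can help to confirm that the variables for the chosen mode are actually set. The following is a minimal sketch, not part of the skill itself: the mapping mirrors the mode table and the export list, the mode keys `lingmou` and `videoretalk` are hypothetical names chosen for illustration, and treating the OSS variables as required only for local-file uploads is an assumption based on the comment next to `OSS_BUCKET`.

```python
import os

# Variables each mode needs, per the mode table and export list.
# The keys "lingmou" and "videoretalk" are illustrative names (assumption).
REQUIRED_ENV = {
    "emo": ["DASHSCOPE_API_KEY"],
    "aa": ["DASHSCOPE_API_KEY"],
    "videoretalk": ["DASHSCOPE_API_KEY"],
    "lingmou": ["ALIBABA_CLOUD_ACCESS_KEY_ID", "ALIBABA_CLOUD_ACCESS_KEY_SECRET"],
}

# Assumed to matter only when a local file has to be uploaded first.
UPLOAD_ENV = ["OSS_BUCKET", "OSS_ENDPOINT"]


def missing_env(mode, needs_upload=False, env=None):
    """Return the names of required variables that are unset or empty."""
    env = os.environ if env is None else env
    names = list(REQUIRED_ENV[mode])
    if needs_upload:
        names += UPLOAD_ENV
    return [name for name in names if not env.get(name)]


if __name__ == "__main__":
    missing = missing_env("emo", needs_upload=True)
    if missing:
        print("Missing environment variables:", ", ".join(missing))
```

Passing `env` explicitly keeps the check easy to exercise without touching the real process environment.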
## EMO workflow (two steps)
```
Step 1: emo-detect-v1 detects the face → returns face_bbox, ext_bbox
↓
Step 2: emo-v1 submits the generation job → task_id
↓
Poll GET /api/v1/tasks/{task_id} → SUCCEEDED → video_url
```
```
```bash
python scripts/portrait_animate.py \
--image-url "https://example.com/portrait.jpg" \
--audio-url "https://example.com/speech.mp3" \
--download
```
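The two-step flow and the polling loop can be sketched in Python as follows. This is an illustration under stated assumptions, not the contents of `scripts/portrait_animate.py`: the host `https://dashscope.aliyuncs.com`, the `ratio` parameter, and the response fields `output.task_status` and `output.results.video_url` are assumptions that should be checked against the DashScope documentation. Only the model names, `face_bbox`, `ext_bbox`, `task_id`, the `/api/v1/tasks/{task_id}` path, `SUCCEEDED`, and `video_url` come from the workflow above.

```python
import json
import time
import urllib.request

BASE_URL = "https://dashscope.aliyuncs.com"  # assumed cn-beijing endpoint


def detect_payload(image_url, ratio="1:1"):
    """Request body for step 1 (face detection). `ratio` is an assumed parameter."""
    return {
        "model": "emo-detect-v1",
        "input": {"image_url": image_url},
        "parameters": {"ratio": ratio},
    }


def generate_payload(image_url, audio_url, detect_output):
    """Request body for step 2, reusing the boxes returned by step 1."""
    return {
        "model": "emo-v1",
        "input": {
            "image_url": image_url,
            "audio_url": audio_url,
            "face_bbox": detect_output["face_bbox"],
            "ext_bbox": detect_output["ext_bbox"],
        },
    }


def make_fetcher(api_key):
    """Build a fetch(task_id) callable that queries the task endpoint."""
    def fetch(task_id):
        request = urllib.request.Request(
            f"{BASE_URL}/api/v1/tasks/{task_id}",
            headers={"Authorization": f"Bearer {api_key}"},
        )
        with urllib.request.urlopen(request, timeout=30) as response:
            return json.load(response)
    return fetch


def poll_task(task_id, fetch, interval=5.0, timeout=600.0, sleep=time.sleep):
    """Poll until the task succeeds, fails, or the timeout elapses."""
    deadline = time.monotonic() + timeout
    while True:
        output = fetch(task_id).get("output", {})
        status = output.get("task_status")  # assumed field name
        if status == "SUCCEEDED":
            return output["results"]["video_url"]  # assumed result layout
        if status in ("FAILED", "CANCELED", "UNKNOWN"):  # assumed terminal states
            raise RuntimeError(f"task {task_id} ended with status {status}")
        if time.monotonic() >= deadline:
            raise TimeoutError(f"task {task_id} still {status} after {timeout}s")
        sleep(interval)
```

A real run would pass `make_fetcher(os.environ["DASHSCOPE_API_KEY"])` as `fetch`. Injecting `fetch` and `sleep` keeps the loop testable without network access.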
## LingMou workflow (template-based)
```
1. Query the template list → templateId (already saved in digital_human_template.json)
2. CreateBroadcastVideoFromTemplate (variables replace text_content)
3. Poll ListBroadcastVideosById → SUCCESS → videoURL
```
```bash
python scripts/avatar_video.py \
--template-id "BS1b2WNnRMu4ouRzT4clY9Jhg" \
--text "大家好,欢迎收看今天的科技新闻。" \
--download
```
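Step 3 of the LingMou workflow is a wait loop around `ListBroadcastVideosById`. The sketch below shows one way to structure it without depending on the SDK's exact signatures: `list_by_id` is a hypothetical callable that wraps the SDK call and returns a list of plain dicts, and the keys `id` and `status` are assumptions about that wrapper's output. Only the `SUCCESS` state and the `videoURL` field are taken from the workflow above, so any other status is treated as "not ready yet".

```python
import time


def wait_for_broadcast_video(video_id, list_by_id, interval=10.0,
                             max_polls=60, sleep=time.sleep):
    """Poll until the video reports SUCCESS, then return its videoURL.

    `list_by_id` is a hypothetical wrapper around ListBroadcastVideosById
    that takes a list of ids and returns a list of dicts.
    """
    last_status = None
    for _ in range(max_polls):
        videos = list_by_id([video_id])
        # Pick the entry for our video; the "id" key is an assumption.
        video = next((v for v in videos if v.get("id") == video_id), None)
        if video is not None:
            last_status = video.get("status")
            if last_status == "SUCCESS":
                return video["videoURL"]
        sleep(interval)
    raise TimeoutError(
        f"video {video_id} not ready after {max_polls} polls "
        f"(last status: {last_status})"
    )
```

Because failure states are not listed in the workflow, this sketch relies on the poll limit alone to stop; a production version would also want to exit early on whatever failure status the service reports.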
## One-command demo pipeline
```bash
# EMO
python scripts/demo_pipeline.py --mode emo --image ./face.jpg --audio ./speech.mp3 --download
# AA
python scripts/demo_pipeline.py --mode aa --model <AA_MODEL_NAME> --image-url https://... --vide...
```