🧪 Skills

digital-human-training

数字人训练与部署 Skill - 提供从语音克隆、唇形同步到实时交互数字人的全流程训练建议与技术支持。

v1.0.0

⭐ —

❤️ 0

⬇️ 150

👁 1

Save 📁 Collect

Share

Description

name: digital-human-training description: 数字人训练与部署 Skill - 提供从语音克隆、唇形同步到实时交互数字人的全流程训练建议与技术支持。 version: 1.0.0 author: xiaoai keywords: digital-human, avatar, whisper, tts, lip-sync, 数字人, 训练

数字人训练与部署 Skill

提供构建实时交互数字人的全流程指导，涵盖从素材采集到模型训练。

核心能力

🎙️ 语音克隆 (Voice Cloning)：指导使用 GPT-SoVITS 或 Fish Speech 进行高保真声音训练。
😶 唇形驱动 (Lip Sync)：适配 SadTalker, Live2D 或 Wav2Lip 的技术方案。
🧠 大脑集成 (LLM)：将 OpenClaw 的逻辑层与数字人视觉层打通。
⚡ 实时推理：优化推理延迟，实现 < 500ms 的数字人交互反馈。

技术路线图

素材准备：高清视频（绿幕背景）、清晰的 1-3 分钟干声采样。
模型选择：
- 2D 真人：HeyGen 路线或私有化部署 Easy-Wav2Lip。
- 3D/Live2D：Unity 集成。
部署方案：Local GPU (Nvidia RTW) vs Cloud API。

Example Usage

指令：我想做一个能实时回答问题的数字人，该怎么选型？输出：

方案 A (自建): GPT-SoVITS (语音) + Easy-Wav2Lip (视觉) + OpenClaw (逻辑)。
方案 B (低代码): HeyGen Streaming API 集成。
关键建议: 注意音频与视频的同步延迟，建议使用流式传输。

由小爱开发 | 数字人项目衍生

Reviews (0)

Sign in to write a review.

No reviews yet. Be the first to review!

Comments (0)

Sign in to join the discussion.

No comments yet. Be the first to share your thoughts!

Compatible Platforms

Links

📂 Source Code

Pricing

Free

Related Configs

self-improving-agent

Captures learnings, errors, and corrections to enable continuous improvement. Use when: (1) A command or operation fails unexpectedly, (2) User corrects Clau...

❤️ 2.0k ⬇️ 218k

Self Improving Agent

Captures learnings, errors, and corrections to enable continuous improvement. And also 50+ models for image generation, video generation, text-to-speech, spe...

❤️ 2.0k ⬇️ 206k

Find Skills

Search, discover, and install skills from the open agent skills ecosystem to extend agent capabilities for specific tasks or domains.

❤️ 814 ⬇️ 199k

Summarize

--- name: summarize description: Summarize URLs or files with the summarize CLI (web, PDFs, images, audio, YouTube). homepage: https://summarize.sh metadata: {"clawdbot":{"emoji":"🧾","requires":{"b

❤️ 609 ⬇️ 160k