SiliconFlow TTS Gen
Text-to-Speech using SiliconFlow API (CosyVoice2). Supports multiple voices, languages, and dialects.
v1.0.0
Description
name: siliconflow-tts-gen
description: Text-to-Speech using SiliconFlow API (CosyVoice2). Supports multiple voices, languages, and dialects.
env:
- SILICONFLOW_API_KEY
files:
config:
- ~/.openclaw/openclaw.json
SiliconFlow TTS Generation Skill
Text-to-Speech using the SiliconFlow API with the CosyVoice2 model. Supports 8 preset voices, multiple languages, and Chinese dialects.
Features
- 🎙️ 8 Preset Voices: 4 male + 4 female voices
- 🌍 Multilingual: Chinese, English, Japanese, Korean
- 🗣️ Chinese Dialects: Cantonese, Sichuan, Shanghai, Tianjin, Wuhan
- ⚡ Ultra Low Latency: 150ms first packet delay
- 🎵 Voice Cloning: 3-second rapid voice cloning
- 💾 Auto Download: Saves audio files locally
Requirements
- Environment Variable: SILICONFLOW_API_KEY
- Optional Config File: ~/.openclaw/openclaw.json (for auto-detect)
Installation
npx clawhub install siliconflow-tts-gen
Configuration
Set your SiliconFlow API key:
export SILICONFLOW_API_KEY="your-api-key"
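The lookup order implied here (environment variable first, then the optional config file) could be sketched as follows. This is only an illustration: the layout of ~/.openclaw/openclaw.json is not documented in this README, so the sketch assumes the key is stored somewhere in the JSON under the name SILICONFLOW_API_KEY and searches for it recursively. `resolve_api_key` and `_find_key` are hypothetical helper names, not functions taken from scripts/generate.py.

```python
import json
import os
from pathlib import Path
from typing import Any, Mapping, Optional

ENV_VAR = "SILICONFLOW_API_KEY"
DEFAULT_CONFIG = Path.home() / ".openclaw" / "openclaw.json"


def _find_key(node: Any, name: str) -> Optional[str]:
    """Depth-first search for a non-empty string stored under `name`.

    Assumption: the real config layout is unknown, so any nesting is accepted.
    """
    if isinstance(node, dict):
        value = node.get(name)
        if isinstance(value, str) and value:
            return value
        children = list(node.values())
    elif isinstance(node, list):
        children = node
    else:
        return None
    for child in children:
        found = _find_key(child, name)
        if found:
            return found
    return None


def resolve_api_key(env: Mapping[str, str] = os.environ,
                    config_path: Path = DEFAULT_CONFIG) -> Optional[str]:
    """Return the key from the environment, else from the config file, else None."""
    key = env.get(ENV_VAR)
    if key:
        return key
    try:
        config = json.loads(config_path.read_text(encoding="utf-8"))
    except (OSError, ValueError):
        return None  # config file missing, unreadable, or not valid JSON
    return _find_key(config, ENV_VAR)
```

Passing `env` and `config_path` as parameters keeps the lookup easy to test without touching the real environment or home directory.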
Usage
List Available Voices
python3 scripts/generate.py --list-voices
Generate Speech
# Basic usage (default voice: alex)
python3 scripts/generate.py "你好,世界"
# Specify voice
python3 scripts/generate.py "Hello World" --voice bella
# Adjust speed
python3 scripts/generate.py "你好" --voice claire --speed 0.9
# Save to file
python3 scripts/generate.py "欢迎收听" --output welcome.mp3
# Change format
python3 scripts/generate.py "Hello" --format wav
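For readers who want to see roughly what such a command might do under the hood, here is a minimal sketch of a direct HTTP call using only the standard library. It is not the code from scripts/generate.py. The endpoint path `/v1/audio/speech`, the model ID `FunAudioLLM/CosyVoice2-0.5B`, and the `model:voice` naming are assumptions based on SiliconFlow's public API and do not appear in this README; only the host api.siliconflow.cn does. `build_request` and `synthesize` are hypothetical names.

```python
import json
import urllib.request

# Assumptions (not stated in this README): endpoint path and model ID.
API_URL = "https://api.siliconflow.cn/v1/audio/speech"
MODEL = "FunAudioLLM/CosyVoice2-0.5B"


def build_request(text, api_key, voice="alex", speed=1.0, fmt="mp3"):
    """Build the POST request for one utterance without sending it."""
    payload = {
        "model": MODEL,
        "input": text,
        "voice": f"{MODEL}:{voice}",  # assumed "model:voice" naming scheme
        "response_format": fmt,
        "speed": speed,
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def synthesize(text, api_key, output="output.mp3", **options):
    """Send the request and write the returned audio bytes to `output`."""
    request = build_request(text, api_key, **options)
    with urllib.request.urlopen(request, timeout=60) as response:
        audio = response.read()
    with open(output, "wb") as fh:
        fh.write(audio)
    return output
```

Under these assumptions, `synthesize("Hello World", key, voice="bella")` would correspond to the `--voice bella` example above. Building the request separately from sending it makes the payload inspectable without a network call.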
Available Voices
Male Voices
| ID | Name | Characteristic |
|---|---|---|
| alex | 沉稳男声 (steady male voice) | Mature and steady |
| benjamin | 低沉男声 (deep male voice) | Deep and low |
| charles | 磁性男声 (magnetic male voice) | Magnetic |
| david | 欢快男声 (cheerful male voice) | Cheerful |
Female Voices
| ID | Name | Characteristic |
|---|---|---|
| anna | 沉稳女声 (steady female voice) | Mature and elegant |
| bella | 激情女声 (passionate female voice) | Passionate |
| claire | 温柔女声 (gentle female voice) | Gentle and kind |
| diana | 欢快女声 (cheerful female voice) | Sweet and happy |
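The two voice tables can be captured as a small lookup structure, which is one plausible way to back a `--list-voices` option or to reject unknown voice IDs early. This is an illustrative sketch only: the IDs and characteristics come from the tables above, while `VOICES`, `list_voices`, and `format_voice_table` are hypothetical names.

```python
# Voice ID -> (gender, characteristic), copied from the tables above.
VOICES = {
    "alex": ("male", "Mature and steady"),
    "benjamin": ("male", "Deep and low"),
    "charles": ("male", "Magnetic"),
    "david": ("male", "Cheerful"),
    "anna": ("female", "Mature and elegant"),
    "bella": ("female", "Passionate"),
    "claire": ("female", "Gentle and kind"),
    "diana": ("female", "Sweet and happy"),
}


def list_voices(gender=None):
    """Return voice IDs in table order, optionally filtered by gender."""
    return [vid for vid, (g, _) in VOICES.items() if gender in (None, g)]


def format_voice_table():
    """Render the catalog as aligned plain-text rows."""
    return "\n".join(
        f"{vid:<10}{gender:<8}{desc}" for vid, (gender, desc) in VOICES.items()
    )


if __name__ == "__main__":
    print(format_voice_table())
```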
Parameters
| Parameter | Type | Default | Range | Description |
|---|---|---|---|---|
| --voice | string | alex | - | Voice ID |
| --speed | float | 1.0 | 0.25-4.0 | Speech speed |
| --format | string | mp3 | mp3/opus/wav/pcm | Output format |
| --output | string | output.mp3 | - | Output file path |
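As a rough illustration of how the defaults and ranges in this table could be enforced, the sketch below builds an `argparse` parser that rejects out-of-range speeds and unknown formats. It assumes the script uses `argparse`, which this README does not confirm; `speed_value` and `build_parser` are hypothetical names.

```python
import argparse

FORMATS = ("mp3", "opus", "wav", "pcm")


def speed_value(raw):
    """Parse --speed and enforce the documented 0.25-4.0 range."""
    value = float(raw)  # a ValueError here is reported by argparse as a usage error
    if not 0.25 <= value <= 4.0:
        raise argparse.ArgumentTypeError("speed must be between 0.25 and 4.0")
    return value


def build_parser():
    parser = argparse.ArgumentParser(description="SiliconFlow TTS generation")
    parser.add_argument("text", nargs="?", help="text to synthesize")
    parser.add_argument("--voice", default="alex", help="voice ID")
    parser.add_argument("--speed", type=speed_value, default=1.0, help="speech speed")
    parser.add_argument("--format", choices=FORMATS, default="mp3", help="output format")
    parser.add_argument("--output", default="output.mp3", help="output file path")
    parser.add_argument("--list-voices", action="store_true", help="list voices and exit")
    return parser


if __name__ == "__main__":
    print(build_parser().parse_args())
```

Validating in the parser means a bad value fails before any API call is made, so no request quota is spent on input the service would reject.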
Security Notes
- This skill requires an API key to call SiliconFlow services
- The script reads ~/.openclaw/openclaw.json only to auto-detect API keys
- No sensitive data is transmitted except to api.siliconflow.cn
- Review the code at scripts/generate.py before providing credentials
Author
MaxStorm Team
License
MIT