name: voice-message
version: 1.0.4
description: Send voice messages across chat channels (Telegram, Discord, Feishu/Lark, Signal, WhatsApp, and others) using edge-tts for text-to-speech and ffmpeg for audio conversion. IMPORTANT - Feishu/Lark does NOT support asVoice=true via the message tool; you MUST use this skill to send voice messages on Feishu, otherwise the audio is sent as a file attachment instead of a voice bubble.
metadata:
  openclaw:
    emoji: "🎤"

Voice Message

Send text as voice messages to any chat channel.

Use scripts/gen_voice.sh to convert text to an ogg/opus file:

scripts/gen_voice.sh "你好" /tmp/voice.ogg
scripts/gen_voice.sh "Hello" /tmp/voice.ogg en-US-JennyNeural

Arguments: <text> <output.ogg> [voice]
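
The script's internals are not shown here, so the following is only a minimal Python sketch of the pipeline described above: edge-tts synthesizes speech to an MP3, then ffmpeg transcodes it to Opus in an Ogg container. The function names, the intermediate MP3 file, the ffmpeg options, and the default voice (`zh-CN-XiaoxiaoNeural`) are assumptions for illustration, not the script's actual contents.

```python
import subprocess
import tempfile
from pathlib import Path

# Assumption: the script's real default voice is not documented in this skill.
DEFAULT_VOICE = "zh-CN-XiaoxiaoNeural"


def build_commands(text, mp3_path, ogg_path, voice=DEFAULT_VOICE):
    """Return two argument lists: edge-tts synthesis, then ffmpeg transcode."""
    tts = ["edge-tts", "--voice", voice, "--text", text,
           "--write-media", str(mp3_path)]
    # Mono Opus in an Ogg container; 48 kHz is the native Opus sample rate.
    convert = ["ffmpeg", "-y", "-i", str(mp3_path),
               "-c:a", "libopus", "-ac", "1", "-ar", "48000", str(ogg_path)]
    return tts, convert


def gen_voice(text, ogg_path, voice=DEFAULT_VOICE):
    """Synthesize text to an ogg/opus file. Needs edge-tts and ffmpeg on PATH."""
    with tempfile.TemporaryDirectory() as tmp:
        mp3_path = Path(tmp) / "speech.mp3"
        for cmd in build_commands(text, mp3_path, ogg_path, voice):
            subprocess.run(cmd, check=True)  # raises if either tool fails
    return Path(ogg_path)
```

Under these assumptions, `gen_voice("Hello", "/tmp/voice.ogg", "en-US-JennyNeural")` would mirror the second command line above.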

Use the message tool directly:

action=send, asVoice=true, filePath=/tmp/voice.ogg

This works on most channels; Telegram is confirmed working.

⚠️ Feishu does NOT support asVoice=true via the message tool. You must use the dedicated script.

Use scripts/send_feishu_voice.sh:

scripts/send_feishu_voice.sh /tmp/voice.ogg <receive_id> <tenant_access_token> [receive_id_type]

receive_id_type: open_id (default), chat_id, user_id, union_id, email
The script uploads the file (as opus, with its duration) and then sends it with the audio message type, which produces a voice bubble.
To get tenant_access_token, use the Feishu tenant token API with your app credentials.
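
For orientation, here is a hedged Python sketch of two of the requests involved: fetching a tenant_access_token and sending an audio message once a file has been uploaded. The endpoint paths and field names reflect the Feishu Open Platform API as commonly documented and should be checked against the current reference. The helper names are hypothetical, and the multipart upload step that yields `file_key` is left to the script.

```python
import json
import urllib.request

BASE = "https://open.feishu.cn/open-apis"  # assumed Feishu API base URL
JSON_TYPE = "application/json; charset=utf-8"


def token_request(app_id, app_secret):
    """Build the request that exchanges app credentials for a tenant token."""
    body = json.dumps({"app_id": app_id, "app_secret": app_secret}).encode("utf-8")
    return urllib.request.Request(
        f"{BASE}/auth/v3/tenant_access_token/internal",
        data=body,
        headers={"Content-Type": JSON_TYPE},
        method="POST",
    )


def audio_message_request(token, receive_id, file_key, receive_id_type="open_id"):
    """Build the request that sends an uploaded file as an audio message."""
    body = json.dumps({
        "receive_id": receive_id,
        "msg_type": "audio",
        # The content field is itself a JSON-encoded string.
        "content": json.dumps({"file_key": file_key}),
    }).encode("utf-8")
    return urllib.request.Request(
        f"{BASE}/im/v1/messages?receive_id_type={receive_id_type}",
        data=body,
        headers={"Authorization": f"Bearer {token}", "Content-Type": JSON_TYPE},
        method="POST",
    )


def send(request):
    """Perform the HTTP call and decode the JSON response."""
    with urllib.request.urlopen(request, timeout=10) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

Assuming the response shape is as documented, `send(token_request(app_id, app_secret))["tenant_access_token"]` would give the value to pass as the third argument of scripts/send_feishu_voice.sh.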

Discord voice messages require a waveform and special flags.

Generate ogg with scripts/gen_voice.sh
Generate waveform: python3 scripts/gen_waveform.py /tmp/voice.ogg
- Outputs JSON: {"duration_secs": 4.2, "waveform": "base64..."}
Send via the Discord API with flags set to 8192 (IS_VOICE_MESSAGE) and the waveform and duration_secs values in the attachment metadata.
- Missing waveform/duration causes error 50161.
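
The steps above can be sketched in Python as follows. This is an illustration under stated assumptions, not the contents of gen_waveform.py: it assumes the waveform is a base64 string of at most 256 one-byte amplitude points, and that the message body carries the metadata on an attachment with id "0". All function names and the `voice-message.ogg` filename are hypothetical.

```python
import base64
import json

IS_VOICE_MESSAGE = 8192  # 1 << 13


def encode_waveform(samples, max_points=256):
    """Reduce amplitudes in [0.0, 1.0] to at most max_points bytes, base64-encoded."""
    if not samples:
        return ""
    n = min(len(samples), max_points)
    points = bytearray()
    for i in range(n):
        # Take the peak of each equal-width bucket of samples.
        start = i * len(samples) // n
        end = max(start + 1, (i + 1) * len(samples) // n)
        peak = max(abs(s) for s in samples[start:end])
        points.append(min(255, round(peak * 255)))
    return base64.b64encode(bytes(points)).decode("ascii")


def build_voice_payload(duration_secs, waveform, filename="voice-message.ogg"):
    """Build the JSON message body that marks one attachment as a voice message."""
    return {
        "flags": IS_VOICE_MESSAGE,
        "attachments": [{
            "id": "0",
            "filename": filename,
            "duration_secs": duration_secs,
            "waveform": waveform,
        }],
    }


def payload_from_script_output(stdout_text):
    """Turn the JSON printed by scripts/gen_waveform.py into a message body."""
    meta = json.loads(stdout_text)
    return build_voice_payload(meta["duration_secs"], meta["waveform"])
```

The resulting dictionary would typically be serialized and sent together with the ogg file in a multipart request to the channel's messages endpoint; the exact upload mechanics are outside this sketch.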

If asVoice=true does not produce a voice bubble on a channel: