Automatically converts received voice messages to text via an external ASR service, supporting multiple audio formats and integrating with OpenClaw.
Give your agent the ability to speak to you real-time. Talk to your Claude! Ultra-fast TTS, text-to-speech, voice synthesis, audio output with ~90ms latency....
Pronunciation coaching with real voice analysis using Azure Speech Services. Analyzes audio files for phoneme-level accuracy, fluency, prosody, and intonatio...
ElevenLabs advanced TTS for converting text to speech, listing voices, and managing credits
Chat with any real person or fictional character in their own voice by automatically finding their speech online, extracting a clean reference sample, and ge...
Write, test, and iterate prompts for AI models with voice preservation, model-specific adaptation, and systematic failure analysis.
Give your agent eyes — capture screenshots, voice, and annotations from any screen, monitor, or device via MCP.
Auto-transcribe voice messages locally using faster-whisper with selectable Whisper models, no API key required.
AI audio generation powered by CellCog. Text-to-speech, voice synthesis, voiceovers, podcast audio, narration, music generation, background music, sound design. Professional audio creation with AI.
将素材转换为戏言系列葵井巫女子的说话风格。Triggers on "zaregoto", "character voice", "style conversion". 触发场景:(1) 用户提供素材并要求「用巫女子风格改写
Set up mlx-whisper as the local audio transcription engine for OpenClaw on Apple Silicon Macs (M1/M2/M3/M4). Automatically transcribes voice notes sent via T...
Voice conversation interface for OpenClaw using wake word detection, streaming LLM responses, and text-to-speech. Use when a user wants to talk to their Open...
Advanced AI voice assistant for phone calls. Capable of persuasion, sales, restaurant bookings, reminders, and notifications.
Read any web page aloud with natural AI voices. Extract article text from any URL and convert it to audio (MP3). Use when the user wants to: listen to a webp...
Configure an OpenClaw instance to use a local OpenAI-compatible TTS backend (for example openedai-speech) with cloned voices. Use when users ask to wire loca...
Full Signal messenger integration for OpenClaw agents. Send/receive text and voice messages via signal-cli with role-based permissions (owner/trusted/untrust...
--- name: aliyun-asr description: "Pure Aliyun ASR skill for voice message transcription, supports multiple channels including Feishu" metadata: { "openclaw": { "emoji": "🎙️",
Transcribe audio files to text using Telnyx Speech-to-Text API. Use when you need to convert audio recordings, voice messages, or spoken content to text.
Explain anything — turn ideas into podcasts, explainer videos, or voice narration. Use when the user wants to "make a podcast", "create an explainer video",...
Create high-end cinematic scene prompts and production-ready scene briefs in a Hollywood producer voice. Use when the user asks for movie scene generation, s...
Make outbound phone calls via ElevenLabs voice agent and Twilio
Local speech-to-text using Vosk. Lightweight, fast, fully offline. Perfect for transcribing Telegram voice messages, audio files, or any speech-to-text task without cloud APIs.
Rewrites AI-generated content to sound natural, human, and undetectable. Removes robotic patterns, adds voice variety, and preserves meaning.