Advanced desktop automation with mouse, keyboard, and screen control. Also includes 50+ models for image generation, video generation, text-to-speech, speech-to-t...
Provides high-accuracy speech-to-text conversion supporting 22 Chinese dialects and 30 languages with automatic language detection, running on CPU.
Automate web browser interactions using natural language via CLI commands. Also includes 50+ models for image generation, video generation, text-to-speech, speech...
Top-tier AI music generation with Suno sonic v4, Suno sonic v5, DouBao BGM (GenBGM), and DouBao Song (GenSong). One-stop text-to-music with custom mode,...
Enables real-time text messaging and broadcasting between multiple OpenClaw nodes across machines for coordinated fleet operations.
Generate short sound effects via ElevenLabs SFX (text-to-sound). Use when you need SFX clips like applause, canned laughter, whooshes, ambience, or short stingers, and optionally convert to WhatsApp-f
Install and use whisper.cpp (local, free/offline speech-to-text) with OpenClaw. Supports downloading different ggml model sizes (tiny/base/small/medium/large...
Sends emails with support for Markdown, HTML text, and attachments, using the existing SMTP configuration in `/home/bb/.openclaw/smtp-config.json`. Includes retry logic and logging.
Control Discord from Clawdbot: send messages, react, post stickers, upload emojis, and more. Also includes 50+ models for image generation, video generation, text...
Headless browser automation CLI optimized for AI agents with accessibility tree snapshots. Also includes 50+ models for image generation, video generation, text-t...
Publish videos, photos, and text to all your social channels from one place. Schedule and manage posts at scale with background processing and easy status tracking. Track performance with unified anal
Use the ClawdHub CLI to search, install, update, and publish agent skills. Also includes 50+ models for image generation, video generation, text-to-speech, speech...
Comprehensive PDF manipulation toolkit for extracting text; creating, merging, and splitting documents; and handling forms. Also includes 50+ models for image generat...
Fetch text-based real-time stock prices (KRX and overseas) using Naver Finance.
NLP toolkit for text analysis including sentiment detection, keyword extraction, language detection, summarization, tokenization, and entity recognition.
Generate QR codes for URLs, text, WiFi credentials, contact cards, and more. Use when creating scannable links for marketing materials, sharing WiFi password...
Converts text to natural speech using ElevenLabs for clinical and healthcare use cases. Use when generating patient instructions, discharge summaries, medica...
Connect AI agents to the 2020117 decentralized network. Register, post to timeline, trade compute via NIP-90 DVM jobs (text generation, translation, summariz...
Generate AI videos using Vidu, featuring text-to-video, image-to-video, reference-to-video, and start-end-to-video with up to 1080p resolution, anime style...
Generate retro robotic speech audio using SAM (Software Automatic Mouth), the classic C64 text-to-speech synthesizer. Use with the /sam command to generate voice messages. Supports /sam on/off toggle mode
Extract clean, plain-text transcripts from YouTube videos using a dual fallback system with Supadata API and yt-dlp for fast, accurate results.
Read any web page aloud with natural AI voices. Extract article text from any URL and convert it to audio (MP3). Use when the user wants to: listen to a webp...
Search and retrieve Magic: The Gathering card data using the Scryfall API. Use this skill when the user asks about MTG cards, wants to search for cards by name, type, color, mana cost, oracle text, se
Generate high-quality images from text descriptions.