Generate videos or images from text, images, or references, create and edit material elements, submit and query asynchronous video generation tasks via bundl...
Generate videos from text using Text to Video (Wan2.1) API. Automatically generates 4-second videos from text descriptions. Powered by Dreamface - AI tools f...
Render text into an image and return a temporary local image file path, with optional data URI. Use when Clawhub or Codex needs to convert plain text, styled...
--- name: paddleocr-text-recognition description: Extracts text (with locations) from images and PDF documents using PaddleOCR. metadata: openclaw: requires: env: - PADDLEOCR_OCR_A
Fix garbled text in PDF/SVG vector graphics for final editing in AI. Detect, replace and repair garbled text in vector graphic files while maintaining origin...
Generate speech audio from text using HeyGen's Starfish TTS model. Use when: (1) Generating standalone speech audio files from text, (2) Converting text to s...
SenseAudio Text-to-Speech (TTS) API for converting text to natural speech. Supports synchronous and SSE streaming modes, multiple voices, emotion control, sp...
Remove signs of AI-generated writing from text. Use when editing or reviewing text to make it sound more natural and human-written. Based on Wikipedia's comp...
--- name: text-transform description: Text transformation, regex, diff, format conversion, and JSON manipulation. version: 1.0.1 metadata: openclaw: emoji: "📝" homepage: https://text.agen
Humanize AI-generated text by removing telltale AI writing patterns. Use when text needs to sound natural and human-written — removing em-dashes, AI filler p...
Local speech-to-text using OpenAI Whisper. Use when the user needs to: (1) transcribe audio files to text, (2) convert voice messages to written content, (3)...
Install and use the speechall CLI tool for speech-to-text transcription. Use when the user wants to: (1) transcribe audio or video files to text, (2) install speechall on macOS or Linux, (3) list avai
Generate background music from text description. Use when users request text_to_music operations or related tasks.
Generate sound effects from text description. Use when users request text_to_sound_effect operations or related tasks.
Generate images from text with a free-quota-first multi-provider workflow. Use this skill when a user asks for text-to-image generation that needs provider r...
Adds TikTok-style text overlays to images and videos with styled fonts, backgrounds, strokes, and timed animations.
Remove AI-like features from text to make it more natural and human-like (去除AI生成文本的AI特征,使文本更自然、更接近人类写作)
Generate, remix, and edit images using fal.ai's AI models. Supports text-to-image generation, image-to-image remixing, and targeted inpainting/editing.
Text generation and chat completion on Volcengine ARK. Use when users need long-form writing, summarization, extraction, rewriting, Q&A, or prompt optimization with ARK text models.
Extract text from PDFs with OCR support. Perfect for digitizing documents, processing invoices, or analyzing content. Zero dependencies required.
Unified speech-to-text skill. Use when the user asks to transcribe audio or video, generate subtitles, identify speakers, translate speech, search transcript...
Text-to-speech conversion using GLM-TTS service via the `uvx zai-tts` command for generating audio from text. Use when (1) User requests audio/voice output w...
Convert text to natural speech with DIA TTS, Kokoro, Chatterbox, and more via inference.sh CLI. Models: DIA TTS (conversational), Kokoro TTS, Chatterbox, Hig...