Local speech-to-text using faster-whisper. High-performance transcription with GPU acceleration support. Includes word-level timestamps and distilled models....
Convert Markdown text to beautiful Xiaohongshu (XHS) style card images with 5 themes, deterministic browser screenshot rules, auto-pagination, smart title ex...
Core GOYFILES external-bot contract (endpoint-first onboarding, strict agent behavior, direct tool mode, and text-fetch contract).
A precision utility to generate QR code images from URLs or text using Python.
--- name: local-llama-tts description: Local text-to-speech using llama-tts (llama.cpp) and OuteTTS-1.0-0.6B model. metadata: { "openclaw": { "emoji": "🔊", "requires": {
Browser automation for AI agents using PinchTab. Control Chrome programmatically for testing, scraping, and interaction. Features token-efficient text extrac...
Detect hardcoded secrets like API keys, tokens, and passwords in text or code using Expanso Edge pipelines.
Parse TradingView signals from text or URLs to extract trade details and calculate risk/reward ratios for structured trade analysis.
Playful virtual girlfriend voice companion. Use when the user wants short, flirty, friendly text replies returned as Bulbul v3 audio across chat channels (Discord/Telegram/WhatsApp). Generate a brief
Fetch text-based real-time stock prices (KRX, Overseas) using Naver Finance.
AI-powered presentation generation using 2slides API. Create slides from text content, match reference image styles, or summarize documents into presentations. Use when users request to "create a pres
Install and use whisper.cpp (local, free/offline speech-to-text) with OpenClaw. Supports downloading different ggml model sizes (tiny/base/small/medium/large...
Build full-stack web and mobile apps from a text description. Creates projects, plans features, and starts Quick Start builds for background execution.
Generate and produce optimized video ads for Facebook, Instagram, and YouTube from text briefs using InVideo AI, including scripts, voiceovers, captions, and...
WeChat Official Account Draft Box management tool. Create and manage graphic draft articles via WeChat API, supporting text and images. Automatically extract...
Local search and indexing CLI (BM25 + vectors + rerank) with MCP mode. And also 50+ models for image generation, video generation, text-to-speech, speech-to-...
AI video generation — Sora, Kling, Veo 3, Seedance, Hailuo, WAN, Grok. Text-to-video, image-to-video, video editing. 37 models, one API key.
Automates uploading multiple sources (files, URLs, YouTube, Drive, text) to a NotebookLM notebook, generating a deep dive audio overview in a preferred langu...
Write effective prompts for Jimeng Seedance 2.0 multimodal AI video generation. Use when users want to create video prompts using text, images, videos, and a...
Repurpose any blog post or article into multiple social media formats. Input a URL or text, get X/Twitter thread, LinkedIn post, Instagram caption, email sni...
Use Jimeng AI 4.0 (Volcengine) to generate images from text or image references, and optionally send results to Feishu.
Unified persistent text-based environment for AI agents. Use when an agent needs to maintain state across sessions, perform structural code editing, or manag...
Generate videos using Alibaba Cloud DashScope Wan (通义万相) text-to-video (t2v) API (e.g., wan2.6-t2v). Use when the user asks to create a short video from a te...
Automatically solve Moltbook verification challenges (math problems) when posting. Parses obfuscated number text and calculates answers.