Extract text from images using Tesseract.js OCR. Supports Chinese (simplified/traditional) and English.
AI-powered bid/tender document review. Extracts text from .docx/.doc files, cross-references bid requirements vs responses, and generates a detailed audit re...
Extract audio from Douyin (抖音/TikTok China) videos and transcribe to text using Whisper. Trigger when user sends a Douyin link (v.douyin.com or www.douyin.co...
Hash generator and file integrity verifier. Generate MD5, SHA1, SHA256, SHA512 hashes for text and files, verify file integrity against expected hashes, comp...
Analyze text for emotional tone, opinion polarity, subjectivity, intensity, and specific emotions. Returns per-sentence breakdown with sarcasm detection, urg...
Translates PDF documents to Chinese with professional typography. Extracts text, translates section-by-section into well-structured Markdown, then generates...
Generate complete Xiaohongshu (Little Red Book) posts with up to 10 pages (3:4 vertical format). Auto-parses text content into cover + content pages. Support...
Convert text to speech audio via ComfyUI's Qwen-TTS API, supporting customizable voice, style, model, and output options.
Generate videos using local AI models (ComfyUI/Stable Video Diffusion) and auto-publish to social media platforms. Supports text-to-video, image-to-video, ba...
Give your text-based OpenClaw agent the ability to see and describe images.
Translate text into multiple languages while maintaining specific tones or styles. Communicate across language barriers accurately and efficiently. Enhance global reach by converting content for diver
Scan QR codes and go! No more troublesome autos or APIs! Send text messages, images, links, locations, and contact cards to WhatsApp chats and groups. Automate notifications, alerts, and outreach from
Give your agent the ability to speak to you in real time. Talk to your Claude! Ultra-fast TTS, text-to-speech, voice synthesis, audio output with ~90ms latency....
Generate AI-optimized Alt Text, file names, captions, and Schema markup for images, videos, and audio assets. Improves AI discoverability on Google Lens, Cha...
Generate speech from text using Kyutai Pocket TTS - lightweight, CPU-friendly, streaming TTS with voice cloning. English only. ~6x real-time on M4 MacBook Air.
AI-powered scam and threat verification MCP server. Check phone numbers, URLs, text messages, and emails against aggregated threat intelligence from government complaint databases, malicious URL feeds
Transform any input into a deliberately degraded, semantically corrupted, non-explanatory experimental text stream. Use when the user invokes "巴别渊" or "劣化",...
Generate and edit images using TensorsLab's AI models. Supports text-to-image, image-to-image generation, plus advanced editing: avatar generation, watermark...
Read any web page aloud with natural AI voices. Extract article text from any URL and convert it to audio (MP3). Use when the user wants to: listen to a webp...
Fetch handwritten notes, sketches, and drawings from a reMarkable tablet via Cloud API (rmapi). Process content by refining artwork with AI image generation, extracting handwritten text to memory/jour
Transcribe YouTube videos to text by extracting captions and subtitles directly from the video URL using yt-dlp without audio processing.
Save travel inspiration from YouTube videos, Instagram reels, blog URLs, tweets, or plain text like "want to visit Hampi". Extracts destination, auto-tags by...
Formats text according to specified style guidelines. A clean example skill with no security issues.
Ingest pasted PC parts purchase/config text (Discord message receipts, bullet lists) into Notion DIY_PC tables (PCConfig, ストレージ, エンクロージャー, PCInput). Use when the user pastes