Text-to-speech conversion using Python edge-tts for generating audio from text. Supports multiple voices, languages, speed adjustment, pitch control, and sub...
视频转文章 / YouTube Video to Article — 使用 Gemini AI 将视频转为结构化文章。当用户需要将 YouTube 视频转换为文章时使用。
Automated Suno AI Music Generation - Create professional songs without manual intervention. Headless browser automation for servers with Gemini 3.1 Pro integ...
Search, analyze, and summarize peer-reviewed academic papers from open access sources. Provides credibility scoring, visualization, timeline generation, and...
Generate and edit AI images using ByteDance's Seedream — featuring text-to-image, image editing, and batch sequential generation with up to 4K resolution, PN...
AI-powered multilingual virtual salesperson for 24/7 global customer reception, client retention, real-time boss alerts, and continuous sales optimization.
Google Maps integration for OpenClaw with Routes API. Use for: (1) Distance/travel time calculations with traffic prediction, (2) Turn-by-turn directions, (3...
Advanced SEO content creation with multi-language support, content refresh, SEO scoring, and competitor analysis. Perfect for content creators and agencies.
AI image and video generation via Vydra.ai API. Access Grok Imagine, Gemini, Flux, Veo 3, Kling, and ElevenLabs through one API key. Agents can self-register and generate images automatically.
Ingest pasted PC parts purchase/config text (Discord message receipts, bullet lists) into Notion DIY_PC tables (PCConfig, ストレージ, エンクロージャー, PCInput). Use when...
Extract plain-text transcripts from YouTube videos using a local Python script. Use when the user wants to fetch, extract, or get a transcript from a YouTube...
Intelligent memory management system v3.4.1 - OpenClaw 2026.3.7 Pluggable Context Engine integration, Semantic Cache Layer, Dynamic Context Window, Multi-Con...
Proactive Chinese language tutor that delivers curated, real-world Mandarin learning content on a schedule. Use when: (1) User wants to learn or improve Chin...
Generates professional infographics with 21 layout types and 20 visual styles. Analyzes content, recommends layout×style combinations, and generates publicat...
AI-powered translation services using TranslateFlow API - translation, translate text, language conversion, multilingual translation, language translation, d...
Take a 60-question survey rating your behavioral tendencies from -3 to 3 to calculate and receive your detailed MBTI personality type and result URL.
Manage social media accounts for multiple clients safely with proxy isolation. Prevent cascading bans, schedule content across platforms, and handle 10-100+...
Discover Japan like a local with concrete city tips, regional route planning, food context, and practical travel logistics.
Monitor a curated list of AI/tech Twitter accounts, summarize the day's key posts using an LLM, and deliver a formatted digest to a Discord channel. Use when...
Download, transcribe, and analyze videos from YouTube, X/Twitter, and TikTok with local Whisper processing. Perfect for extracting TL;DRs, timestamps, and ac...
Query and create field observations and AI-processed captures. Photos, voice notes, and text notes from the field.
Extract text and structured data from documents using Azure Document Intelligence (formerly Form Recognizer). Supports OCR for PDFs, images, scanned document...
Download audio from a GETTR post or streaming page and transcribe it locally with MLX Whisper on Apple Silicon (with timestamps via VTT). Use when given a GE...
Summarize any URL or text using Kagi's Universal Summarizer API. Supports multiple engines (including the enterprise-grade Muriel model), bullet-point takeaw...