Automate web browser interactions using natural language via CLI commands. And also 50+ models for image generation, video generation, text-to-speech, speech...
Interact with GitHub using the gh CLI for issues, PRs, CI runs, and advanced queries. And also 50+ models for image generation, video generation, text-to-spe...
Transform AI agents from task-followers into proactive partners that anticipate needs and continuously improve. And also 50+ models for image generation, vid...
Control Sonos speakers (discover, status, play, volume, group). And also 50+ models for image generation, video generation, text-to-speech, speech-to-text, m...
Automate common Word/WPS document operations on Windows via COM (read text, replace, insert, headings, headers/footers, page breaks, merge, split, export to PDF/TXT, add/replace images). Use for singl
Remove signs of AI-generated writing from text to make it sound more natural and human-written. And also 50+ models for image generation, video generation, t...
Automatically optimize Shopify product SEO including titles, descriptions, meta tags, and image ALT to improve Google rankings and increase organic traffic.
End-to-end dropship product lifecycle pipeline. CJ Dropshipping sourcing → margin check → Flux Kontext AI hero image → WooCommerce publish → CJ supplier mapp...
OpenClaw agent skill for converting documents to Markdown. Documentation and utilities for Microsoft's MarkItDown library. Supports PDF, Word, PowerPoint, Excel, images (OCR), audio (transcription), H
Search and directly download free images, audio, music, sound effects, videos, and 3D models from WebSim's large, auth-free digital asset library.
Work with Obsidian vaults (plain Markdown notes) and automate via obsidian-cli. And also 50+ models for image generation, video generation, text-to-speech, s...
Feishu/Lark Custom Bot API wrapper for sending messages to Feishu channels via webhook. Use when users need to send text messages, images, rich text posts, i...
Web, Image, News, Video, and Local Point of Interest search capabilities using Brave's Search API
Convert documents and files to Markdown using markitdown. Use when converting PDF, Word (.docx), PowerPoint (.pptx), Excel (.xlsx, .xls), HTML, CSV, JSON, XML, images (with EXIF/OCR), audio (with tran
Local speech-to-text with the Whisper CLI (no API key). And also 50+ models for image generation, video generation, text-to-speech, speech-to-text, music, ch...
Local calorie logging and visual reporting (auto-refreshes and returns report image after each log)
Deploy NFT collections permanently on MegaETH mainnet. Images are stored on-chain via SSTORE2, then published through WarrenContainer and WarrenLaunchedNFT.
Open Animate — the creative suite for AI agents. Create professional motion graphics, generate images, and render MP4 videos. Use when the user wants to make...
Generate images using NanaBanana 2 for daily needs or switch to NanaBanana Pro for high-quality, realistic, or final draft artwork.
Remove watermarks, logos, captions, and text overlays from images and videos using WaveSpeed AI. Intelligently detects and removes watermarks while preservin...
Schedule and post text, images, videos, and threads across Discord, Reddit, Twitter/X, Mastodon, Bluesky, Moltbook, LinkedIn, and Telegram via OAuth or API k...
Scan QR codes and go! No more troublesome autos or APIs! Send text messages, images, links, locations, and contact cards to WhatsApp chats and groups. Automate notifications, alerts, and outreach from
Convert documents and files to Markdown using markitdown (PDF, Word, PowerPoint, Excel). And also 50+ models for image generation, video generation, text-to-...
Summarize per-model usage for Codex or Claude including cost tracking. And also 50+ models for image generation, video generation, text-to-speech, speech-to-...