Controls and monitors ROS 2 robots directly via rclpy CLI. Use for ANY ROS 2 robot task: topics (subscribe, publish, capture images, find by type), services...
Intelligent multi-model router — automatically selects the best AI model based on task type (vision, image generation, video generation, audio, reasoning, co...
AI agent skill for x402 paid APIs with live image-hosting and qrcode-generate, plus planned image/video/vision APIs (colorize, super-resolution, enhance, sma...
Content moderation and safety checks. Instantly classify text or images as safe or unsafe using AI guardrails.
AI tool that creates viral fitness recipe videos with AI-generated images, voiceovers, renders, and optional TikTok auto-posting.
Remove watermarks, logos, captions, and text overlays from images and videos using WaveSpeed AI. Intelligently detects and removes watermarks while preservin...
AI image, video, and music generation + editing via VAP API. Flux, Veo 3.1, Suno V5.
End-to-end pipeline for creating faceless Islamic story TikTok videos. Orchestrates multiple specialized agents: story research, scriptwriting, image generat...
Perform privacy-focused web searches via DDGS metasearch for webpages, news, images, videos, or books with no tracking and no API key required.
Swiss-knife for AI agents. 50+ models for image generation, video generation, text-to-speech, speech-to-text, music, chat, web search, document parsing, emai...
Creates TikTok image carousels (slideshows with text overlays on photos) via the ViralBaby API. Use when the user wants to: create TikTok slideshows or carou...
Search the web for information, find current content, and look up news articles. And also 50+ models for image generation, video generation, text-to-speech,...
Generate QR codes from text, URLs, or images. Use when users ask to 'generate QR code', 'create QR', or 'make QR code for'. Supports text content, URLs, and...
Generate, vectorize, upscale, replace background, variate, remove background, and transform images via Recraft API.
Use ConvertAgent for file format conversions through the local CLI. Trigger for any request to convert files (documents, images, audio, video, spreadsheets,...
Extract text from PDFs using Google Gemini OCR. Use when extracting text from PDFs, performing OCR on scanned documents, or processing image-based PDFs.
Parse documents using PaddleOCR's API. Supports both sync and async modes for images and PDFs.
Generate and decode QR codes locally. Use when the user wants to create a QR code from text/URL, decode/read QR code content from an image, or asks about QR...
Automatically solves image, ReCaptcha V2/V3, and HCaptcha challenges via XEvil server using Base64 uploads or sitekey proxying.
Image generation with IMA Open API using only Nano Banana series: Nano Banana, Nano Banana Pro, Nano Banana 2. Budget (Nano Banana2 512px), balanced (Nano Ba...
Generate logos with AI image tools using effective prompt structures, validation loops, and export workflows for App Store icons and brand marks.
Automatically publish notes to Xiaohongshu (小红书) creator center. Generates cover images (PIL poster, multiple styles), writes content from templates, and pub...