Generate a lip-sync video from an image and the user's own audio recording. ✅ USE WHEN: - User provides their OWN audio file (voice recording) - User wants to sync an image to specific audio/voice - User recorded the
A tool for exploring each layer in a Docker image. Keywords: container-inspect, go, cli, docker, docker-image, explorer, inspector. Use when you need container-inspect...
Summarize URLs or files with the summarize CLI (web, PDFs, images, audio, YouTube). And also 50+ models for image generation, video generation, text-to-speec...
Get current weather for any city and create images from your prompts. Streamline planning, reports, and storytelling by combining quick data lookups with visual creation. Receive shareable image links
Use the Meshy.ai REST API to generate assets: (1) text-to-2d (Meshy Text to Image) and (2) image-to-3d, then download outputs locally. Use when the user wants Meshy generations, needs polling async ta
An image-first social feed for OpenClaw bots. Create, post, comment, like, and follow AI generated images.
Generate or edit images via Gemini 3 Pro Image (Nano Banana Pro).
Swap faces in images and videos using WaveSpeed AI. Supports image face swap and video face swap with multi-face targeting. Produces watermark-free results w...
Generate videos using OpenAI's Sora API. Use when the user asks to generate, create, or make videos from text prompts or reference images. Supports image-to-video generation with automatic resizing.
AI media generation via deAPI. Transcribe YouTube/audio/video, generate images from text, text-to-speech, OCR, remove backgrounds, upscale images, create vid...
Provides local image analysis, OCR text extraction, object detection descriptions, image comparison, metadata reading, and format conversion.
Use PoYo AI Nano Banana for general image generation and image editing across many aspect ratios through the `https://api.poyo.ai/api/generate/submit` endpoi...
An MCP server that lets you examine image metadata such as EXIF, XMP, JFIF, and GPS. This provides a foundation for LLM-powered search and analysis of photo libraries and image collections.
Extract text, tables, and images from PDFs or images using Mistral OCR API and output in Markdown, JSON, or HTML formats.
Generate diverse, non-repetitive image prompts powered by real visual references from Dribbble and design platforms. USE WHEN: user wants an image prompt, ne...
Based on the source image, overlay an architect's busy working process onto the entire scene. The image should look like a blueprint or trace paper covering the original photo, filled with handwritten
Take the input image, use its face, and apply it to an image of Ash the Pokémon master with his favorite character, Pikachu.
Unified API access to multiple AI models via kie.ai - image generation (Nano Banana Pro, Flux, 4o-image) at 30-80% lower cost than official APIs. Includes lo...
Instagram-style image network for AI agents. Post images, like, comment, and browse feeds.
Reasoning-driven image generation using structured creative briefs (Gemini 3 style) — generates high-fidelity images via muapi.ai with logic-based prompting
Create and edit icons, favicons, images, and color palettes using macOS tools, ImageMagick, SVG, and AI image generation.
Act as an advanced image generation model. Your task is to create an image of a young woman taking a mirror selfie with meticulous face preservation. FACE PRESERVATION: - Use the reference face to ma
Transform natural language image requests into optimized structured prompts for Gemini image generation. Automatically detects style and builds the perfect p...
Local image generation using Apple MLX via mflux — FLUX.2 Klein 4B (fast, Apache 2.0) and Z-Image Turbo (quality) models