{ "meta": { "source_image": "user_provided_image", "analysis_timestamp": "2024-07-30T12:00:00Z", "analysis_model": "image_to_json_v1.0", "overall_confidence": 0.99 }, "camera_and
Act as a professional image creator. You are an expert in generating high-quality, impactful images suitable for printing and sales. Your task is to: - Create visually stunning images that are ready
This skill helps users automatically extract structured image data from Google Images via the BrowserAct API. The agent should proactively apply this skill when the user...
Image processing tool for compression, background removal/replacement, and upscaling. Invoke when user wants to compress image, remove background, change bac...
Generate or edit images with Gemini using the Google GenAI SDK. Use when the user asks to create, transform, render, or save one or more images in an OpenCla...
Detect and remove AI fingerprints from AI-generated images. Strip metadata, add film grain, recompress, and bypass AI image detectors. Works with Midjourney,...
Generate images from text prompts using AI models via OpenRouter or Kie.ai. Use when the user asks to generate, create, draw, or illustrate an image.
Generate images using Google Gemini via OpenRouter API. Supports text-to-image and reference-image-guided generation. Use when the user asks to generate, cre...
Visual image search using Google Lens via SerpAPI. Identify objects, landmarks, products, plants, animals, artwork, logos, or any visual entity from an image...
Audit Amazon product listing images for non-square dimensions, auto-pad them to 2000×2000 white background, and push corrected images to live listings via SP...
{ "model": "nano-banana", "task": "image_to_image_product_transformation", "objective": "Transform the provided clothing product image into a luxury studio ghost-mannequin presentation where th
Generate images via fal.ai and BytePlus Seedream APIs. Supports single image, batch parallel, and reference-guided generation. Use when you need to generate...
Generate, edit, and compose images using Gemini models. Activate when user asks to generate images, draw, create logos/posters/icons/banners, edit/modify pho...
AI image and video generation via Vydra.ai API. Access Grok Imagine, Gemini, Flux, Veo 3, Kling, and ElevenLabs through one API key. Agents can self-register and generate images automatically.
Render text into an image and return a temporary local image file path, with optional data URI. Use when Clawhub or Codex needs to convert plain text, styled...
Generate images with Seedream4.5 and videos with Kling via LiblibAI API. Use when user asks to generate/create images, pictures, illustrations, or videos using LiblibAI, Seedream, or Kling models.
Search for images using the Brave Search API. Use when you need to find images, pictures, photos, or visual content on any topic. Requires the BRAVE_API_KEY environment variable.
Detect and filter out low-quality images by analyzing blur, brightness, and resolution to clean up image datasets efficiently.
Generate images from text prompts using FLUX via Together.ai. Returns an image URL. Prompts are auto-enhanced for best results.
{ "model": "nano-banana", "task": "image_to_image_product_enhancement", "objective": "Transform the input product image into a professional commercial studio photograph while preserving the exac
Generate images from text with a free-quota-first multi-provider workflow. Use this skill when a user asks for text-to-image generation that needs provider r...
Generate ultra-realistic images and Instagram content using Gemini 2.0 Flash Experimental. Use when creating photorealistic images, social media content, or...
AI-powered image and video generation. Generate images, videos, manage jobs, and explore models via the masonry CLI.