Full-resolution vision for LLMs. Tiles large images and captures web pages via Chrome CDP so vision models process every detail without downscaling. Generates interactive HTML tile previews. Supports
Optimize ecommerce product images for platforms like Taobao and Amazon by analyzing design elements and predicting click-through rate improvements.
Generate and edit images using the AceData Nano Banana API. Supports models like nano-banana-2, custom aspect ratios (default 16:9), and resolutions (default...
Generate clean table images from data. Perfect for Discord/Telegram where ASCII tables look broken. Supports dark/light mode, custom styling, and auto-sizing...
Find relevant images from Wikimedia Commons with direct download links. Quickly compare options to choose the best visual. Retrieve full-resolution files for your projects.
Using the uploaded product image of ${Product Name:MacBook Pro}, create an engaging lifestyle scene showing realistic usage in ${Lifestyle Scenario:Office}. Target visuals specifically for ${Audience
OpenAI GPT image generation/editing MCP server.
Resize images using ImageMagick (CLI). Entrypoint is a Bash script.
Extract text from images using Tesseract OCR
{ "image_analysis": { "environment": { "type": "Indoor", "location_type": "Bedroom or Living Area", "spatial_depth": "Reflected depth via mirror", "background_elements":
I want to make an ultra-realistic Minecraft character out of an image. The character should have all the characteristics of the person in the image, e.g. skin color and outfit; leave out the background intact th
Analyzes images to detect AI generation, extract metadata, identify artifacts, and perform content moderation using local and cloud-based tools.
Automatically upload images to Stardots.io cloud storage, manage files, and obtain secure access links using API authentication.
AI Background Removal - Remove background from images, create transparent PNG, one-click background remover for e-commerce product photos, portrait headshots...
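Xiaohongshu image generation skill - AI image generation for verticals such as home decor, food, and fashion outfits, with support for multiple generation methods and size/spec optimization.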
Still-to-video conversion guide: model selection, motion prompting, and camera movement. Covers Wan 2.5 i2v, Seedance, Fabric, Grok Video with when to use ea...
Polish a technical blog draft into a 1000–1200 word, 4-5 section zh-CN article, preserve technical terms/code, and generate consistent hero + per-section im...
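Feishu image-sending tool. Sends full-screen captures, region screenshots, and local image files to a Feishu workspace for quick sharing of screen content.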
Generates professional e-commerce product scene prompts (Midjourney/DALL-E/Stable Diffusion) with physics-based lighting, material-accurate rendering, and li...
--- name: agent-media description: AI UGC video production from the terminal using the `agent-media` CLI. homepage: https://github.com/gitroomhq/agent-media metadata: {"clawdbot":{"emoji":"🌎","requ
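Alibaba Cloud Bailian image generation, editing, and translation. Text-to-image: generates images from text, with support for complex text rendering. Image editing: single-image editing, multi-image fusion, style transfer, object addition and removal. Image translation
Generate images by calling a third-party image generation API. Use when the user needs to generate images through a custom provider's API.
Image understanding and analysis using the GLM vision MCP. Trigger conditions: (1) the user asks to analyze a picture, understand an image, or describe image content (2) objects, text, or scenes in an image need to be recognized (3) using GLM
Image analysis and recognition for local images, web images, videos, and files. Suited to OCR, object recognition, scene understanding, and similar tasks. This skill must be used whenever the user sends an image or asks for image analysis.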