📇 🏠 🍎 🪟 🐧 - Multimodal AI vision MCP server for image, video, and object detection analysis. Enables UI/UX evaluation, visual regression testing, and interface understanding using Google Gemini and Ve
Analyze images from multiple angles to extract detailed insights or quick summaries. Describe visuals rapidly or dive deeper with iterative reasoning when you need thorough understanding. Get strategi
Full-resolution vision for LLMs. Tiles large images and captures web pages via Chrome CDP so vision models process every detail without downscaling. Generates interactive HTML tile previews. Supports
Automate web browsing with fast, reliable actions driven by structured page snapshots. Click, type, navigate, manage tabs, and extract content without screenshots or vision models. Get deterministic r
Tool to prevent AI tunnel vision in critical workflows. Vibe Check MCP v2.7 introduces Chain-Pattern Interrupts (CPI) to enhance your infrastructure stack. Mitigates over-engineering, scope creep, an
Turn any receipt into structured, accounting-ready JSON or clean Markdown with one API call. AI-powered vision extracts merchant, date, line items, tax breakdown, totals, currency, and confi
MCP server for xAI's [Grok API](https://docs.x.ai/docs/overview) with agentic tool calling, image generation, vision, and file support.
Web design analysis MCP server with 26 tools for layout extraction, motion detection, quality scoring, and semantic search. Uses Playwright, pgvector HNSW, and Ollama Vision to turn web pages into sea
Use HuggingFace Spaces directly from Claude. Run open-source image generation, chat, vision tasks, and more. Supports image, audio, and text uploads/downloads.
Fast screenshot capture tool optimized for Claude Vision API. Automatically tiles full pages into 1072x1072 chunks for optimal AI processing with configurable viewports and wait strategies for dynamic
Real-time screen analysis, context-aware recording, and UI monitoring MCP server. Supports AI vision, event hooks, and multimodal agent workflows.
Desktop GUI automation using accessibility APIs. Control Windows, macOS, and Linux applications without vision models or screenshots. Supports workflow recording, structured data extraction, and brows
# Vulcan File Ops MCP Server
Visualize tabular data as polished charts in seconds. Personalize themes and layout, then render bar, line, pie, and more—with smart suggestions for field mapping. Follow a guided workflow to optimi
Discover and compare models and providers with up-to-date pricing, limits, modalities, and capabilities. Search and filter by features like reasoning, tool calling, and context length, then fetch deta
Driven Intent Negotiation — Contract-Oriented Deterministic Executable Runtime. Important: **Using Claude Code?** → Install the [Plugin](#-claude-code-plugin-recommended-for-claude-code) (eas
Automate complex web browsing tasks like navigation, form filling, and deep research across multiple tabs. Extract structured data and visual insights from any webpage instantly. Streamline digital wo
Enable seamless integration of language models with external data sources and tools through a standardized protocol. Facilitate dynamic access to files, APIs, and custom operations to enhance AI capab
Leverage Anchor Browser's infrastructure for scalable, geo-targeted, and anti-detection browser automation without local dependencies. Simplify browser automation with fast, structured data access and