Search

Professional Vision Statement for Transportation Company

Act as a Vision Strategy Expert. You are an experienced consultant in developing vision and mission statements for specialized transportation companies. Your task is to craft a professional vision sta

Senior Computer Vision

Computer vision engineering skill for object detection, image segmentation, and visual AI systems. Covers CNN and Vision Transformer architectures, YOLO/Fast...

❤️ 1 ⬇️ 1.4k

MoltShell Vision Engine

--- name: MoltShell Vision Engine description: Give your text-based OpenClaw agent the ability to see and describe images --- # 👁️ MoltShell Vision Engine Standard OpenClaw agents are **blind**

❤️ 1 ⬇️ 218

Computer Vision Expert

SOTA Computer Vision Expert (2026). Specialized in YOLO26, Segment Anything 3 (SAM 3), Vision Language Models, and real-time spatial analysis.

❤️ 1 ⬇️ 3.2k

Vision Sandbox

Agentic Vision via Gemini's native Code Execution sandbox. Use for spatial grounding, visual math, and UI auditing.

❤️ 1 ⬇️ 4.2k

Vision Bot

--- name: vision-bot description: Analyze images via URL or base64. Auto-detects mode: OCR, object counting, or full description. acceptLicenseTerms: true metadata: clawdbot: emoji: "👁️"

❤️ 0 ⬇️ 146

12-Month AI and Computer Vision Roadmap for Defense Applications

{ "role": "AI and Computer Vision Specialist Coach", "context": { "educational_background": "Graduating December 2026 with B.S. in Computer Engineering, minor in Robotics and Mandarin Chinese.

universal-pdf-vision-parser

Extract multilingual document content and language learning notes (French, German, Japanese, Spanish, etc.) from PDFs using multimodal vision (Qwen-VL-Max)....

❤️ 0 ⬇️ 197

Agent Vision Scraper

Dockerized AI-powered web scraper using Playwright with virtual display and vision-based captcha solving, no third-party captcha services needed.

❤️ 0 ⬇️ 210

Comprehensive Roadmap for AI and Computer Vision Specialization in Defense Systems

Act as a Career Development Coach specializing in AI and Computer Vision for Defense Systems. You are tasked with creating a detailed roadmap for an aspiring expert aiming to specialize in futuristic

🔌 MCP

ai-vision-mcp

📇 🏠 🍎 🪟 🐧 - Multimodal AI vision MCP server for image, video, and object detection analysis. Enables UI/UX evaluation, visual regression testing, and interface understanding using Google Gemini and Ve

universal-pdf-vision-parser

Extract multilingual document content and language learning notes (French, German, Japanese, Spanish, etc.) from PDFs using multimodal vision (Qwen-VL-Max)....

❤️ 0 ⬇️ 188

Vision Tagger

Tag and annotate images using Apple Vision framework (macOS only). Detects faces, bodies, hands, text (OCR), barcodes, objects, scene labels, and saliency re...

❤️ 0 ⬇️ 866

Future Vision

Write a compelling vision statement about where I see [project/work] going in the next 2-3 years and how sponsors can be part of that journey.

Vision

Provides local image analysis, OCR text extraction, object detection descriptions, image comparison, metadata reading, and format conversion.

🔌 MCP

Claude Vision

Analyze images from multiple angles to extract detailed insights or quick summaries. Describe visuals rapidly or dive deeper with iterative reasoning when you need thorough understanding. Get strategi

❤️ 0 ⬇️ 34

Peripheral Vision

Monitors adjacent systems, upstream dependencies, and downstream consumers for changes that could affect your current work — before they break it. Like biolo...

❤️ 0 ⬇️ 135

MoltShell Vision Engine

Give your text-based OpenClaw agent the ability to see and describe images

❤️ 1 ⬇️ 185

Vision-to-json

This is a request for a System Instruction (or "Meta-Prompt") that you can use to configure a Gemini Gem. This prompt is designed to force the model into a hyper-analytical mode where it prioritizes c

Trio Stream Vision

Analyze any YouTube livestream or RTSP camera feed using natural language — ask what's happening, detect specific events, or get periodic summaries. Powered...

❤️ 0 ⬇️ 48

Trio Vision

Turn any live camera into a smart camera — describe what to watch for in plain English, get alerts in your chat when it happens. Ask questions about any live...

❤️ 0 ⬇️ 61

uni-vision-engine

Automated high-quality video generation (text-to-video, image-to-video) via a local jimeng-api Docker service. Features native OpenClaw image interception, a...

❤️ 0 ⬇️ 104

MiniMax Vision Captcha