Download videos, images, and audio without watermarks from 999+ platforms (TikTok, YouTube, Instagram, Twitter, Bilibili, Sora2, etc.) using the MeowLoad API...
Extract text and structured data from documents using Azure Document Intelligence (formerly Form Recognizer). Supports OCR for PDFs, images, scanned document...
Connect to Prefy AI platform — Conductor API (19 models, OpenAI-compatible), server management, web search, image generation, phone calls (AutoCall). Use whe...
Automate posting to WeChat Moments on Windows desktop (open Moments window, trigger publish entry, select image, paste caption, click publish). Use when aske...
Deploy OHIF medical imaging viewer with Docker, configure DICOMweb sources, SSL, and integrate servers like tbidea, orthanc, and DCM4CHEE.
Optimize and generate text-to-image prompts for AI art platforms. Use when a user wants to: (1) Optimize prompts for Midjourney, Nano Banana, Dreamina, Qwen...
The Universal Imaging & Optical Sovereignty Protocol. A high-fidelity standard for the generation, authentication, and preservation of light-based and synthe...
--- name: doc-ocr-skills description: OCR documents (PDFs and images) using Gemini 2.5 Flash, PaddleOCR (local), or RapidOCR (local). --- # Document OCR Skill (docr) Uses **Gemini 2.5 Flash**, **Pad
Generate expressive talking-head videos from static images using Kameo AI. Converts static avatars/portraits into dynamic 5-second videos with realistic facial expressions, lip-sync, and motion. Use w
Generate QR codes from text, URLs, or data for payments, WiFi sharing, or any task needing a scannable QR code image.
Fetch complete Amazon product data including title, current price, currency, star rating, review count, availability, main image, and product URL. Works from...
Generate polished 1080×1920 TikTok/Reels/Shorts video ads from product clips and images. Three viral styles: Clean, Meme, UGC. Python + ffmpeg, no cloud requ...
Send ntfy.sh notifications with curl, including advanced headers like title, priority, tags, click links, action buttons, and image attachments. Use when a u...
Generate and read QR codes. Use when the user wants to create a QR code from text/URL, or decode/read a QR code from an image file. Supports PNG/JPG output and can read QR codes from screenshots or im
Call Hidream txt2img async API with exposed auth and request parameters. Use when users need to generate images from text prompts, build runnable Python comm...
Translates Excel files (.xlsx) from English to Chinese while preserving all formatting, images, and charts. Use for any task where a user provides an Excel f...
Render an STL file to a PNG image with a solid color using a deterministic software renderer and adjustable 3D perspective parameters.
Draft and publish posts to 小红书 (Xiaohongshu/RED). Use when creating content for 小红书, drafting posts, generating cover images, or publishing via browser automation. Covers the full workflow from conten
Analyze any webpage URL for SEO issues and get actionable recommendations. Checks title tags, meta descriptions, heading structure, keyword density, image al...
Convert PDFs, DOCX, PPTX, and images to Markdown using zerox with GPT-4o vision, including OCR for scanned documents.
Extract text and layout from images and PDFs using LLMWhisperer API. Good for handwriting and complex forms.
WeChat Official Account Draft Box management tool. Create and manage graphic draft articles via WeChat API, supporting text and images. Automatically extract...
Complete UGC video campaign pipeline: product → hero image → variations → videos → edited final. ✅ USE WHEN: - User says "crear campaña UGC" or "pipeline completo" - Need end-to-end UGC vi
Process screenshots Enzo shares with comments. Save to reference library, extract content, categorize, set reminders, and log patterns. Use when Enzo sends an image with context like "save this", shar