Search

291 results for "ocr"


Auto Captcha Solver

Detect and solve simple image captchas during browser automation. Use when flows encounter 4-6 character text, distorted alphanumeric, numeric, rotated, or a...

❤️ 0 ⬇️ 17

🧪 Skill

nano-banana-pdf-edit

Free

Edit PDF files visually using natural language with the nano-pdf CLI tool, powered by Google's Gemini 3 Pro Image (Nano Banana). Use this skill whenever the...

❤️ 2 ⬇️ 403

🧪 Skill

homework-grader

Free

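Homework grading assistant for teachers: automatically grades math homework, tallies wrong answers, and generates Excel summary sheets and PDF reports. Use when a teacher needs to: (1) upload the correct answers for the AI to recognize (2) batch-upload photos of student homework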

❤️ 1 ⬇️ 24

🧪 Skill

Clawshier

Free

Scan receipt or invoice photos sent via chat, extract expense data using OpenAI Vision, validate and deduplicate, then log to a Google Spreadsheet. Responds...

❤️ 0 ⬇️ 212

🧪 Skill

reMarkable MCP

Free

Access reMarkable tablet documents, notebooks, PDFs, and EPUBs. Use when the user wants to read, search, browse, or extract text from their reMarkable tablet...

❤️ 0 ⬇️ 107

🧪 Skill

Windows Control

Free

Full Windows desktop control. Mouse, keyboard, screenshots - interact with any Windows application like a human.

❤️ 25 ⬇️ 5.7k

🧪 Skill

Soulprint

Free

Soulprint decentralized identity verification for AI agents. v0.6.4 — blockchain-first architecture (no libp2p): state lives on Base Sepolia, 4 validator nod...

❤️ 4 ⬇️ 408

🧪 Skill

Mac Use 1.0.0

Free

Control macOS GUI apps visually — take screenshots, click, scroll, type. Use when the user asks to interact with any Mac desktop application's graphical inte...

❤️ 0 ⬇️ 7

🧪 Skill

yindeng_ analyse

Free

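Automatically crawls non-performing loan transfer announcements and results from the Yindeng website (银登网), supports multi-model extraction of key financial data, and exports structured analysis reports.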

❤️ 1 ⬇️ 134

🧪 Skill

Research Library

Free

Local-first multimedia research library for hardware projects. Capture code, CAD, PDFs, images. Search with material-type weighting. Project isolation with cross-references. Async extraction. Backup +

❤️ 0 ⬇️ 1.1k

🧪 Skill

Llamaparse

Free

Parse, extract, and analyze documents using the LlamaParse API (LlamaCloud). Use when the user asks to parse PDFs, images, spreadsheets, or other documents i...

❤️ 1 ⬇️ 34

🧪 Skill

mac-use

Free

Control macOS GUI apps visually — take screenshots, click, scroll, type. Use when the user asks to interact with any Mac desktop application's graphical interface.

❤️ 3 ⬇️ 2.0k

🧪 Skill

Mineru Extract

Free

Use the official MinerU (mineru.net) parsing API to convert a URL (HTML pages like WeChat articles, or direct PDF/Office/image links) into clean Markdown + s...

❤️ 0 ⬇️ 17

🧪 Skill

Scanner

Free

Transform document photos into clean scanned-looking pages with automatic edge detection, cropping, and perspective correction. Use when (1) the user wants a...

❤️ 0 ⬇️ 33

💬 Prompt

Vision-to-json

Free

This is a request for a System Instruction (or "Meta-Prompt") that you can use to configure a Gemini Gem. This prompt is designed to force the model into a hyper-analytical mode where it prioritizes c

❤️ 0 ⬇️ 0

💬 Prompt

Job Posting Snapshot & Preservation Engine

Free

Title: Job Posting Snapshot & Preservation Engine. Version: 1.5. Author: Scott M. Last updated: 2026-03. Changelog

❤️ 0 ⬇️ 0

🧪 Skill

Privacy Mask

Free

Mask and redact sensitive information (PII) in screenshots and images — phone numbers, emails, IDs, API keys, crypto wallets, credit cards, passwords, and mo...

❤️ 0 ⬇️ 87

🧪 Skill

Pdf Anthropic

Free

Use this skill whenever the user wants to do anything with PDF files. This includes reading or extracting text/tables from PDFs, combining or merging multipl...

❤️ 0 ⬇️ 29

🧪 Skill

Superdoc

Free

Create, edit, and manipulate DOCX files using SuperDoc - a modern document editor with custom rendering pipeline. Use when you need to programmatically work...

❤️ 0 ⬇️ 98

🧪 Skill

Knowledge Base Collector

Free

Collect and organize a personal knowledge base from URLs (web/X/WeChat) and screenshots. Use when the user says they want to save an URL, ingest a link, archive content to KB, tag/classify notes, stor

❤️ 0 ⬇️ 662

🧪 Skill

Didit Kyc Onboarding

Free

End-to-end KYC (Know Your Customer) identity verification for onboarding real users. Use when someone needs to perform KYC, onboard users with identity verif...

❤️ 0 ⬇️ 117

🧪 Skill

PR's PDF Agent

Free

pdfagent (version 0.1.0): Self-hosted PDF operations and conversions with metered usage output. Use `pdfagent` to perform PDF operations (merge, split,

❤️ 2 ⬇️ 130

🧪 Skill

zenTable

Free

Render structured table data as high-quality PNG images using Headless Chrome. Use when: need to visualize tabular data for chat interfaces, reports, or soci...

❤️ 1 ⬇️ 231

🧪 Skill

Template SDS Generator

Free

Generate a deterministic, template-preserving 16-section SDS/MSDS package from 1 DOCX template, 1 prompt/rule file, and 1-3 source SDS/MSDS files, with DOCX/...

❤️ 0 ⬇️ 59