Search

291 results for "ocr"

All 🧪 Skills 🔌 MCP Servers 📏 Rules 💬 Prompts

DeepRead Agent Self Sign Up

Authenticate AI agents with the DeepRead OCR API using OAuth device flow. The agent displays a code, the user approves it in their browser, and the agent rec...

❤️ 2 ⬇️ 500

🧪 Skill

Prismer

Free

Prismer enables agents to fetch, compress, and parse web content, perform OCR, and communicate via messaging with real-time sync using CLI or SDK.

❤️ 0 ⬇️ 632

🧪 Skill

Extract PDF Text

Free

Extract text from PDF files using PyMuPDF. Parse tables, forms, and complex layouts. Supports OCR for scanned documents.

❤️ 0 ⬇️ 650

🔌 MCP

Mineru Document Parsing Server

Free

Provide powerful document parsing capabilities by integrating with the Mineru API. Enable single and batch file parsing with support for multiple formats, OCR, formula, and table recognition. Monitor

❤️ 0 ⬇️ 0

🧪 Skill

habib-pdf-to-json

Free

Extract structured data from construction PDFs. Convert specifications, BOMs, schedules, and reports from PDF to Excel/CSV/JSON. Use OCR for scanned documents and pdfplumber for native PDFs.

❤️ 0 ⬇️ 791

🧪 Skill

UI Element Ops

Free

Parse UI screenshots into structured element JSON (type, OCR text, bbox) and operate desktop UI from parsed elements. Use when a user asks to detect/locate U...

❤️ 0 ⬇️ 259

🧪 Skill

Pdf To Structured

Free

Extract structured data from construction PDFs. Convert specifications, BOMs, schedules, and reports from PDF to Excel/CSV/JSON. Use OCR for scanned documents and pdfplumber for native PDFs.

❤️ 7 ⬇️ 2.7k

🧪 Skill

deAPI AI Media Suite (Community)

Free

The cheapest AI media API on the market. Generate images (Flux), music (AceStep), speech with voice cloning, transcribe video/audio, OCR, video generation, b...

❤️ 1 ⬇️ 48

🧪 Skill

发票内容识别

Free

增值税发票识别技能：自动识别 PDF（单页/多页）或各种常见图片格式（PNG/JPG等）的发票，调用百度云增值税发票 OCR API 提取关键信息，输出结构化 Excel 报告。适用于以下场景：用户上传发票文件并要求识别、提取、转换信息时；需要批量处理发票并生成 Excel 汇总表时；需要对发票进行检测、内容...

❤️ 0 ⬇️ 19

🧪 Skill

MinerU PDF Extractor

Free

Extract PDF content to Markdown using MinerU API. Supports formulas, tables, OCR. Provides both local file and online URL parsing methods.

❤️ 2 ⬇️ 599

🧪 Skill

Masumi Network Warranty Vault

Free

Masumi Network skill for warranty vault verification. Handles OCR receipt scanning, Cardano blockchain proof-of-purchase logging, immutable decision logging, agent collaboration discovery, and smart w

❤️ 0 ⬇️ 728

🧪 Skill

arithmetic-orc

Free

识别图片中的K12算式（加减乘除、竖式计算、分数、方程等），返回结构化文本结果。支持手写体和印刷体，可拒绝非算式图片。触发条件：用户要求识别算式、数学题、计算题图片，或上传数学题图片时调用。关键词：算式识别、数学题、OCR、竖式计算、ArithmeticOCR

❤️ 0 ⬇️ 12

🧪 Skill

Veryfi Documents AI

Free

Real-time OCR and data extraction API by Veryfi (https://veryfi.com). Extract structured data from receipts, invoices, bank statements, W-9s, purchase orders...

❤️ 16 ⬇️ 513

🧪 Skill

MarkItDown Skill

Free

OpenClaw agent skill for converting documents to Markdown. Documentation and utilities for Microsoft's MarkItDown library. Supports PDF, Word, PowerPoint, Excel, images (OCR), audio (transcription), H

❤️ 0 ⬇️ 1.1k

🧪 Skill

wx

Free

发送微信消息给指定联系人。支持两种模式：(1) 有消息内容：直接发送指定消息；(2) 无消息内容：OCR 截图识别聊天窗口内容并自动回复。当用户需要自动发送微信消息、自动回复微信聊天时触发此技能。

❤️ 0 ⬇️ 74

🔌 MCP

nutrient-dws-mcp-server

Free

MCP server for the Nutrient DWS Processor API. Convert, merge, redact, sign, OCR, watermark, and extract data from PDFs and Office documents via natural language. Works with Claude Desktop, LangGraph,

❤️ 0 ⬇️ 0

🧪 Skill

Siphonclaw Skill

Free

Hybrid document intelligence pipeline ingesting PDFs, images, and spreadsheets with OCR, visual and text search, and field fix capture for fast retrieval.

❤️ 0 ⬇️ 430

🧪 Skill

Bizcard

Free

Business card scanner + Google Contacts manager. Auto-detects business card images, extracts contact info via OCR (imageModel), confirms with user, saves to...

❤️ 0 ⬇️ 323

🧪 Skill

MinerU PDF Parser

Free

用 MinerU API 解析 PDF/Word/PPT/图片为 Markdown，支持公式、表格、OCR。适用于论文解析、文档提取。

❤️ 6 ⬇️ 3.1k

🧪 Skill

中文工具包

Free

为OpenClaw提供中文文本处理、翻译、OCR、语音识别等功能的综合工具包。支持中文分词、拼音转换、中英文翻译、关键词提取、文本分析等功能。

❤️ 1 ⬇️ 705

🧪 Skill

a2a-context

Free

Provides web content fetching, caching, document OCR, real-time messaging, group chats, file transfers, and webhook integrations via Prismer Cloud APIs.

❤️ 0 ⬇️ 142

🧪 Skill

document-parser

Free

Extract structured data from PDFs, images, and Word files with layout analysis, table recognition, OCR, seal detection, and directory extraction.

❤️ 0 ⬇️ 212

🧪 Skill

PR's PDF Agent

Free

Self-hosted PDF operations and conversions with metered usage output.

❤️ 2 ⬇️ 119

🧪 Skill

Nutrient Document Processing (Universal Agent Skill)

Free

Universal (non-OpenClaw) Nutrient DWS document-processing skill for Agent Skills-compatible products. Best for Claude Code, Codex CLI, Gemini CLI, Cursor, Wi...

❤️ 0 ⬇️ 232