Extract text from PDF files using PyMuPDF. Parse tables, forms, and complex layouts. Supports OCR for scanned documents.
Edit PDFs with natural-language instructions using the nano-pdf CLI.
Extract PDF content to Markdown using MinerU API. Supports formulas, tables, OCR. Provides both local file and online URL parsing methods.
--- name: pdf-smart-tool-cn version: 1.0.1 description: "PDF智能处理工具 Pro | PDF转图片、图片转PDF、OCR识别、PDF合并拆分。" author: 赚钱小能手 metadata: clawhub: emoj
Extract text from PDFs with OCR support. Perfect for digitizing documents, processing invoices, or analyzing content. Zero dependencies required.
Generate AI podcast episodes from PDFs, text, notes, and links using MagicPodcast in OpenClaw. Creates natural two-person dialogue audio, supports custom lan...
Generate PDFs and images from HTML. No sign-up required — the free tier works out of the box (1 req/min, small watermark). For higher volume and no watermark...
PDF-API.io integration. Manage data, records, and automate workflows. Use when the user wants to interact with PDF-API.io data.
Extract multilingual document content and language learning notes (French, German, Japanese, Spanish, etc.) from PDFs using multimodal vision (Qwen-VL-Max)....
Extract text from PDFs using Google Gemini OCR. Use when extracting text from PDFs, performing OCR on scanned documents, or processing image-based PDFs.
PDF批量处理技能包 - 一键合并、分割、压缩、转换PDF。适合办公人员、文档处理、自动化工作流。
Download PDFs from PubMed Central (PMC) and Europe PMC. Use when the user needs to download open-access academic papers from PubMed Central using PMC ID, Pub...
Upload PDFs and manage items in a Zotero Web Library. Supports both personal and group libraries. Use when a user wants to add papers/PDFs to Zotero, organiz...
Parse PDFs locally (CPU) into Markdown/JSON using MinerU. Assumes MinerU creates per‑doc output folders; supports table/image extraction.
Generate professional PDFs from HTML/CSS using flow layouts and selective break controls to avoid whitespace gaps and layout issues in printed pages.
Convert Markdown files to styled PDFs using pandoc and wkhtmltopdf with built-in or custom CSS style options.
PDF扫描件转Word文档。支持中文OCR识别,自动裁掉页眉页脚,保留插图,彩色章节封面页保留为图片。使用百度OCR API(免费额度1000次/月)。当用户要求
PDF智能处理工具 v1.1 | 新增:PDF编辑、电子签名、批量OCR、表单填写、文档对比。支持转换、OCR、合并拆分。
将PDF文件的每一页转换为图片文件;支持自定义图片格式(PNG/JPG)和分辨率;适用于文档处理、图片化存档等场景
用 MinerU API 解析 PDF/Word/PPT/图片为 Markdown,支持公式、表格、OCR。适用于论文解析、文档提取。