🧪 Skills

WiseOCR

PDF OCR — Convert PDF to Markdown via WiseDiag cloud API (requires WISEDIAG_API_KEY). Files are uploaded to WiseDiag servers for processing. Supports table r...

v1.0.9
❤️ 5
⬇️ 400
👁 1
Share

Description


name: wiseocr description: "Convert PDF files to Markdown using WiseOCR API (powered by WiseDiag). Supports table recognition, multi-column layouts, and medical document OCR. Usage: Upload a PDF file and say Use WiseOCR to process this." registry: homepage: https://github.com/fmdmm/WiseOCR author: fmdmm credentials: required: true env_vars: - WISEDIAG_API_KEY

⚠️ Privacy Warning

IMPORTANT - READ BEFORE INSTALLING:

This skill uploads your files to WiseDiag's cloud servers for OCR processing.

Do NOT use with sensitive or confidential documents unless:

  • You trust WiseDiag's data handling policies
  • You accept that file contents will be transmitted and processed remotely

For sensitive documents, use offline/local OCR tools instead.


WiseOCR Skill (powered by WiseDiag)

Convert PDF files into Markdown format. The script handles API authentication, file upload, OCR processing, and saves the result automatically.

⚠️ IMPORTANT: How to Use This Skill

You MUST use the provided script to process files. Do NOT call any API or HTTP endpoint directly.

The script scripts/wiseocr.py handles everything:

  • API authentication (reads WISEDIAG_API_KEY from environment)
  • PDF upload and OCR processing
  • Saves the Markdown result to WiseOCR/{filename}.md

🔑 API Key Setup (Required)

Get your API key: 👉 https://console.wisediag.com/apiKeyManage

export WISEDIAG_API_KEY=your_api_key

Installation

pip install -r requirements.txt

Usage

To process a PDF file, run:

cd scripts
python wiseocr.py -i /path/to/input.pdf -n original_filename

IMPORTANT: Always pass -n with the original filename (without extension) so the output file is named correctly. If the uploaded file has been renamed (e.g. to ocr_input.pdf), -n ensures the output uses the user's original filename.

The script will automatically save the result to WiseOCR/{name}.md.

Example:

python wiseocr.py -i /tmp/ocr_input.pdf -n medical_report
# Output saved to: WiseOCR/medical_report.md

With custom output directory:

python wiseocr.py -i /path/to/input.pdf -n medical_report -o /custom/output/dir

Arguments

Flag Description
-i, --input Input PDF file path (required)
-n, --name Original filename without extension for output (recommended)
-o, --output Output directory (default: ./WiseOCR)
--dpi PDF rendering DPI, 72-600 (default: 200)

Output

After the script runs, the Markdown file is saved automatically:

  • Default: WiseOCR/{name}.md
  • The file is named using the -n parameter (e.g. -n reportreport.md), or falls back to the input filename
  • No additional saving is needed — the file is already on disk

Data Privacy

What happens to your files:

  1. Files are uploaded to WiseDiag's OCR API (https://openapi.wisediag.com)
  2. Files are processed on WiseDiag servers
  3. Processing results are returned to you
  4. Files are not permanently stored on WiseDiag servers

For sensitive documents, use offline/local OCR tools instead.

License

MIT

Reviews (0)

Sign in to write a review.

No reviews yet. Be the first to review!

Comments (0)

Sign in to join the discussion.

No comments yet. Be the first to share your thoughts!

Compatible Platforms

Pricing

Free

Related Configs