MinerU PDF Parser Clawdbot Skill

Parse PDFs locally (CPU) into Markdown/JSON using MinerU. Assumes MinerU creates per‑doc output folders; supports table/image extraction.

v1.0.0

Description


name: mineru-pdf
description: Parse PDFs locally (CPU) into Markdown/JSON using MinerU. Assumes MinerU creates per‑doc output folders; supports table/image extraction.

MinerU PDF

Overview

Parse a PDF locally with MinerU (CPU). Default output is Markdown + JSON. Enable table/image extraction only when requested.

Quick start (single PDF)

# Run from the skill directory
./scripts/mineru_parse.sh /path/to/file.pdf

Optional examples:

./scripts/mineru_parse.sh /path/to/file.pdf --format json
./scripts/mineru_parse.sh /path/to/file.pdf --tables --images

When to read references

If the MinerU CLI flags differ from the ones your wrapper passes, or you need to change advanced defaults (backend/method/device/threads/format mapping), read:

  • references/mineru-cli.md

Output conventions

  • Output root defaults to ./mineru-output/.
  • MinerU creates the per-document subfolder under the output root (e.g., ./mineru-output/<basename>/...).
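
The convention above can be sketched as a small shell helper that predicts where results for a given PDF should land. This is only an illustration: the function name expected_output_dir is hypothetical, and it assumes the per-document subfolder is named after the PDF's basename with the .pdf extension removed, which should be checked against what MinerU actually writes.

```shell
# Sketch: predict the per-document output folder for a PDF.
# Assumption: the subfolder name is the PDF basename without ".pdf".
# expected_output_dir is a hypothetical helper, not part of the skill.
expected_output_dir() {
  pdf_path="$1"
  output_root="${2:-./mineru-output}"   # default output root from this skill

  name="$(basename "$pdf_path")"
  name="${name%.pdf}"                   # strips a lowercase .pdf suffix only

  # Drop one trailing slash from the root so the joined path has no "//".
  printf '%s/%s\n' "${output_root%/}" "$name"
}

expected_output_dir /path/to/file.pdf
```

Listing that folder after a run (for example with ls) is a quick way to confirm the Markdown and JSON files were produced where expected.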

Batching

Default is single-PDF parsing. Only implement batch folder parsing if explicitly requested.
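
If batch parsing is explicitly requested, one possible shape is a loop that calls the single-PDF wrapper once per file. The sketch below is an assumption, not part of the skill: parse_folder and the MINERU_PARSE override variable are hypothetical names, and only the wrapper path ./scripts/mineru_parse.sh comes from the quick start.

```shell
# Sketch: run the single-PDF wrapper once for every *.pdf in a folder.
# parse_folder and MINERU_PARSE are hypothetical; the default wrapper path
# is the one shown in the quick start.
parse_folder() {
  dir="$1"
  parser="${MINERU_PARSE:-./scripts/mineru_parse.sh}"
  failed=0

  for pdf in "$dir"/*.pdf; do
    # If the glob matched nothing, the pattern is left as-is; skip it.
    [ -e "$pdf" ] || continue
    # Keep going after a failure so one bad PDF does not stop the batch.
    "$parser" "$pdf" || failed=$((failed + 1))
  done

  echo "failed: $failed"
  [ "$failed" -eq 0 ]
}

# Usage (from the skill directory):
#   parse_folder /path/to/pdf-folder
```

Calling the wrapper per file keeps the single-PDF behavior unchanged, so each document still gets its own subfolder under the output root.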