OCR Test

Extract text from images using Tesseract.js OCR. Supports Chinese (simplified/traditional) and English.

v1.0.0

Description


name: ocr
description: Extract text from images using Tesseract.js OCR. Supports Chinese (simplified/traditional) and English.
homepage: https://github.com/naptha/tesseract.js
metadata: {"clawdbot":{"emoji":"📝","requires":{"bins":["node"]},"install":[{"id":"npm","kind":"npm","packages":["tesseract.js"],"label":"Install tesseract.js"}]}}

OCR - Image Text Recognition

Extract text from images using Tesseract.js. Supports Chinese and English.

Quick start

node {baseDir}/scripts/ocr.js /path/to/image.jpg
node {baseDir}/scripts/ocr.js /path/to/image.png --lang chi_sim
node {baseDir}/scripts/ocr.js /path/to/image.jpg --lang chi_tra+eng

Options

  • --lang <langs>: Language codes (default: chi_sim+eng)

    • chi_sim - Simplified Chinese
    • chi_tra - Traditional Chinese
    • eng - English
    • Combine multiple languages with +, e.g. chi_sim+eng
  • --json: Output as JSON instead of plain text
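The contents of scripts/ocr.js are not shown here, so the following is only a minimal sketch of how the two options above could be parsed. The function name `parseArgs`, the error messages, and the returned object shape are hypothetical; only the flag names and the default `chi_sim+eng` come from the list above.

```javascript
// Hypothetical sketch of CLI parsing; the real scripts/ocr.js may differ.
const DEFAULT_LANG = 'chi_sim+eng'; // default documented in the Options list

function parseArgs(argv) {
  const opts = { image: null, lang: DEFAULT_LANG, json: false };
  for (let i = 0; i < argv.length; i++) {
    const arg = argv[i];
    if (arg === '--lang') {
      const value = argv[i + 1];
      if (!value || value.startsWith('--')) {
        throw new Error('--lang requires a value such as chi_sim+eng');
      }
      opts.lang = value;
      i++; // skip the value that was just consumed
    } else if (arg === '--json') {
      opts.json = true;
    } else if (opts.image === null) {
      opts.image = arg; // first positional argument is the image path
    } else {
      throw new Error(`Unexpected argument: ${arg}`);
    }
  }
  if (opts.image === null) {
    throw new Error('Usage: ocr.js <image> [--lang <langs>] [--json]');
  }
  return opts;
}

console.log(parseArgs(['mixed.png', '--lang', 'chi_tra+eng', '--json']));
// → { image: 'mixed.png', lang: 'chi_tra+eng', json: true }
```

Keeping the parser as a pure function of an argument array makes it easy to check without running OCR at all.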

Examples

# Recognize a Chinese screenshot (uses the default chi_sim+eng)
node {baseDir}/scripts/ocr.js screenshot.png

# Recognize an English document
node {baseDir}/scripts/ocr.js document.jpg --lang eng

# Mixed Chinese + English
node {baseDir}/scripts/ocr.js mixed.png --lang chi_sim+eng
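For readers curious what a command like the ones above might do internally, here is a rough sketch built on Tesseract.js's `recognize` helper. The function names `recognizeImage` and `formatResult` are hypothetical, and the JSON shape (a `text` and a `confidence` field) is an assumption, since the skill does not document what `--json` actually emits.

```javascript
// Sketch only: the JSON shape is an assumption, not the skill's documented output.
function formatResult(data, asJson) {
  const text = data.text.trim();
  if (!asJson) return text;
  return JSON.stringify({ text, confidence: data.confidence }, null, 2);
}

async function recognizeImage(imagePath, lang) {
  // Required lazily so formatResult can be used without the package installed.
  const Tesseract = require('tesseract.js');
  // recognize() creates a worker, runs OCR once, and cleans up afterwards.
  const { data } = await Tesseract.recognize(imagePath, lang);
  return data;
}

// Example usage (needs `npm install tesseract.js` and a real image file):
// recognizeImage('mixed.png', 'chi_sim+eng')
//   .then((data) => console.log(formatResult(data, true)))
//   .catch((err) => { console.error(err.message); process.exit(1); });
```

A script that processes many images would probably reuse a single worker instead of calling `recognize` per file, but for a one-shot command the helper keeps the code short.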

Notes

  • First run downloads language data (~20MB per language)
  • Subsequent runs reuse the locally cached language data, so nothing is downloaded again
  • Works best with clear, high-contrast images
  • Handwritten text is recognized less reliably than printed text, so expect lower and less consistent accuracy
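To anticipate whether a run will trigger the first-time download mentioned above, one could check which requested languages already have cached data. This sketch assumes the cache consists of files named `<lang>.traineddata` in a known directory (here the current working directory); that layout is an assumption about how Tesseract.js stores data under Node and may not match how this skill configures it. The helper name `uncachedLanguages` is hypothetical.

```javascript
const fs = require('fs');
const path = require('path');

// Assumption: cached language data lives at <cacheDir>/<lang>.traineddata.
function uncachedLanguages(langSpec, cacheDir) {
  return langSpec
    .split('+')                           // "chi_sim+eng" -> ["chi_sim", "eng"]
    .filter((lang) => lang.length > 0)    // ignore empty segments such as "eng+"
    .filter((lang) => !fs.existsSync(path.join(cacheDir, `${lang}.traineddata`)));
}

// Lists the languages that would still need a download under the assumption above.
console.log(uncachedLanguages('chi_sim+eng', process.cwd()));
```

An empty array means every requested language was found in the directory checked.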
