🧪 Skills
OCR with python
Extract Chinese and English text from images and scanned PDFs, including documents like invoices and contracts, using PaddleOCR in Python.
v1.0.0
Description
name: ocr description: Optical Character Recognition (OCR) tool, supports Chinese and English text extraction from PDFs and images. Use cases: (1) extract text from scanned PDFs, (2) recognize text from images, (3) extract text content from invoices, contracts, and other documents
OCR Text Recognition
This skill uses PaddleOCR for text recognition, supporting both Chinese and English.
Quick Start
Basic Usage
Perform OCR recognition directly on image or PDF files:
from paddleocr import PaddleOCR
ocr = PaddleOCR(lang='ch')
result = ocr.predict("file_path.jpg")
Dependency Installation
Install dependencies before first use:
pip3 install paddlepaddle paddleocr
Output Format
Recognition results return JSON containing:
rec_texts: List of recognized textrec_scores: Confidence score for each text
Typical Use Cases
- PDF Scans: Use PyMuPDF to extract images first, then OCR
- Image Text Recognition: Perform OCR directly on images
- Multi-page PDFs: Process page by page
Scripts
Common scripts are located in the scripts/ directory.
Reviews (0)
Sign in to write a review.
No reviews yet. Be the first to review!
Comments (0)
No comments yet. Be the first to share your thoughts!