Suggestion for preprocessing of Image. #12089

Closed
erictan23 opened this issue May 10, 2024 · 2 comments

@erictan23

Hello! Thank you for sharing PaddleOCR. I am trying to read text from CCTV surveillance video footage. However, some characters are black on a black background, which results in poor performance from the OCR model. The image is in PNG format.

[Image: invert_test]

The results by the model are: 02+60023-We1-1249:05

I have tried some preprocessing methods, such as converting to grayscale and applying a median blur, and the results improve slightly:
gray = cv2.cvtColor(img, cv2.COLOR_BGR2GRAY)
median = cv2.medianBlur(gray, 7)

[Image: processed_image]

The results by the model are: 09-20023Wed-1249:05

Are there any suggestions for improving the results?

@GreatV
Collaborator

GreatV commented May 10, 2024

In my experience, you need to collect some data for the corresponding scenarios and fine-tune the model on it. A generic model may not be sufficient.
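As a rough illustration of the data-collection step, cropped text regions and their transcriptions could be gathered into a tab-separated label file with one `image_path<TAB>text` pair per line, which is the layout PaddleOCR's recognition training data uses. The helper name `write_rec_labels`, the file paths, and the sample transcriptions below are hypothetical placeholders, not data from this issue.

```python
from pathlib import Path


def write_rec_labels(samples: dict[str, str], label_file: Path) -> int:
    """Write 'image_path<TAB>text' lines and return how many were written."""
    # Skip crops with an empty transcription; they carry no training signal.
    lines = [f"{path}\t{text}" for path, text in samples.items() if text.strip()]
    label_file.write_text("\n".join(lines) + "\n", encoding="utf-8")
    return len(lines)


if __name__ == "__main__":
    # Hypothetical crops of the timestamp overlay and their manual labels.
    demo = {
        "crops/frame_0001.png": "2024-01-01 Mon 00:00:00",
        "crops/frame_0002.png": "2024-01-01 Mon 00:00:01",
    }
    print(write_rec_labels(demo, Path("rec_gt_train.txt")))
```

The training configuration would then need to point at this label file; the exact config keys depend on the PaddleOCR version in use.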

@zhangyubo0722
Collaborator

Yes, you need to collect some data to retrain the model.
