Optical character recognition in an image using Python

Question

I have an image file which Python reads and converts that to hexadecimal. The problem here is, even if I give an empty blank image its giving hexadecimal numbers as output. I need Python to process only the alphabets in the image and covert them to hexadecimal and give that as output.

Here is the program which I tired

import binascii

filename = 'a.png'
with open(filename, 'rb') as f:
    content = f.read()

print(binascii.hexlify(content))

Your program will give you hex codes of the image file. If you see an image file that is 100000 bytes in size, you will get 200000 hexadecimal digits (two per byte). It has nothing to do with what is shown on the image. The only way you would get no output is if the file was empty (0-length), and such a file cannot be said to be an image file. On the other hand if you want to _read_ the letters shown on the image, you need to use an OCR library (or code up an OCR from a machine learning library), and `binascii.hexlify` is an entirely wrong tool for the job. — Amadan, Aug 13 '18 at 10:54

Saranraj Nambusubramaniyan · Accepted Answer · 2018-08-13T11:03:45.460

2

This is OCR(Optical Character Recognition) problem, which is discussed several times in stack history.

Pytesserect do this in ease.

Usage:

import pytesserect
from PIL import Image

# Get text in the image
text = pytesseract.image_to_string(Image.open(filename))

# Convert string into hexadecimal
hex_text = text.encode("hex")

edited Aug 13 '18 at 11:03

answered Aug 13 '18 at 10:59

Saranraj Nambusubramaniyan

1,734
3
16
23

Thanks for the reply.will it be possible to recognize the characters of different font style.? – Fz Arjun Aug 13 '18 at 11:08
1

Yes as long as it is not something like handwriting – hypadr1v3 Aug 13 '18 at 11:09
or calligraphy type fonts – hypadr1v3 Aug 13 '18 at 11:09
1

Will you please post sample of the image? That helps us to explore possibilities. – Saranraj Nambusubramaniyan Aug 13 '18 at 11:10

Optical character recognition in an image using Python

1 Answers1