I'm currently using the Google Cloud Vision API to do OCR in my application.
It's pretty easy, and there is plenty of documentation at https://cloud.google.com/vision/.
But to answer your question: the API response is JSON (the Python client exposes it as an object), from which you can pull out whatever you need. In your case I'm assuming that's the text.
Like this:
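# Assumes `client` is a vision.ImageAnnotatorClient() and `image` is a
# Vision image object built from your file's bytes or a URI.
response = client.text_detection(image=image)
texts = response.text_annotations
print('Texts:')

for text in texts:
    # The detected string for this annotation.
    print('\n"{}"'.format(text.description))

    # Corner points of the box around the detected text.
    vertices = ['({},{})'.format(vertex.x, vertex.y)
                for vertex in text.bounding_poly.vertices]

    print('bounds: {}'.format(','.join(vertices)))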
So basically you can do whatever you want with the response.
You will have to create an account, provide billing info, and do some setup before you can use it.
But if I'm not mistaken, a limited number of images per month is free (up to 2000 or something like that); check the pricing page for the current quota.