I have been using Google's vision API for text detection for the past few months. The API returns a "map" of words present in a particular image/document. Each element in the "map" (JSON) would have the text of a word and its coordinates in the document
Earlier the mapping used to break up text to words on the basis of spaces and special characters... and now it seems that the text is broken to words based only on spaces.
For example a document with the text "Foo.Bar Hello World" would have 4 elements i.e. 4 words, because a "word" meant breaking text on special character too... but now "Foo.Bar Hello World" results only in 3 words.
My question is that Is there a way to choose which version of the API we can use? or Is there a way to track changes introduced to return structure of the API/the changes in the TEXT_DETECTION model?
What I have checked out:
- https://cloud.google.com/vision/docs/release-notes -> This contains releases for the API as a whole and not changes in the model running for the OCR or "post-processing" of the model's result.
- cloud-vision-discuss google group for similar issues.