Questions tagged [google-cloud-vision]

Google Cloud Vision API enables developers to understand the content of an image by encapsulating powerful machine learning models in an easy to use REST API.

Google Cloud Vision API enables developers to understand the content of an image by encapsulating powerful machine learning models in an easy to use REST API. It quickly classifies images into thousands of categories (e.g., "sailboat", "lion", "Eiffel Tower"), detects individual objects and faces within images, and finds and reads printed words contained within images.

You can build metadata on your image catalog, moderate offensive content, or enable new marketing scenarios through image sentiment analysis. Analyze images uploaded in the request, and in upcoming releases, integrate with your image storage on Google Cloud Storage. Documentation

889 questions
10
votes
1 answer

Incomplete coordinate values for Google Vision OCR

I have a script that is iterating through images of different forms. When parsing the Google Vision Text detection response, I use the XY coordinates in the 'boundingPoly' for each text item to specifically look for data in different parts of the…
crld
  • 387
  • 3
  • 9
10
votes
4 answers

Cloud Vision API - PDF OCR

I just tested the Google Cloud Vision API to read the text, if exist, in a image. Until now I installed the Maven Server and the Redis Server. I just follow the instructions in this…
Christian Salvador
  • 311
  • 1
  • 4
  • 12
9
votes
2 answers

Google cloud vision not accepting base64 encoded images python

I'm having a problem with base64 encoded images sent to Google Cloud Vision. Funny thing is that if I send the image via URI, it works fine, so I suspect there is something wrong the way I'm encoding. Here's the deal: from google.cloud import…
AlejandroVK
  • 7,373
  • 13
  • 54
  • 77
9
votes
2 answers

Does Google Cloud Vision OCR API have better accuracy and performance than Tesseract OCR API

I have integrated Google Cloud Vision API in my java application for text recognition from complex formatted documents. One of my colleague suggested to use "Tesseract API".Can anyone please give difference between these two API's.And which is…
nik
  • 1,464
  • 4
  • 18
  • 32
9
votes
1 answer

Using the Google Cloud Vision API with a simple API key

I am using the Google Cloud Vision Java API client documented here: https://cloud.google.com/vision/docs/reference/libraries. The following quickstart code works fine if I use the implicit default credentials by setting the…
mac01021
  • 745
  • 4
  • 13
9
votes
1 answer

Google Cloud Vision API - How to enable a service account

I am trying to get my head round GoogleVision API Java library. I have created a service account, downloaded the json and set this environment variable. GOOGLE_APPLICATION_CREDENTIALS=C:\GoogleAPI\keys\translate-41428d4d1ec6.json I have set…
smackenzie
  • 2,880
  • 7
  • 46
  • 99
9
votes
1 answer

vision-client doesn't supprt api-key

It looks that google-cloud vision Python client (google.cloud.vision.client.Client) doesn't have an option to accept api-key. https://googlecloudplatform.github.io/google-cloud-python/stable/vision-client.html How can I use the client with api-key…
etusji
  • 138
  • 3
8
votes
0 answers

Google Cloud Vision OCR API returning incorrect values for bounding box/vertices

I'm using the "TEXT_DETECTION" option from the Google Cloud Vision API to OCR some images. The bounding box around individual characters is sometimes accurate and sometimes not, often within the same image. Is this a normal side-effect of a…
Mark Bench
  • 111
  • 3
8
votes
1 answer

Localhost requests from referer are blocked

I'm trying to run this project call thing translator from https://github.com/dmotz/thing-translator and I found this error while running it. It says: {error: {code: 403, message: "Requests from referer http://localhost/www/index.html are…
Daniah
  • 81
  • 1
  • 2
8
votes
1 answer

AggregateException when calling GetApplicationDefaultAsync()

I am trying to run the sample Vision API project. I basically copied and pasted the code Program.cs into my application and executed it. This line (which is line #36-#37 in Program.cs) GoogleCredential credential =…
Rakete1111
  • 47,013
  • 16
  • 123
  • 162
7
votes
1 answer

How to detect handwriting using Google Cloud Vision API

TL;DR: how can I detect the presence of handwriting in an image? I'm using Google's Python Vision API to scan for text in images, with generally good results. Most of the time the images contain printed text, but sometimes there is handwriting. As…
7
votes
1 answer

Is there a way to see estimated time for training a model in Google AutoML Vision?

I'm attempting to train a model to identify certain tags in images. I tried using the 1 hour free version and after an hour the training ended. The results weren't as accurate as I would like, so I took the plunge and selected the option which did…
Jay Song
  • 73
  • 5
7
votes
3 answers

Cloud Vision API Client threw an OS Error "too many open files"

I have met an Error of "Too many open files" when I run label detection via Cloud Vision API Client with Python. When I asked this probrem on GitHub before this post, the maintainer gave me an advice that the problem is general Python issue rather…
katsuya
  • 71
  • 2
7
votes
3 answers

Google vision Text Detection response to be line by line

I am using the Google vision api to perform text recognition on receipt images. I am getting some nice results returned but the format in which the return is quite unreliable. If there is a large gap between text the readout will print the line…
Wrumble
  • 231
  • 1
  • 5
  • 15
7
votes
3 answers

Does Google Cloud Vision API support face recognition or face identification?

I am looking for a Google Cloud API that can do both face recognition and identification. My understanding is that the Google Cloud Vision API will support only face detection, but not recognition. Is there any Google Cloud API that can do face…
sreekar reddy
  • 71
  • 1
  • 1
  • 3
1
2
3
59 60