Questions tagged [google-vision]

The Google Mobile Vision API for Android provides a framework for finding objects in photos and video.

Note that issues related to the Google Cloud Vision API should use the tag "google-cloud-vision" rather than this tag.

Learn more about the Google Mobile Vision API through the Google Mobile Vision documentation and Google Mobile Vision sample apps.

720 questions
7
votes
1 answer

Android face detection - Vision API or Camera API

I need a simple way of detecting faces in my Camera app. All I care about is a callback saying that a face was detected in a certain place, or with the coordinates where it was detected on the preview. I've noticed that there seem to be several face…
vkislicins
7
votes
2 answers

How to generate a Barcode and convert it to a Bitmap using the new Google Vision API?

How to generate a Barcode and convert it to a Bitmap using the new Google Vision API? Barcode barcode = new Barcode(); Barcode.Email email = new Barcode.Email(); email.address = "my_email@gmail.com"; email.subject = "My Subject"; email.body = "My body…
7
votes
1 answer

Google Vision Barcode Detection Library Not Installing On Some Devices

I'm writing an app using the new Google Play Services Vision libraries to detect barcodes. Everything works fine on most devices I've tested on, but one particular device refuses to install the native libraries (mentioned in this comment) // Note:…
jsonfry
6
votes
2 answers

google.cloud.vision_v1.types.image_annotator.AnnotateImageResponse to JSON in Python

I am using the Google Vision document_text_detection function and I am trying to dump the AnnotateImageResponse to JSON. Earlier this code used to work: client = vision.ImageAnnotatorClient() image = vision.Image(content=image) response =…
Akash Kumar
6
votes
0 answers

Returning formatted text from GCP Vision PDF results

I finally got my script to submit a PDF document to Google Storage and then extract text using Google Vision for PDF, as described in the documentation. The data is returned in a huge JSON file. There's one node that contains text, but it's no longer…
santa
6
votes
2 answers

How to make bounding box around text-areas in an image? (Even if text is skewed!!)

I am trying to detect and grab text from a screenshot taken from any consumer product's ad. My code works with a certain accuracy but fails to make bounding boxes around skewed text areas. Recently I tried the Google Vision API and it makes bounding…
6
votes
2 answers

How to get complete product information from a barcode display value in Android?

I used the Google Vision Barcode API for scanning the barcode. From that, I am able to get the display value from the barcode. But how can I get complete product information after scanning the barcode? That is, after scanning I'm getting the barcode…
Arjun G
6
votes
1 answer

How to get the frame of a detected barcode using the Google Vision API for barcode detection

Google Vision's barcode detection API works fine on Android and returns the result of the scanned barcode. But I didn't find any way to get the frame in which the barcode was detected. Is there any way to get that exact frame?
6
votes
1 answer

How to enable Google Vision API to access Google Cloud Storage Bucket within same project

I have uploaded some test images to a Google Cloud Bucket, but don't want to make them public (which would be cheating). When I try to run a REST call for the Google Vision API I get: { "responses": [ { "error": { "code": 7, …
smackenzie
6
votes
0 answers

Why are BarcodeDetector and associated APIs so unreliable?

I've been experimenting with BarcodeDetector and the associated Vision APIs for a week or so and, unfortunately, the conclusion is that they are unreliable. I've tested them with ~10 devices: Samsung and Nexus tablets/phones with Android 4.4 to 6.0.1. …
6
votes
2 answers

Trying to import code from GitHub into Android Studio

I feel incredibly dumb asking this, but I've downloaded the sample code of the Android Mobile Vision API available here: https://github.com/googlesamples/android-vision. I'm trying to import this code into Android Studio so that I am able to just…
Pratik Thaker
5
votes
1 answer

Firebase iOS annotateImage Function returning 'Unexpected token o in JSON at position 1'

I'm writing an ImageRecognizer using Firebase Cloud Functions API in Xcode 12.4 with Swift as follows: import Firebase import UIKit import Foundation class ImageRecognizer { let imageName: String lazy var functions = Functions.functions() …
5
votes
1 answer

The shape of dict['ToFloat'] provided in model.execute(dict) must be []

** I'M AWARE OF SIMILAR QUESTIONS!! ** My question is for my particular situation... I used Google Vision to train my own model to detect custom objects. I've come across similar errors about shape in the past and I resolved them by reshaping my…
5
votes
2 answers

OCR confidence score from Google Vision API

I am using Google Vision OCR to extract text from images in Python, using the following code snippet. However, the confidence score always shows 0.0, which is definitely incorrect. How to extract the OCR confidence score for individual char or…
5
votes
1 answer

Combining nearby bounding boxes along one axis

Here, I use Google Vision API to detect text from the following image. The red box indicates samples of a combined bounding box that I would like to obtain. Basically, I get the text output and bounding box from the above image. Here, I would like…
titipata