I am using Firebase Vision Image in one of my Android apps. Whenever the Android emulator takes a picture, the Firebase Vision Image Labeler returns a list of labels for the bitmap captured by the camera.
My problem is that the labels are rarely accurate. For instance, if I take a picture of a face, it returns the label "dog" with 80% confidence. However, when I take a picture of a face and upload it to the demo labeler on the Google Cloud Vision website (Cloud Vision is what Firebase builds upon), it returns much more accurate labels.
Why are the Firebase Vision labels so different from the Google Cloud Vision labels?
By the way, I'm taking the pictures on my laptop.