Methods to acquire, analyze and understand images using mathematical approaches.
Questions tagged [vision]
664 questions
4
votes
2 answers
flutter could not find com.google.android.gms.vision.barcode when using flutter_barcode_scanner
I have this error when using flutter_barcode_scanner :
Local module descriptor class for com.google.android.gms.vision.barcode not found.
Error loading optional module com.google.android.gms.vision.barcode:…

marwa nam
- 41
- 3
4
votes
2 answers
Meaning of the metrics in YOLOv5
What does means in YOLOv5 the metric "mAP_0.5"?
If we arrive at N iterations and its value is constant, what does it mean?
Lastly, what the training loss metrics, "box loss" and "obj loss", mean ?
Is it normal that by increasing the number of…

compup
- 77
- 1
- 8
4
votes
2 answers
Google Cloud Vision API "Bad image data" error
I'm using google cloud vision API trying to detect text from maps. The image size I'm processing is less than 20MB and it is stored in cloud storage. When I run the document_text_detection API, some of the images lead to the following error (some…

Zekun
- 375
- 5
- 12
4
votes
0 answers
Confidence metric in the Hough transform: does one exist?
I am using the Hough Transform algorithm from scikit-image and Python to find the center of a crosshair.
Overall this produces great results, but I need to quantify exactly how well the center of the crosshair has been found. The error propagation…

perezed_
- 41
- 4
4
votes
1 answer
FFMpeg vs. OpenCV for format conversion/simple transformation
I had to create a system that can process images in realtime. I have implemented in C++ a pixel format conversion system that can also do some simple transformation (currently: rotation & mirroring).
Input/output format of the system are frame in a…

lp35
- 198
- 3
- 9
4
votes
3 answers
Computer vision over cloud
Is any way of doing computer vision over the cloud?
The idea is like people log in a website, then the webcam is activated, the video data is sent to the server through internet. Server processes those data and sent back the processed data to user…

George
- 41
- 2
4
votes
2 answers
Is it possible to get the count of objects using Google's Vision API or Amazon's Rekognition?
I have been exploring to get the count of the objects in an image / video using AWS Rekognition & Google's Vision, but haven't been able to find a way out. Though at Google's Vision site, they do have a section 'Insight from the Images' where…

Amit
- 194
- 4
- 20
4
votes
0 answers
iOS11:How can I use Vision framework track face across video?
i can track object across video ,but i can't track face.
when i use camera track face . the code print []
extension FaceTrackingViewController: AVCaptureVideoDataOutputSampleBufferDelegate {
func captureOutput(_ output: AVCaptureOutput, didOutput…

yang yang
- 41
- 1
4
votes
1 answer
Microsoft Cognitive Service Vision API ClientException Error
I'm trying to use Vision Cognitive Services to receive the description of an image but my code always throws this exception:
Exception Microsoft.ProjectOxford.Vision.ClientException
HResult=0x80131500
Origine=

Tobio Dev
- 37
- 3
4
votes
1 answer
Google Play Vision: Define OCR Character Range?
I'm using Google's play-services-vision to perform Optical Character Recognition (OCR) using the TextRecognizer class on the Android platform.
In the interest of performance, I would like to limit the range of characters possible for recognition to…

Mapsy
- 4,192
- 1
- 37
- 43
4
votes
2 answers
What is the relationship between color space RGB, XYZ and the color matching function?
What is the relationship between color spaces (RGB, XYZ) and the color matching function? Let's say we have some color matching function in the color space XYZ (3 row matrix). We also have the transformation matrix which translates from XYZ…

RebeccaK375
- 871
- 3
- 17
- 28
4
votes
1 answer
Google Vision API - Face methods on null object references
I am trying to change the sample app provided by Google for the face detection on Android.
FaceDetector detector = new FaceDetector.Builder(getApplicationContext())
.setTrackingEnabled(false)
…

Loïs Talagrand
- 810
- 2
- 13
- 32
4
votes
1 answer
OpenCV 3.0 Can't load neural network
I need to use a neural network in my OpenCV (version 3.0) project. I've created and trained neural network and it works, but if I want to load neural network from YML file, it doesn't predict.
This is a code where I creat, train and save my neural…

Simon
- 133
- 1
- 5
4
votes
2 answers
Night Vision Mode on WPF Windows
We've made a WPF application with a traditional UI (common controls like tabs, buttons, labels, textboxes, and so on).
We need to add a "night vision" mode, which would make it look like Stellarium's night vision mode, so that it can be comfortably…

Daniel
- 41
- 1
- 3
4
votes
2 answers
Implementation of Absolute Differences Stereo Matching Algorithm
Good day!
I am trying to learn how to manually implement stereo matching algorithms. I'm basically starting with the most basic of them all - Absolute Difference.
I found some slides online that describe how to do it. Basically, from what I…

renzbagaporo
- 83
- 2
- 3