Methods to acquire, analyze and understand images using mathematical approaches.
Questions tagged [vision]
664 questions
2
votes
1 answer
Running out of memory while fetching images from Photos SwiftUI
I am working on an app that can detect dog breeds, and I would like users to see only images containing a dog when they pull up their photo library. To do this, I used the Vision pet recognizer model, which is very basic. The issue is that it…

SwiftlyCoding314
- 51
- 2
2
votes
1 answer
PyTorch global pruning is not reducing the size of the model
I am trying to prune my deep learning model via global pruning. The original unpruned model is about 77.5 MB. However, when I save the model after pruning, its size is the same as the original. Can anyone help me with this…

Whatsinthename
- 21
- 3
2
votes
1 answer
Vision framework to detect face looking up/down
I'm using the Vision framework to detect face orientation. VNFaceObservation has roll and yaw properties but unfortunately no pitch. How can I calculate the pitch value? I need to check whether a person is looking up or down.
Someone on another forum suggested…

maviton
- 23
- 1
- 7
2
votes
2 answers
EXC_BAD_ACCESS on VNSequenceRequestHandler
The following code uses the Vision and AVFoundation frameworks to enable face tracking on the built-in camera on macOS. In some circumstances the code crashes due to EXC_BAD_ACCESS (code=2) on a working thread on the queue…

Tambarskjelve
- 53
- 6
2
votes
3 answers
Interfacing LabVIEW with Python for triggering USB Camera
LabVIEW provides a palette for calling functions from Python. Do I need to develop 3 separate functions, one each for initializing, capturing and closing? I am using OpenCV in Python to access DirectShow-based USB cameras.
If I create 3 separate functions,…

Amlo4300
- 21
- 2
2
votes
1 answer
How can I thin or skeletonize an object without the object's corners being divergent?
I have the following image, which represents a path.
I want to apply an algorithm called HoughLines(P) on it. This algorithm should detect lines and coordinates. But first I need to make the path thinner for it to be valid input for…

Dirk
- 95
- 1
- 8
2
votes
0 answers
Could not cast TextBlock to Element - Android.gms.Vision
Code crashes at selected line. Reason: Cannot cast TextBlock to Element.
One would think that the SparseArray(items) is of type 'Element' because of previous declarations. (Language: Kotlin)

Jomme
- 1,256
- 4
- 14
- 26
2
votes
1 answer
Cannot get docker image of gcr.io/automl-vision-ondevice/gcloud-container-1.14.0:latest
I was trying to download the Docker image from
gcr.io/automl-vision-ondevice/gcloud-container-1.14.0:latest
but I got an Access Denied error. How can I solve this issue? Thanks a lot.
$ export…

Wingo Wong
- 23
- 2
2
votes
0 answers
iOS Vision framework text detection does not detect single digit numbers?
Text detection in the iOS Vision framework struggles to detect single-digit numbers. It seems to detect certain digits fine but fails on others, such as 1, 2 or 3. I am just trying to highlight/extract the numbers in…

Luke Ireton
- 479
- 1
- 3
- 18
2
votes
0 answers
Flutter OCR - Google Cloud Vision API
I have a problem implementing the Google Cloud Vision API in my Flutter code. I need OCR functionality, and the language I'm targeting is only available through this API. Any help?

Koalaz
- 21
- 2
2
votes
0 answers
OpenCV can't find the contours of an aluminium plate
I'm trying to extract this mold from this picture. I've got some code to extract it; the only things I need are the four corners.
def detect_object(image):
    orig = image.copy()
    image = cv2.cvtColor(image, cv2.COLOR_BGR2GRAY)
    blurry =…

Randpy Projects
- 21
- 3
2
votes
1 answer
Separate crossing segments in a binarised image
I have some image processing that allows me to extract a binary image containing thick segments, and I'm facing the issue that these segments may cross each other. Hence I need to find an efficient way to separate them. I'll have to implement this in…

Julien M
- 657
- 3
- 10
- 40
2
votes
2 answers
Get object from bounding box [Object Detection]
I have a .txt file that contains path/to/image.jpg,xmin,ymin,xmax,ymax on each row and an img folder that contains the jpg images. What is the best way, using Python, to extract the "objects" inside the coordinates of each file and check if the…

Jinmare
- 31
- 4
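
The row format described in this question (path,xmin,ymin,xmax,ymax) can be sketched in plain Python as follows. This is only a minimal illustration under stated assumptions, not the asker's code or an accepted answer: the names `Box`, `parse_annotations` and `crop_region` are hypothetical, the coordinates are assumed to be pixel indices with xmax and ymax exclusive, and the image is assumed to be a row-major list of rows. With an array-backed image (for example one loaded with OpenCV), the same slice would be written `image[ymin:ymax, xmin:xmax]`.

```python
import csv
from typing import Iterable, List, NamedTuple


class Box(NamedTuple):
    """One annotation row: image path plus a pixel bounding box (hypothetical type)."""
    path: str
    xmin: int
    ymin: int
    xmax: int
    ymax: int


def parse_annotations(rows: Iterable[str]) -> List[Box]:
    """Parse 'path,xmin,ymin,xmax,ymax' rows, skipping rows without 5 fields."""
    boxes = []
    for fields in csv.reader(rows):
        if len(fields) != 5:
            continue  # blank or malformed row
        path, *coords = fields
        # Accept values such as "12" or "12.0"; truncate to whole pixels.
        xmin, ymin, xmax, ymax = (int(float(c)) for c in coords)
        boxes.append(Box(path.strip(), xmin, ymin, xmax, ymax))
    return boxes


def crop_region(pixels, box: Box):
    """Return the sub-grid inside the box from a row-major list of rows.

    Assumes xmax/ymax are exclusive, matching Python slice semantics.
    """
    return [row[box.xmin:box.xmax] for row in pixels[box.ymin:box.ymax]]
```

In practice the rows would come from iterating over the opened .txt file, and each `box.path` would be used to load the matching image before cropping.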
2
votes
1 answer
How to remove the adaptive average pool layer from VGG19?
I have loaded the pre-trained VGG19 model. How do I remove the adaptive average pool layer that sits before the classifier?

i_am_somebody
- 23
- 3
2
votes
0 answers
How to apply props/masks (like Snapchat) to a face using the live camera in Swift?
I am working on a Snapchat-like app where the user will be able to apply filters to their face or to multiple faces. Minimum iOS support is iOS 12; device support is iPhone 6 and above. Please let me know how to implement this.

SThakur
- 262
- 4
- 16