Questions tagged [vision]

Methods to acquire, analyze and understand images using mathematical approaches.

664 questions
2
votes
1 answer

Running out of memory while fetching images from Photos SwiftUI

I am working on an app that can detect dog breeds, and I would like to make it when users pull up their photo library, they only see images with a dog. To do this, I used the Vision pet recognizer model which is very basic, the issue is that it…
2
votes
1 answer

Pytorch Global Pruning is not reducing the size of the model

I am trying to Prune my Deep Learning model via Global Pruning. The original UnPruned model is about 77.5 MB. However after pruning, when I am saving the model, the size of the model is the same as the original. Can anyone help me with this…
2
votes
1 answer

Vision framework to detect face looking up/down

i’m using Vision framework to detect face orientation. VNFaceObservation has roll and yaw properties but unfortunately no pitch. How can i calculate pitch value? i need to check if a person looking up or down. Someone in another forums suggested me…
2
votes
2 answers

EXC_BAD_ACCESS on VNSequenceRequestHandler

The following code uses the Vision and AVFoundation frameworks to enable face tracking on the built-in camera on macOS. In some circumstances the code crashes due to EXC_BAD_ACCESS (code=2) on a working thread on the queue…
2
votes
3 answers

Interfacing LabVIEW with python for triggering USB Camera

LabVIEW supports pallete for calling function from python. Do I need to develop 3 seperate functions each for initializing, capturing and closing? Using OpenCV in python for accessing directshow based usb cameras. If I create 3 seperate functions,…
Amlo4300
  • 21
  • 2
2
votes
1 answer

How can I thin or skeletonize an object without the objects corners being divergent?

I have the following image, which represents a path. I want to apply an algorithm called HoughLines(P) on it. This algorithm should detect lines and coordinates. But first I need to make the path thinner for it to be valid input for…
Dirk
  • 95
  • 1
  • 8
2
votes
0 answers

Could not cast TextBlock to Element - Android.gms.Vision

Code crashes at selected line. Reason: Cannot cast TextBlock to Element. One would think that the SparseArray(items) is of type 'Element' because of previous declarations.(language: Kotlin)
Jomme
  • 1,256
  • 4
  • 14
  • 26
2
votes
1 answer

Cannot get docker image of gcr.io/automl-vision-ondevice/gcloud-container-1.14.0:latest

I got an error when I was trying to download the docker image from gcr.io/automl-vision-ondevice/gcloud-container-1.14.0:latest But I got an error of Access Denied. How can I solve this issue? Thanks a lot. $ export…
2
votes
0 answers

iOS Vision framework text detection does not detect single digit numbers?

Using text detection inside iOS vision framework struggles to detect single digit numbers, it seems to detect certain digits fine but with others it will fail such as the number 1, 2 or 3 etc. I am just trying to highlight/extract the numbers in…
Luke Ireton
  • 479
  • 1
  • 3
  • 18
2
votes
0 answers

Flutter OCR - Google Cloud Vision API

I have a problem implementing Google Cloud Vision API into my flutter code. I need to have an OCR functionality and the language I’m targeting is only available through this API. Any help?
Koalaz
  • 21
  • 2
2
votes
0 answers

Opencv can't find the contours of an aluminium plate

I'm trying to extract this mold from this picture. I've got some code to extract it, the only thing I need are the four corners. def detect_object(image): orig = image.copy() image = cv2.cvtColor(image, cv2.COLOR_BGR2GRAY) blurry =…
2
votes
1 answer

Separate crossings segments in binarised image

I have some image processing that allows me to extract a binary image containing thick segments and i'm facing the issue that these segments may cross each other. Hence I need to find an efficient way to separate them, i'll have to implement this in…
Julien M
  • 657
  • 3
  • 10
  • 40
2
votes
2 answers

Get object from bounding box [Object Detection]

I have a .txt file that contains path/to/image.jpg,xmin,ymin,xmax,ymax for each row and a img folder that contains the jpg images. What is the best way using python to extract the "objects" inside the coordinates of each file and look if the…
Jinmare
  • 31
  • 4
2
votes
1 answer

How to remove the adaptive average pool layer from vgg19?

I have loaded the pre-trained model of vgg19. How to remove the adaptive average pool layer which is present before the classifier?
2
votes
0 answers

How to apply props/ mask(like snapchat) on face using real camera in swift?

I am working on a snapchat like app where user will be able to apply filters on his face or multiple faces. Minimum iOs support is iOS 12 , device support : iPhone 6 & above. Please let me know how to implement this.
SThakur
  • 262
  • 4
  • 16