Questions tagged [apple-vision]

Apple Vision is a high-level computer vision framework to identify faces, detect and track features, and classify images, video, tabular data, audio and motion sensors data.

Apple Vision framework performs face and face landmark detection on input images and video, barcode recognition, image registration, text detection, and, of course, feature tracking. Vision API allows the use of custom CoreML models for tasks like classification or object detection.

205 questions
3
votes
1 answer

Bounding Box from VNDetectRectangleRequest is not correct size when used as child VC

I am trying to use VNDetectRectangleRequest from Apple's Vision framework to automatically grab a picture of a card. However when I convert the points to draw the rectangle, it is misshapen and does not follow the rectangle is it should. I have been…
user
  • 105
  • 1
  • 11
3
votes
0 answers

Add a Snap like effect to a Live Video based on tracking the users face in real time in Swift

I desire to add a Snap like effect to a Live Video based on tracking the users face in real time. My design would like to place streams of particles coming from they eyebrows, eyes, or lips. I already have a flexible effects library that can…
user3408691
  • 93
  • 1
  • 1
  • 7
3
votes
1 answer

How to use "layerRectConverted(fromMetadataOutputRect:)" method?

I am working with Apple's Vision framework for detecting faces. Then I come to this function. let cgr: CGRect = CGRect(x: 0, y: 0, width: 1, height: 1) let videoPreviewRect = previewLayer.layerRectConverted(fromMetadataOutputRect: cgr) where…
Asif Mujtaba
  • 447
  • 6
  • 17
3
votes
3 answers

Facing issue of Real face detection in Vision Framework

I have faced the issue of real face detection using Vision Framework. I have referred below apple link. https://developer.apple.com/documentation/vision/tracking_the_user_s_face_in_real_time I used demo code provided in above link. I see, Camera…
Dipang
  • 1,111
  • 12
  • 12
3
votes
1 answer

How do I extract specific text from an image using a UITextField in Swift?

I am using the Vision framework and I want to be able to use a UITextField to find a specific word in a picture. For example let's say I type in the word black in the text field and I want it to detect that in the picture I have. How would I do…
xcode22
  • 118
  • 1
  • 9
3
votes
0 answers

Detect text from an image and get the rect of an keyword

I'm trying to get all the text found in an UIImage using VisionKit and get the location of a keyword (if it exists) in the image. So far I've got this: var detectedText = "" var textRecognitionRequest =…
perteadi
  • 13
  • 13
3
votes
1 answer

Frame information in completion handler for text detection in ARSession

I am using Core Vision to detect text boxes in an ARKit session, my problem is accessing the frame to perform a hit test once I have detected the boxes. func startTextDetection() { let textRequest =…
Daniele Bernardini
  • 1,516
  • 1
  • 12
  • 29
3
votes
1 answer

How to apply iOS VNImageHomographicAlignmentObservation warpTransform?

I'm testing Apple's Vision Alignment API and have questions regarding VNHomographicImageRegistrationRequest. Has anyone got it to work? I can get the warpTransform out of it, but I've yet to see a matrix that makes sense, meaning, I'm unable to…
3
votes
1 answer

Object detection ARKit vs CoreML

I am building ARKit application for iPhone. I need to detect specific perfume bottle and display content depending on what is detected. I used demo app from developer.apple.com to scan real world object and export .arobject file which I can use in…
3
votes
1 answer

Merge images using "VNImageHomographicAlignmentObservation" class

I am trying to merge two images using VNImageHomographicAlignmentObservation, I am currently getting a 3d matrix that looks like this: simd_float3x3([ [0.99229, -0.00451023, -4.32607e-07)], [0.00431724,0.993118, 2.38839e-07)], …
user9135334
3
votes
1 answer

Filter Pixels in Image by Color Swift

My project is using Vision to identify a certain rectangle in the field of view. I know this rectangle will be of a certain color. In OpenCV, there is a filter that allows you to filter pixels out by color, as seen here. I am new to the CoreImage…
technology08
  • 131
  • 8
3
votes
1 answer

VNDetectFaceRectanglesRequest is not calling completionHandler

I'm trying to detect face. It can not detect because handleFaces is not called. I would like to know why is it not called and how I can solve if I would like to detect face. faceDetectionRequest = VNDetectFaceRectanglesRequest(completionHandler:…
ying pod
  • 31
  • 4
3
votes
1 answer

VNImageBasedRequest regionOfInterest not within normalized bounds

I am attempting to access specific sections of an image to perform a VNCoreMLRequest on user defined areas of the image. I am getting errors for certain regions of interest while others are just fine. Each region is normalized. The following two…
ɯɐɹʞ
  • 1,040
  • 11
  • 17
3
votes
0 answers

Face cropping along the face edge

Before anyone rates this question negative, let me clarify that I have tried all the resources possible in last two weeks before posting this query. And also I've never done this stuff previously and I don't have someone to guide me anyhow currently…
Ashish Sharma
  • 663
  • 5
  • 15
3
votes
0 answers

VNTrackObjectRequest not tracking face using VNFaceObservation in ARSCNView

I am trying to track face using VNFaceObservation in ARSCNView let response = observations.map({ (face) -> (observation: VNFaceObservation, image: CIImage, frame: ARFrame) in self.lastObservation =…
Ros
  • 144
  • 8