Questions tagged [pose-estimation]

Pose estimation is the task of determining the position and orientation of an object in an image relative to the camera coordinate frame.

The pose of a camera is also known as the camera's extrinsic parameters. It is a 3x4 matrix whose first 3 columns are the rotation (orientation) and whose last column is the translation (position).

Camera Pose matrix. Rotation and Translation
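
As a rough illustration of the 3x4 layout described above, here is a minimal sketch in plain Python. It assumes the common convention that the matrix [R | t] maps a 3D point from object coordinates into camera coordinates, and it uses a rotation about the z axis only to keep the example short. The helper names `make_pose` and `transform` are hypothetical and not part of any library.

```python
import math


def make_pose(yaw_rad, t):
    """Build a 3x4 pose matrix [R | t] for a rotation of yaw_rad about the z axis.

    The first three columns hold the rotation, the last column the translation.
    """
    c, s = math.cos(yaw_rad), math.sin(yaw_rad)
    rotation = [
        [c, -s, 0.0],
        [s, c, 0.0],
        [0.0, 0.0, 1.0],
    ]
    # Append the translation component to each rotation row.
    return [rotation[i] + [t[i]] for i in range(3)]


def transform(pose, p):
    """Apply the pose to a 3D point p, i.e. compute R @ p + t."""
    return [
        sum(pose[i][j] * p[j] for j in range(3)) + pose[i][3]
        for i in range(3)
    ]


# Assumed example values: a 90 degree yaw and a translation of (1, 2, 3).
pose = make_pose(math.pi / 2, [1.0, 2.0, 3.0])
point_cam = transform(pose, [1.0, 0.0, 0.0])
```

With OpenCV, the rotation part is typically obtained from the rotation vector returned by solvePnP via cv2.Rodrigues, and the translation vector fills the last column in the same way.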

291 questions
1
vote
0 answers

Finding Pitch, Yaw and Roll Angles using Mediapipe and Opencv-Python

I want to find the human face's yaw, pitch, and roll angles in a given image. Based on my understanding, I need to follow the steps below: use mediapipe to find the face landmarks; use the Opencv-Python solvePnP function to produce a rotation vector; pass…
1
vote
2 answers

AttributeError: module 'cv2.aruco' has no attribute 'drawFrameAxes'

I am working with VSCode 1.68.1 on Ubuntu 20.04. I am following this link (https://programming.vip/docs/3d-pose-estimation-using-aruco-tag-in-python.html) to achieve pose estimation for an aruco marker, but I am getting the error below: aruco.drawFrameAxes(dst1,…
Manpreet
  • 43
  • 1
  • 6
1
vote
1 answer

Transform value 3D coordinate pose landmarks (Mediapipe) to real world value in meters?

I already print results.pose_landmarks.landmark[mp_pose.PoseLandmark.LEFT_WRIST].z, but I am not sure what this number represents or how I can transform it to meters. Can someone help me? Here is part of my code to retrieve the z coordinate of the…
1
vote
1 answer

How to calculate distance from hips to hands in Mediapipe/Blazepose (z coordinates)?

I am working on a project where I detect poses from a stick figure, and I am trying to extract the z coordinates from the landmarks via mediapipe/blazepose. The next step would be to calculate the distance between the z-coordinate from the hips to the…
Amy de Lange
  • 21
  • 1
  • 2
1
vote
1 answer

Corner detection: getting rid of unwanted corners

I'd like to find the corners of the following box. However, as you can see, I'm detecting a lot of corners I don't want to find. I'm completely stuck on this one. No matter what I try, I always seem to find corners in the dots on the box. I used the…
Schweini14
  • 23
  • 5
1
vote
0 answers

COCO API evaluation for subsets of key points classes

I'm using the Python COCO API to run evaluation for key points estimation. I have two files, a ground truth json and a results json. My goal is to evaluate KP estimation performance for ALL KP classes. For example, for category 'person', I want…
Alex Goft
  • 1,114
  • 1
  • 11
  • 23
1
vote
0 answers

Combine single wavefront .obj objects into a clip sequence

Given many .obj files that represent the 3D pose estimation for each frame of a clip, how to combine the .obj files to form an animation? After performing 3D Pose Estimation on a single human's dance movements, I exported every frame as a .obj file.…
Bendemann
  • 735
  • 11
  • 31
1
vote
0 answers

Determine camera coordinates using image frame coordinates for planar pose estimation

I am new here and am very thankful to be part of this awesome community. I am currently working on an object detection and planar localization project with the 6DOF robot UR10e. I have already detected the object using a Mask R-CNN approach, got the…
Amin
  • 11
  • 2
1
vote
0 answers

Aruco Cube Pose Estimation

I want to detect the pose of the cube (see image). I have the coordinates of the centers of all markers, and I want to draw axes at the origin or an outline of the whole cube using 2D detection points and 3D object points. I am using opencv and I tried to draw…
1
vote
2 answers

How to run pose estimation inference on video instead of webcam with Teachable Machine in JavaScript?

I am attempting to run a pose classification model that I trained on a video file, but I can't seem to figure out how to get it working with anything other than the documentation they provide for running inference on webcam input. I am trying to run…
1
vote
1 answer

How to set input to the LSTM (pose recognition through videos) if my videos have a variable number of frames?

I have 2 poses to classify. For each pose I have 60 video samples. The problem is that the total number of frames in each video is different, so the input to the LSTM will be uneven. Is there any way to solve this? Or do we need the…
Lakpa Tamang
  • 408
  • 1
  • 4
  • 12
1
vote
1 answer

Converting an OpenCV program into a Module reduces frame rate drastically

I wrote code for pose estimation using OpenCV and the mediapipe library. The program was working well and I was getting around 30-35 fps. When I tried to convert the same program to a module so that I can use it easily in future for different…
Falcon
  • 73
  • 6
1
vote
0 answers

How do I get the pose velocity and acceleration via the poses (rotation matrix and translation vector) obtained from solvePnP()?

I am currently getting the pose of an apriltag object via solvePnP() and projecting the points with projectPoints(). This is called on a videoStream, so to try to optimise solvePnP(), I am trying to get the previous pose (pose of object in the…
Zoe
  • 31
  • 5
1
vote
0 answers

How do I get the 3D points of a camera with known initial extrinsics?

I am working on estimating the pose of an object with apriltags attached to it. I have initially done this successfully for an apriltag board: The 3D points were found using the tag radius (tag_size/2), as shown in the code: ob_pt1 = [-tag_size/2,…
1
vote
0 answers

How can I estimate the pose of an object with tags attached, and always display the entire object, with known rotation and translation vectors?

I have been able to detect multiple apriltags on a board of apriltags like this: This was done using the AprilTag library: https://github.com/swatbotics/apriltag. I used OpenCV solvePnP() and projectPoints() to get the rotation and translation…
Zoe
  • 31
  • 5