I'm having trouble understanding the Perspective-n-Point problem. A few questions:
What is
s
for? Why do we need a scale factor for the image point?Is
K[R|T]
a "change of coordinates matrix" which movesp_w
, the homogenous world point, into the coordinate space of the 2D image plane?- I understand that
[R|T]
represents the "rotation and translation" of the camera relative to the corresponding world pointp_w
and that is what we are trying to solve for. What's particularly difficult about this? Can't we just say[R|T] =inv(K)s(p_c)inv(p_w)
? I just did this with some basic matrix algebra. - I don't understand why PnP has multiple solutions... what are these multiple solutions exactly?
Thanks for any help!