
I am aware of the chessboard camera calibration technique, and have implemented it.

If I have 2 cameras viewing the same scene, and I calibrate both simultaneously with the chessboard technique, can I compute the rotation matrix and translation vector between them? How?

xboxmysox

4 Answers


If you have the 3D coordinates of the corresponding points in each camera's frame, you can compute the optimal rotation matrix and translation vector by fitting a rigid body transformation.
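A minimal sketch of such a fit, assuming OpenCV and two equal-length vectors of corresponding 3D points (this is the SVD-based method often called the Kabsch/Umeyama algorithm; the function name rigidTransform and its interface are mine, not from the answer):

    #include <opencv2/core/core.hpp>
    #include <vector>

    // Finds R, t minimising sum ||R * src[i] + t - dst[i]||^2 over all i.
    void rigidTransform(const std::vector<cv::Point3d>& src,
                        const std::vector<cv::Point3d>& dst,
                        cv::Mat& R, cv::Mat& t)
    {
        // Centroids of both point sets.
        cv::Point3d cs(0, 0, 0), cd(0, 0, 0);
        for (size_t i = 0; i < src.size(); ++i) { cs += src[i]; cd += dst[i]; }
        cs *= 1.0 / src.size();
        cd *= 1.0 / dst.size();

        // Cross-covariance of the centred point sets.
        cv::Mat H = cv::Mat::zeros(3, 3, CV_64F);
        for (size_t i = 0; i < src.size(); ++i) {
            cv::Mat a = (cv::Mat_<double>(3, 1) << src[i].x - cs.x,
                         src[i].y - cs.y, src[i].z - cs.z);
            cv::Mat b = (cv::Mat_<double>(3, 1) << dst[i].x - cd.x,
                         dst[i].y - cd.y, dst[i].z - cd.z);
            H += b * a.t();
        }

        // Optimal rotation from the SVD of H, with a reflection guard.
        cv::SVD svd(H);
        R = svd.u * svd.vt;
        if (cv::determinant(R) < 0) {
            cv::Mat u = svd.u.clone();
            u.col(2) *= -1;  // flip the axis of the smallest singular value
            R = u * svd.vt;
        }

        // Translation aligns the rotated source centroid with the target's.
        cv::Mat csM = (cv::Mat_<double>(3, 1) << cs.x, cs.y, cs.z);
        cv::Mat cdM = (cv::Mat_<double>(3, 1) << cd.x, cd.y, cd.z);
        t = cdM - R * csM;
    }

Getting the corresponding 3D points in the first place is the hard part; the chessboard corners, measured or triangulated in each camera's frame, are one option.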

Harsha

If you are already using OpenCV, why not use cv::stereoCalibrate?

It returns the rotation matrix and translation vector between the cameras. The only thing you have to do is make sure that the calibration chessboard is seen by both cameras.

The exact procedure is shown in the .cpp samples provided with the OpenCV library (I have version 2.2, and the samples were installed by default in /usr/local/share/opencv/samples).

The sample is called stereo_calib.cpp. It doesn't explain clearly what the code is doing (for that you might want to look at "Learning OpenCV"), but it's something you can build on.
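For reference, a minimal sketch of the call (signature as in the 2.x releases mentioned above; in OpenCV 3+ the flags and termination criteria arguments trade places). The wrapper function calibratePair and its interface are mine, not from the sample:

    #include <opencv2/core/core.hpp>
    #include <opencv2/calib3d/calib3d.hpp>
    #include <vector>

    // objectPoints: the board corners' 3D coordinates (z = 0 plane), one
    //               vector per view of the chessboard;
    // imagePoints1/2: the corners detected in camera 1 and camera 2 for the
    //               same views;
    // cameraMatrix*/distCoeffs*: intrinsics from the per-camera chessboard
    //               calibration, kept fixed here.
    void calibratePair(const std::vector<std::vector<cv::Point3f> >& objectPoints,
                       const std::vector<std::vector<cv::Point2f> >& imagePoints1,
                       const std::vector<std::vector<cv::Point2f> >& imagePoints2,
                       cv::Mat& cameraMatrix1, cv::Mat& distCoeffs1,
                       cv::Mat& cameraMatrix2, cv::Mat& distCoeffs2,
                       cv::Size imageSize, cv::Mat& R, cv::Mat& T)
    {
        cv::Mat E, F;  // essential and fundamental matrices, also returned
        double rms = cv::stereoCalibrate(
            objectPoints, imagePoints1, imagePoints2,
            cameraMatrix1, distCoeffs1, cameraMatrix2, distCoeffs2,
            imageSize, R, T, E, F,
            cv::TermCriteria(cv::TermCriteria::COUNT + cv::TermCriteria::EPS,
                             100, 1e-5),
            cv::CALIB_FIX_INTRINSIC);
        // R and T map a point from camera 1's frame into camera 2's frame;
        // rms is the reprojection error and is worth checking in real code.
        (void)rms;
    }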

morynicz
  • This is the way to go, since here we clearly have a sort of stereo camera system. Even if there were only a single camera that was translated and rotated to a different location, we could still emulate such a system. There are of course other ways to calculate the trajectory of a camera, such as measuring camera translation by the dominant apical angle (see the paper of the same name). – rbaleksandar Apr 21 '14 at 09:21

If I understood you correctly, you have two calibrated cameras observing a common scene, and you wish to recover their spatial arrangement. This is possible (provided you find enough image correspondences), but only up to an unknown factor on translation scale. That is, we can recover the rotation (3 degrees of freedom, DOF) but only the direction of the translation (2 DOF). This is because we have no way to tell whether the projected scene is big and the cameras are far apart, or the scene is small and the cameras are close. In the literature, the 5-DOF arrangement is termed relative pose or relative orientation (Google is your friend). If your measurements are accurate and in general position, 6 point correspondences may be enough for recovering a unique solution. A relatively recent algorithm does exactly that:

Nistér, D., "An efficient solution to the five-point relative pose problem," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 26, no. 6, pp. 756–770, June 2004. doi: 10.1109/TPAMI.2004.17
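For what it's worth, later OpenCV releases (3.0 and up) ship an implementation of Nistér's five-point solver, so a hedged sketch of recovering this 5-DOF relative pose from matched pixel coordinates pts1/pts2 and the intrinsic matrix K from the chessboard calibration could look like this (the wrapper name relativePose is mine):

    #include <opencv2/calib3d.hpp>
    #include <vector>

    cv::Mat relativePose(const std::vector<cv::Point2f>& pts1,
                         const std::vector<cv::Point2f>& pts2,
                         const cv::Mat& K, cv::Mat& R, cv::Mat& t)
    {
        // Five-point algorithm inside a RANSAC loop to reject bad matches.
        cv::Mat inliers;
        cv::Mat E = cv::findEssentialMat(pts1, pts2, K, cv::RANSAC,
                                         0.999, 1.0, inliers);
        // Of the four (R, t) decompositions of E, recoverPose keeps the one
        // that puts the points in front of both cameras; t comes back with
        // unit norm, which is exactly the scale ambiguity described above.
        cv::recoverPose(E, pts1, pts2, K, R, t, inliers);
        return E;
    }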

ssegvic
  • Just chiming in, but this seems strange to me. If you're able to calibrate via the chessboard, as stated in the question, shouldn't you be able to get the exact distance between the cameras? How would this change if you had the corresponding depth of a point? Say, if you were working with the Kinect? Very interested. – Chris Apr 07 '11 at 14:22
  • Perhaps the misunderstanding is due to my assuming that the chessboard is visible only during the camera calibration procedure and not during estimation of the relative orientation. Of course, the scale ambiguity can be resolved if you observe objects of known size or know the exact depth of any pixel. The Kinect is different because there the relative pose between the camera and the projector is precalibrated at the factory. – ssegvic Apr 08 '11 at 10:36

Update:

Use a structure from motion/bundle adjustment package like Bundler to solve simultaneously for the 3D location of the scene and relative camera parameters.

Any such package requires several inputs:

  1. the camera calibrations that you have.
  2. 2D pixel locations of points of interest in each camera (use an interest point detector such as Harris or DoG, the first stage of SIFT).
  3. correspondences between the points of interest from each camera (use a descriptor such as SIFT, SURF, or SSD to do the matching); a sketch of steps 2 and 3 follows below.

Note that the solution is determined only up to a scale ambiguity. You'll thus need to supply a distance measurement, either between the cameras or between a pair of objects in the scene.
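A rough sketch of steps 2 and 3, assuming OpenCV's features2d module (ORB stands in for SIFT/SURF here because it ships in default builds; substitute SIFT where your build provides it):

    #include <opencv2/core.hpp>
    #include <opencv2/features2d.hpp>
    #include <vector>

    // Detects interest points in both images, describes them, and keeps
    // matches that pass Lowe's ratio test.
    void matchViews(const cv::Mat& img1, const cv::Mat& img2,
                    std::vector<cv::Point2f>& pts1,
                    std::vector<cv::Point2f>& pts2)
    {
        cv::Ptr<cv::ORB> orb = cv::ORB::create(2000);
        std::vector<cv::KeyPoint> kp1, kp2;
        cv::Mat desc1, desc2;
        orb->detectAndCompute(img1, cv::noArray(), kp1, desc1);
        orb->detectAndCompute(img2, cv::noArray(), kp2, desc2);

        // Brute-force Hamming matching, two nearest neighbours per query.
        cv::BFMatcher matcher(cv::NORM_HAMMING);
        std::vector<std::vector<cv::DMatch> > knn;
        matcher.knnMatch(desc1, desc2, knn, 2);
        for (size_t i = 0; i < knn.size(); ++i) {
            if (knn[i].size() == 2 &&
                knn[i][0].distance < 0.75f * knn[i][1].distance) {
                pts1.push_back(kp1[knn[i][0].queryIdx].pt);
                pts2.push_back(kp2[knn[i][0].trainIdx].pt);
            }
        }
    }

The matched points then go to the bundle adjustment package (or to the essential matrix estimation sketched in the answer above).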

Original answer (applies primarily to uncalibrated cameras as the comments kindly point out):

This camera calibration toolbox from Caltech can solve for and visualize both the intrinsics (lens parameters, etc.) and the extrinsics (where each camera was positioned when each photo was taken). The latter is what you're interested in.

The Hartley and Zisserman blue book is also a great reference. In particular, you may want to look at the chapter on epipolar lines and the fundamental matrix, which is available for free online at the link.

peakxu
  • Note that Hartley and Zisserman's book, while an excellent source, mainly deals with the geometry of uncalibrated cameras, which is not so relevant to the scope of the original question. – ssegvic Apr 07 '11 at 12:59