I am working on my implementation of the ORB descriptor. I went trough the paper but I am finding hard to understand how to select the training set that is used on their learning method to select a good subset of binary tests.
If I have a single image with very few keypoints shall I use as training set all the patches of the keypoints at hand or only the patches of the keypoint I want to describe?