Heyyy,,, Im going to do classification. And for the descriptor I planning to use the HOG and SIFT descriptors from LOWE..
1. For HOG, is that true that we need to compute the gradient of all image's pixels??
For example we have image with size 10x10 pixels. And we compute the HOG to seek the orientation
for every pixels of image. And eventually we will get 100 orientations and generate histogram
(represent all of those image's pixels orientation). This Histogram is going to be used for the
classification??
And to get the scale invariant we need to perform this descriptor of different size (scale)
images??
SIFT is Scale Invariant Features Transform. So it's scale and rotation invariant.
I read from here that in SIFT we need to do smoothing using Gaussian to make our image's
resolution become low..
2. Why we have to do that?
And for scale invariant features, how to obtain that in SIFT?? Do we need to rescale our image
in every octave and then apply Gaussian filter in this new scaled images?? Or it's enough to
get scale invariant only in 1 octave by applying 3 times gaussian filter??
How about the histogram, Is it same with HOG that we have to compute all the pixels???
Thankss