I am new to computer vision, can anyone tell me the steps to do object detection in natural image? (Here object refers to logo). I drafted the following steps based on my own understanding:
Problem statement: Suppose there are 20 reference logos, given an natural image, tell out which logo exists in the image and in which location (bounding box).
Step1: Collect many (i.e. 100) images containing corresponding logos, and crop out the logo region. Hence, there are 100 examples for each logo. The purpose of this step is to deal with logos under different conditions, such as illumination, rotations etc.
Step2: Collect random images that don't contain any logos.
Step3: Extract features for example logos and random images, use SIFT feature.
Step4: Now, the problem becomes a multi-class classification problem. There are 21 classes, 20 classes corresponds to 20 logos, and 1 class corresponds random images.
Question1: use which classifier? what is input and what is output?
Step5: Given a test image, extract SIFT features, use all the features as input?
Question2: For the test image, use what as input and how to do the classification to tell out whether it contains a logo or not, and which logo it is?
Question3: How to determine the location of the detected logo?
Question4: Any image labeling or cropping tool?
If my procedure is not correct, please tell me how to do this step by step. Thanks in advance!!