I am trying to train a learning model to recognize which part (left/central/right) of a known object is represented in an image, assuming that the model's input can be one of the following: an image of the left part of the object (whole left part or some smaller portion of the left part); an image of its central part (whole central part or some smaller portion of the central part); an image of its right part (whole right part or some smaller portion of the right part). The position of the object is always fixed so all images are taken in front of the object and this will be also the case in which the model will be asked to make a prediction. I've collected a few thousands images belonging to the left, central and right part of the object; for each part, as explained, some images represent the whole part while others represent a smaller portion of that part, anyway i'm just interested to predict which part of the object (left/central/right) the image belongs to, so i've tackled this as a classification task over 3 labels but I'm wondering if the best choice for this task is to use a convolutional neural network or some other approach.
Any suggestion is very appreciated. Thanks