Situation: We are training a Neural Network that distinguishes between opened and closed eyes. While collecting the data we didn't simply collect opened and closed eyes but we also collected data for edgecases like eyes covered by the hand or arm and looking down very strongly what covers most of the eyes. However, we put all opened and all closed eyes together and trained a binary classification Neural Network, because that's all we care about during inference.
Question: We were wondering if it made sense to train a multiclass network. For example we would then have the classes opend, closed, covered by hand, no eye at all, looking down.. and combine all classes besides from closed to one class. We think that the model may be able to understand better the real world if we distinguish between all these cases but I want to emphazise that we don't need the information if the eye is covered by a hand - we only need to know if the eye is closed or not.
I tried to find research done on this topic but without success.