I recently downloaded Deeplearning for Java (DL4J) and am now experimenting a bit with convolutional nets. I found some working samples on the homepage and the internet, for example how to classify images, i.e. recognizing faces. I understand roughly how the training data is read and that the images are labeled with the correct label the net should recognize ("Barack Obama - Picture 1"). However, from here on I am a little bit stuck. I cannot make sense of the output. Like described on http://deeplearning4j.org/image-data-pipeline we use an ImageRecordReader to read in the images and create training data out of it. But how does the program then for example know to classify one image as "barack obama" and not "barack obama - sample picture 1"? Or does the net do that? I don't think so. My next problem then is to change the application to not merely recognize an object but to evaulate an image, kind of like in AlphaGo evaluating a board position represented as an image. How would I input the data then? I could label for example training board states with their score ... but I do not know if that is good at all. I hope this was understandable, help and minimal samples would be greatly appreciated!
Thanks and have a good day Oliver