I have about 300.000 images - all categorized manually as "clip art" or "photo". For each image I can calculate three independent, numerical features that give a clear hint about whether the image is indeed a clip art or a photo. None of these numbers alone is enough for auto-categorizing new images reliably. Used in combination, however, auto-categorizing should be pretty accurate.
I can manually fiddle around and test hundreds of images and observe data. Thus, I can empirically find more or less suitable weighing factors or something alike. But I do have 300.000 properly categorized data sets ... I should be able to use this data to categorize new images pretty reliably. But how? I don't even know the proper terms to Google for an answer: is it "self learning" or a "neural network" or "artificial intelligence" that I'm looking for? How do I start in Python to solve this?