In a pickle...
I have a dataset with >100,000 observations; datasets' columns include CustomerID, VendorID, ProductID and CatNMap. Here is what it looks like:
As you can see values represented in first 3 columns (CustomerID, VendorID, ProductID) represent unique numerical mapped values and would make no sense if represented on x,y plane (which eliminates use of a lot of Classification methods); last column has strings with categories assigned by customers. Now, here is the part that I do not understand and not sure how to approach...
Goal: is to predict CatNMap values in the future for customers, however as I see it the features I have here are not useful, is that true? Now if they are, what method can I use as CatNMap column has >7,000 unique values; also, how would any method deal with categorizing future items if let's say for the same product there are 2 or more different categories assigned by different customers? Do I need to Implement NN for this one?
All answers are appreciated!