I am trying to classify a large number of words into 5 categories. Examples of classes and strings for each class include:
invoice-Number : "inv123","in12","123"
invoice-Date : "22/09/1994","22-Mon-16"
vendor-Name : "samplevendorname"
email : "abc@gmail.com"
net-amount : "1234.56"
Any pointers to achieve this in python is very much appreciated.
EDIT 1: I'm looking for a machine learning approach as the number of classes will be more and the data in each class will be different so regex is not feasible.