I have a multi-column data set as follows:
Id   Summary       Component     Description   Labels          Action
id1  free-text-11  free-text-12  free-text-13  label1, label2  action1
id2  free-text-11  free-text-22  free-text-23  label2, label3  action2
...  and so on
Here the Summary, Component, and Description columns contain user-provided free text in English. The Labels and Action columns contain fixed, system-defined values. My task is to train a model in Java that predicts the Action value from the other columns (Summary, Component, Description, and Labels), some of which are optional and may be missing.
As a total newbie, I tried LDA with MALLET, but all of the examples I found handle only a single free-text input column, and I am also not sure which algorithm would best fit my use case. How can I solve this problem in Java? Any help would be appreciated.
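To make the single-input limitation concrete, here is a minimal sketch of one possible workaround: merging all input columns of a row into one text document, so that an example written for a single free-text column could still be fed every column. The class name `RowToDocument`, the `Row` record, and the `summary_` / `label=` token prefixes are hypothetical names chosen for illustration, and this only prepares the input; it does not train anything or call any library.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

class RowToDocument {

    /** One row of the data set; any input column may be null or blank (optional). */
    record Row(String id, String summary, String component,
               String description, List<String> labels, String action) {}

    /** Merges all input columns into one whitespace-separated token string. */
    static String toDocument(Row row) {
        List<String> tokens = new ArrayList<>();
        addText(tokens, "summary", row.summary());
        addText(tokens, "component", row.component());
        addText(tokens, "description", row.description());
        if (row.labels() != null) {
            for (String label : row.labels()) {
                // Labels come from a fixed vocabulary, so each stays one token.
                if (label != null && !label.isBlank()) {
                    tokens.add("label=" + label.trim());
                }
            }
        }
        return String.join(" ", tokens);
    }

    /** Lowercases free text, splits on whitespace, and prefixes each word with its column. */
    private static void addText(List<String> tokens, String column, String text) {
        if (text == null || text.isBlank()) {
            return; // optional column: contributes no tokens
        }
        for (String word : text.trim().toLowerCase(Locale.ROOT).split("\\s+")) {
            tokens.add(column + "_" + word);
        }
    }

    public static void main(String[] args) {
        // First row of the sample data above; the Action column is the target, not an input.
        Row row = new Row("id1", "free-text-11", "free-text-12", "free-text-13",
                List.of("label1", "label2"), "action1");
        System.out.println(toDocument(row));
        // prints: summary_free-text-11 component_free-text-12 description_free-text-13 label=label1 label=label2
    }
}
```

The column prefix is an assumption that keeps the same word in Summary and in Description as two distinct features; dropping the prefix would pool them instead. The Action value would be kept separately as the class label for whichever classifier is used.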