I have data like the following:
col1 col2 col3
2 14 text, text, some text
I went through http://scikit-learn.org/stable/modules/preprocessing.html#preprocessing but I could only find information on vectorizing col3 and passing it on for classification. In my scenario, I also have numerical information in col1 and col2.
If I pass col1, col2 and col3 to the classifier without vectorizing, I get an error for col3 because it is a string.
If I vectorize col3, the output is a sparse matrix. I need to add col1 and col2 to the vectorized data as extra feature columns. How do I do that?
I am using scikit-learn.
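To make the goal concrete, here is a minimal sketch of the kind of result I am after. It assumes that stacking the columns side by side with `scipy.sparse.hstack` is a reasonable approach, which I have not confirmed is the recommended one. The rows, the labels `y`, and the choice of `CountVectorizer` and `LogisticRegression` are made up for illustration.

```python
import numpy as np
from scipy.sparse import csr_matrix, hstack
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.linear_model import LogisticRegression

# Made-up rows shaped like the sample: two numeric columns and one text column.
col1 = [2, 5, 1, 7]
col2 = [14, 3, 9, 20]
col3 = [
    "text, text, some text",
    "other words here",
    "some text again",
    "more other words",
]
y = [0, 1, 0, 1]  # hypothetical class labels

# Vectorize the text column; the result is a sparse matrix (n_rows x n_terms).
vectorizer = CountVectorizer()
X_text = vectorizer.fit_transform(col3)

# Put the numeric columns into a sparse matrix (n_rows x 2).
X_num = csr_matrix(np.column_stack([col1, col2]).astype(float))

# Stack horizontally: numeric columns first, then the text features.
X = hstack([X_num, X_text]).tocsr()

# The combined matrix can then be passed to a classifier that accepts sparse input.
clf = LogisticRegression().fit(X, y)
```

In this sketch the first two columns of `X` hold col1 and col2, and the remaining columns hold the term counts from col3.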