I have training data as like following
col1 col2 col3 col4 col5 Target
187.67 448.41 45.7 880070.41 1 -3
95.44 446.08 70.51 909069.06 4 120
I need to build a model and test with following data,
col1 col2 col3
45 2989 12
3 1111 121
The test data has only three column. I am planning to build a model with all the 5 columns as feature columns of traini data set.Is it good to build model only with three columns of train data set and use only 3 columns of test data for prediction? or is it good to build model with 5 columns of train dataset and preprocess(impute) col4 and col5 of test and then run a prediction?We have felt like col4 and and col5 are important. Please suggest the methodology to handle this?