0

I have created my model POJO, I have to keep my columns in same order with same datatype when generating predictions using Hive UDF? what is the cleanest way to ignore extra columns and add the columns which are present in train data set but not in test data set, my all columns are either double or long.

Shubham
  • 1
  • 1

1 Answers1

1

If you use the Easy wrapper, it does this for you automatically.

If you are not using the Easy wrapper then you need to invent the same kind of behavior.

With the Easy wrapper, new columns are ignored and missing columns are treated as N/A.

TomKraljevic
  • 3,661
  • 11
  • 14