I am currently working on Regression problem statement wherein we have around 13 Numerical Features and 38 Categorical Feature and we are required to predict the Target feature (Discreate Numerical Feature).
Is there a method to determine which categorical feature (out of all 38) is best for my target feature? Or should I perform Label encoding on all categorical features, check for skewness of data, perform Factor analysis?
My dataset has (6403320, 51) Rows and Columns.
Let me know what's the best approach for dataset which has huge categorical features?