I'm performing a logistic regression with a large number of explanatory variables (400 in this example). I can easily reference all 400 variables in the model statement using the code below, but is there also an easy way to generate all two-way interaction terms (i.e. every pair of variables)?
proc logistic data = d1;
model y = var1-var400 / rsquare;
run;
I've seen code like this:
proc logistic data = d1;
model y = var1 | var2 | var3... @2 / rsquare;
run;
but writing that out by hand is not realistic for 400 variables.
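For what it's worth, the bar-separated list would not necessarily have to be typed; it could be generated by a macro. Below is a minimal, untested sketch, assuming the variables really are named var1 through var400. The macro name %barlist and its two parameters are hypothetical, made up for illustration. Note that all pairs of 400 variables come to 79,800 interaction terms on top of the 400 main effects, so the resulting model may be too large to fit in practice.

```sas
/* Hypothetical helper: emits  prefix1 | prefix2 | ... | prefixN  */
%macro barlist(prefix, n);
  %local i;
  &prefix.1
  %do i = 2 %to &n;
    | &prefix&i
  %end;
%mend barlist;

proc logistic data = d1;
  /* @2 limits the bar expansion to main effects and two-way interactions */
  model y = %barlist(var, 400) @2 / rsquare;
run;
```

Running the macro with a small n first (for example, %put %barlist(var, 3);) would be a cheap way to check the generated text in the log before using it in the model statement.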
Is there a better method than doing this the hard way, i.e. creating a new dataset that contains all of the interaction terms?