I know this might seem as a duplicate question, but I have done what was suggested here, but it didn't work. I'm working with 28 variables, some of them categorical, but I dropped the one of each variable's categories. As stated in the old question I defined:
import statsmodels.api as sm
from sklearn.linear_model import LogisticRegression
logit = sm.Logit(y_train, X_train)
clf = LogisticRegression(C=1e8,fit_intercept=False)
clf.fit(X_train, y_train)
And then, when I check my results, in sm
my log-likelihood is -19661. In sklearn
my log_loss is 1.9. In sm
my coefficients range form -1.38 to 1.98. In sklearn
they range from -0.01 to 0.02. I need my results to be more alike, so I can perform some inference on my results. What could be happening? What am I missing?
Thanks.