
I created a classification model using Random Forest. To validate the model I am using K-Fold cross-validation with 10 splits and measuring model performance with the f1-score. When I do this, I get very low f1-scores for the first few folds and very high f1-scores for the rest of the folds.

I am expecting a similar range of scores in each split.

Code:

from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import KFold
from sklearn.metrics import f1_score

kf = KFold(n_splits=20, random_state=41)

f1list = []

for train_index, test_index in kf.split(XX):
    print("Train:", train_index, "Validation:", test_index)
    X_train, X_test = XX[train_index], XX[test_index]
    Y_train, Y_test = YY[train_index], YY[test_index]
    LR1 = RandomForestClassifier(n_estimators=10, criterion='entropy',
                                 random_state=1, max_depth=25, warm_start=True,
                                 bootstrap=True, oob_score=True, n_jobs=-1)

    model1 = LR1.fit(X_train, Y_train)
    pred1 = model1.predict(X_test)

    # f1_score expects (y_true, y_pred)
    f1list.append(f1_score(Y_test, pred1))

and the list of f1-scores for the 10 splits is:

[0.3659305993690852, 0.32, 0.3440860215053763, 0.3668639053254438, 0.4183381088825215, 0.9969525468001741, 0.9979652345793849, 0.9984892504357932, 0.9980234856412045, 0.9977904407489243]

1 Answer


The code looks correct to me, so the problem is probably in your data: results like this depend heavily on how the data is partitioned. You could try the following:

  1. Check that you have enough data for a 20-fold CV. Maybe you should consider fewer folds.
  2. Shuffle the data. It is good practice, as explained here.
  3. Repeat the CV several times. To get a single metric, average the f1-scores over the splits of each run, and then average those per-run averages (see the sketch after this list).
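
Here is a minimal sketch of points 2 and 3, assuming XX and YY are the feature matrix and label array from your question. RepeatedKFold shuffles the data before every repetition, and I've used 10 folds instead of 20 in case the dataset is on the small side:

import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import RepeatedKFold, cross_val_score

# Same main settings as in the question; warm_start and oob_score are
# dropped because cross_val_score refits a fresh clone for every split.
clf = RandomForestClassifier(n_estimators=10, criterion='entropy',
                             max_depth=25, random_state=1, n_jobs=-1)

# 10 shuffled folds, repeated 5 times -> 50 f1-scores in total.
cv = RepeatedKFold(n_splits=10, n_repeats=5, random_state=41)
scores = cross_val_score(clf, XX, YY, cv=cv, scoring='f1')

# Average over the splits of each repetition, then over the repetitions.
per_repeat = np.reshape(scores, (5, 10)).mean(axis=1)
print("f1 per repetition:", per_repeat)
print("overall f1:", per_repeat.mean())

If the low scores in the first few folds disappear once shuffling is on, your data is most likely ordered (for example, by class), which is exactly the kind of thing an unshuffled KFold exposes.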

Let me know if it works!
