is there a simple way to divide a dataset into 5 subsets of the same size? Also taking into account the percentage of class distribution? It should be possible to access each subset directly.
Many thanks
is there a simple way to divide a dataset into 5 subsets of the same size? Also taking into account the percentage of class distribution? It should be possible to access each subset directly.
Many thanks
Are you talking about Kfold? scikit-learn StratifiedKFold is a variation of KFold. The folds are made by preserving the percentage of samples for each class.