In GroupKFold in ScikitLearn error message: ValueError: too many values to unpack (expected 2)

Question

In using the GroupKFold method from scikit-learn I am getting an error message which I can not understand given the documentation.

The error message is:

ValueError: too many values to unpack (expected 2)

The documentation states:

For a reproducible example:

from sklearn.model_selection import GroupKFold

X1 = np.random.randint(1, 10, size = (100, 2))

groups1 = np.random.choice([1,2,3, 4, 5], size = 100, p = [.15, .2, .3, .15, .2])


gkf1 = GroupKFold(5)

train, test = gkf1.split(X = X1, groups = groups1 )

Which yields the following error message:

---------------------------------------------------------------------------
ValueError                                Traceback (most recent call last)
<ipython-input-56-911681dea183> in <module>
      8 gkf1 = GroupKFold(5)
      9 
---> 10 train, test = gkf1.split(X = X1, groups = groups1 )

ValueError: too many values to unpack (expected 2)

score 1 · Accepted Answer · answered Oct 29 '19 at 20:01

1

The split function returns a generator. You will have to iterate over the generator to get the train and test groups.

Like shown in the example

for train_index, test_index in gkf1.split(X, y, groups):

answered Oct 29 '19 at 20:01

abhilb

5,639
2
20
26

In GroupKFold in ScikitLearn error message: ValueError: too many values to unpack (expected 2)

1 Answers1