
Is it logical to do the following in Keras so that I don't run out of memory?

# load one chunk at a time so the full dataset never sits in memory
for path in ['xaa', 'xab', 'xac', 'xad']:
    x_train, y_train = prepare_data(path)
    model.fit(x_train, y_train, batch_size=50, epochs=20, shuffle=True)

model.save('model')

2 Answers


It is, but prefer model.train_on_batch if each iteration produces a single batch; it avoids some of the overhead that comes with fit.
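For comparison, a minimal sketch of the train_on_batch variant, assuming prepare_data(path) returns exactly one batch of arrays (the paths and the 20 epochs are taken from the question):

for epoch in range(20):
    for path in ['xaa', 'xab', 'xac', 'xad']:
        # assumed: each chunk is a single batch of samples
        x_batch, y_batch = prepare_data(path)
        # one gradient update on this batch; returns the batch loss
        loss = model.train_on_batch(x_batch, y_batch)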

You can also create a generator and use model.fit_generator():

def dataGenerator(paths, batch_size):
    while True:  # generators for Keras must be infinite
        for path in paths:
            x_train, y_train = prepare_data(path)

            total_samples = x_train.shape[0]
            batches = total_samples // batch_size
            if total_samples % batch_size > 0:
                batches += 1  # account for the last, partial batch

            for batch in range(batches):
                section = slice(batch * batch_size, (batch + 1) * batch_size)
                yield (x_train[section], y_train[section])

Create and use:

gen = dataGenerator(['xaa', 'xab', 'xac', 'xad'], 50)
model.fit_generator(gen,
                    steps_per_epoch=expectedTotalNumberOfYieldsForOneEpoch,
                    epochs=epochs)
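If you don't know the chunk sizes up front, one way to get steps_per_epoch is to count the batches once before training. A sketch, assuming each chunk fits in memory one at a time; countSteps is a hypothetical helper, not part of Keras:

import math

def countSteps(paths, batch_size):
    # hypothetical helper: load each chunk once just to count its
    # samples, summing the number of batches it will yield
    steps = 0
    for path in paths:
        x_train, _ = prepare_data(path)
        steps += math.ceil(x_train.shape[0] / batch_size)
    return steps

expectedTotalNumberOfYieldsForOneEpoch = countSteps(['xaa', 'xab', 'xac', 'xad'], 50)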
Daniel Möller

I would suggest having a look at this thread on GitHub.

You could indeed use model.fit(), but training will be more stable if you structure it like this:

for epoch in range(20):
    for path in ['xaa', 'xab', 'xac', 'xad']:
        x_train, y_train = prepare_data(path)
        # run exactly one epoch on this chunk; initial_epoch keeps the
        # global epoch counter consistent across chunks
        model.fit(x_train, y_train, batch_size=50, epochs=epoch + 1,
                  initial_epoch=epoch, shuffle=True)

This way you iterate over all of your data once per epoch, rather than running 20 epochs on one chunk before moving to the next.

As discussed in the thread, another solution would be to develop your own data generator and use it with model.fit_generator().

benjaminplanche