Apply ZCA whitening to VGG in keras

Question

I am using the VGG here to train my data:

# 分類するクラス
classes = ['chino', 'cocoa', 'chiya', 'rize', 'syaro']
nb_classes = len(classes)

img_width, img_height = 150, 150

# トレーニング用とバリデーション用の画像格納先
train_data_dir = 'dataset/train'
validation_data_dir = 'dataset/validation'

# 今回はトレーニング用に200枚、バリデーション用に50枚の画像を用意した。
nb_train_samples = 200
nb_validation_samples = 50

batch_size = 16
nb_epoch = 10


result_dir = 'results'
if not os.path.exists(result_dir):
    os.mkdir(result_dir)


def vgg_model_maker():
    """ VGG16のモデルをFC層以外使用。FC層のみ作成して結合して用意する """

    # VGG16のロード。FC層は不要なので include_top=False
    input_tensor = Input(shape=(img_width, img_height, 3))
    vgg16 = VGG16(include_top=False, weights='imagenet', input_tensor=input_tensor)

    # FC層の作成
    top_model = Sequential()
    top_model.add(Flatten(input_shape=vgg16.output_shape[1:]))
    top_model.add(Dense(256, activation='relu'))
    top_model.add(Dropout(0.5))
    top_model.add(Dense(nb_classes, activation='softmax'))

    # VGG16とFC層を結合してモデルを作成
    model = Model(input=vgg16.input, output=top_model(vgg16.output))

    return model


def image_generator():
    """ ディレクトリ内の画像を読み込んでトレーニングデータとバリデーションデータの作成 """
    train_datagen = ImageDataGenerator(
        rescale=1.0 / 255,
        zoom_range=0.2,
        horizontal_flip=True)

    validation_datagen = ImageDataGenerator(rescale=1.0 / 255)

    train_generator = train_datagen.flow_from_directory(
        train_data_dir,
        target_size=(img_width, img_height),
        color_mode='rgb',
        classes=classes,
        class_mode='categorical',
        batch_size=batch_size,
        shuffle=True)

    validation_generator = validation_datagen.flow_from_directory(
        validation_data_dir,
        target_size=(img_width, img_height),
        color_mode='rgb',
        classes=classes,
        class_mode='categorical',
        batch_size=batch_size,
        shuffle=True)

    return (train_generator, validation_generator)


if __name__ == '__main__':
    start = time.time()

    # モデル作成
    vgg_model = vgg_model_maker()

    # 最後のconv層の直前までの層をfreeze
    for layer in vgg_model.layers[:15]:
        layer.trainable = False

    # 多クラス分類を指定
    vgg_model.compile(loss='categorical_crossentropy',
              optimizer=optimizers.SGD(lr=1e-3, momentum=0.9),
              metrics=['accuracy'])

    # 画像のジェネレータ生成
    train_generator, validation_generator = image_generator()

    # Fine-tuning
    history = vgg_model.fit_generator(
        train_generator,
        samples_per_epoch=nb_train_samples,
        nb_epoch=nb_epoch,
        validation_data=validation_generator,
        nb_val_samples=nb_validation_samples)

    vgg_model.save_weights(os.path.join(result_dir, 'finetuning.h5'))

    process_time = (time.time() - start) / 60
    print(u'学習終了。かかった時間は', process_time, u'分です。')

If it is just VGG alone, it works very well, but I want to apply zca whitening to it. Keras allows adding ZCA with only one line of code (zca_whitening=True), but the problem is that we have to call the fit() function later.

The argument of ImageDataGenerator#fit() is x, which is the training data set. But in this program the training data set will be defined afterward. So how can I use the zca whitening in this case?

Update I already did as the comments below but it raised an error:

Too large work array required -- computation cannot be performed with standard 32-bit LAPACK

To reduce memory, I tried to put only 6 images to the training dataset, but nothing changed. So actually what does this error mean?

What do you mean apply whitening to VGG? You usually whiten the inputs, not the network itself. Please clarify your question. — Dr. Snoopy, Mar 28 '18 at 12:19
Whitening the input is what I meant here. I have problem with image generation. — Le Trong Nghia, Mar 29 '18 at 09:26

score 2 · Answer 1 · answered Mar 28 '18 at 12:19

2

It's pretty simple, actually. You create a generator+DirectoryIterator without whitening, load a bunch of batches (containing sufficient samples to represent your dataset) in memory and use them to fit a new generator:

import numpy as np
from keras.preprocessing.image import ImageDataGenerator

train_dir = '/datasets/train/'
gen_tr_batches = 4

# Generator for the network's training generator.
gen_train = (ImageDataGenerator(rescale=1. / 255.)
             .flow_from_directory(train_dir,
                                  target_size=(256, 256),
                                  batch_size=32,
                                  shuffle=True))
gen_tr_x = np.vstack(next(gen_train)[0] for _ in range(gen_tr_batches))

# Actual generator for the network's training.
g = ImageDataGenerator(rescale=1. / 255.,
                       zca_whitening=True)
g.fit(gen_tr_x)

train = g.flow_from_directory(train_dir)

# Network's training ...

Note: don't post links. It's preferable to simply write down a reproducible code snippet in your post instead.

answered Mar 28 '18 at 12:19

ldavid

2,512
2
22
38

Thank you! I will try not to post a link again. – Le Trong Nghia Mar 29 '18 at 08:53
Thank you! I tried it but know the program said that "computation cannot be performed with standard 32-bit LAPACK. What does it means? – Le Trong Nghia Mar 29 '18 at 09:06
I don't have this problem. Dump here your pip packages listing and give me some info on your setup. Also, someone had a [similar problem](https://github.com/numpy/numpy/issues/8261) when dealing with very large matrices. Reduce `gen_tr_batches` to 1 and `batch_size` to 1. See if it works then. – ldavid Mar 29 '18 at 11:54
1

Yes I also found the "similar problem" but I already changed all batch size to 1 and nothink work. My code is kind of the same of what you wrote except I changed the target size to (100,100). Does it matter? – Le Trong Nghia Mar 29 '18 at 12:11

Apply ZCA whitening to VGG in keras

1 Answers1