Keras: ValueError: You are trying to load a weight file containing 3 layers into a model with 1 layers.

Created on 21 Oct 2017 · 6 comments · Source: keras-team/keras

I train a model with the following code:

base_model = VGG19(...)
..............
model.fit_generator(...., callbacks=[ModelCheckpoint(filepath=checkpoint_file, save_best_only=True,
                    save_weights_only=True)]....)

however, when I load the weights with the following code:

from keras.utils.training_utils import multi_gpu_model
model = load_vgg19(...)
model = multi_gpu_model(model, gpus=3)
model.load_weights(checkpoint_file)
model.fit_generator(...........)

I got the error ValueError: You are trying to load a weight file containing 3 layers into a model with 1 layers, followed by AttributeError: 'NoneType' object has no attribute 'TF_DeleteStatus'. Is this a bug?


All 6 comments

This probably has something to do with multi_gpu_model copying your model onto multiple GPUs. Do a model.summary() after multi_gpu_model and you will see that you can't inspect the layers directly.
Could you try loading the weights before calling multi_gpu_model? It might work that way.

from keras.utils.training_utils import multi_gpu_model
model = load_vgg19(...)
model.load_weights(checkpoint_file) # DO THIS FIRST
model = multi_gpu_model(model, gpus=3) # split amongst multiple gpus
model.fit_generator(...........) # enjoy!
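The ordering matters because multi_gpu_model wraps the original model as a single nested layer, so the outer model's layer count no longer matches the checkpoint. A minimal sketch of that nesting effect, using a small toy Sequential model in place of VGG19 (the model and layer sizes here are illustrative, not from the original post):

```python
import tensorflow as tf

# Toy stand-in for VGG19; any Sequential model shows the same effect.
base = tf.keras.Sequential([
    tf.keras.layers.Dense(16, input_shape=(8,)),
    tf.keras.layers.Dense(8),
    tf.keras.layers.Dense(2),
])

# Wrapping a model inside another model (as multi_gpu_model does with its
# replica graph) makes the whole inner model appear as ONE layer of the wrapper.
inputs = tf.keras.Input(shape=(8,))
wrapped = tf.keras.Model(inputs, base(inputs))

print(len(base.layers))     # 3 real layers
print(len(wrapped.layers))  # fewer: the nested model counts as a single layer
```

A weights file saved from `base` lists its individual layers, so loading it into `wrapped` fails the layer-count check; loading into `base` first, then wrapping, avoids the mismatch.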

@jamesben6688 I encountered the same problem as you. After reading the source code of the function multi_gpu_model(...) and running parallel_model.summary(), I found that the original single-GPU model structure is nested inside the 'sequential_1' layer of parallel_model. This 'sequential_1' layer is actually a keras.models.Sequential object (i.e., a Sequential model), which means you can see its internal structure by running:

nested_layers = parallel_model.get_layer('sequential_1')
nested_layers.summary()

I encountered the same problem. Thanks to @akshaychawla's comment, I figured out it was caused by multi_gpu_model. The saved weights were trained on two GPUs, and I then tried to load them into a model on a single GPU, which gave me the error. A simple solution is to make sure your new model uses the same number of GPUs by calling multi_gpu_model before loading the weights.

model.add(tf.keras.layers.Flatten()) # takes our 28x28 and makes it 1x784
Change the previous line to:
model.add(tf.keras.layers.Flatten(input_shape=(28, 28)))

ERROR RESOLVED: ValueError: You are trying to load a weight file containing 3 layers into a model with 0 layers.

As suggested by partivadas, adding the input shape fixed it for me.
Before:
tf.keras.layers.Flatten()
After:
tf.keras.layers.Flatten(input_shape=(28, 28))
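Why this helps: a Sequential model whose first layer has no input shape is not built until it first sees data, so at load time it has no weights to match against, which is why the error reports "a model with 0 layers". A small sketch of the difference (exact build behavior may vary slightly across TF versions):

```python
import tensorflow as tf

# Without input_shape the model cannot create its weights up front.
unbuilt = tf.keras.Sequential([
    tf.keras.layers.Flatten(),
    tf.keras.layers.Dense(10),
])
print(unbuilt.built)       # False: no weights exist yet

# Giving the first layer an input_shape builds the model immediately.
built = tf.keras.Sequential([
    tf.keras.layers.Flatten(input_shape=(28, 28)),
    tf.keras.layers.Dense(10),
])
print(built.built)         # True
print(len(built.weights))  # kernel + bias of the Dense layer
```

Calling load_weights on the unbuilt model is what triggers the "0 layers" variant of the error.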

Closing this issue since it's resolved. Feel free to reopen if the problem persists. Thanks!
