Mask_rcnn: Loss increases after restarting training with model.find_last()

Created on 4 Sep 2019 · 4 comments · Source: matterport/Mask_RCNN

Hi,
When I restart model training after loading the last weights with model.find_last(), my losses jump up.
I use the following to load the weights from the last run:
model.load_weights(model.find_last(), by_name=True)
model.train(train, test, learning_rate=train_config.LEARNING_RATE / 10, epochs=60, layers='all')
[Image: training-loss curve across epochs, with the restart point circled]

In the image, the circled point (epoch 30) is where I stopped my last run. When I restarted training, the loss jumped up. Any ideas why that might be happening?
Thanks,
Amit

All 4 comments

I also encountered this problem. I noticed that the optimizer state is not saved at the end of each epoch, and every time training is restarted a new optimizer is initialized. This might cause the loss to increase.
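
There is no built-in switch for this in the repo, but the optimizer state can be carried across runs by hand. A minimal sketch, assuming the stock matterport wrapper (it exposes the underlying Keras model as model.keras_model, has a compile(learning_rate, momentum) method, and trains with SGD + momentum); optimizer_state.pkl is a made-up filename:

```python
import pickle

# --- at the end of a training run ---
# model.keras_model is the plain Keras model inside the MaskRCNN wrapper.
# The optimizer's weights are its internal state: the iteration counter plus
# the per-variable momentum buffers (the repo trains with SGD + momentum).
optimizer_state = model.keras_model.optimizer.get_weights()
with open("optimizer_state.pkl", "wb") as f:  # hypothetical filename
    pickle.dump(optimizer_state, f)

# --- when resuming ---
model.load_weights(model.find_last(), by_name=True)
# compile() builds a fresh SGD optimizer; _make_train_function() (private
# Keras API) creates its momentum buffers without running a training step.
model.compile(train_config.LEARNING_RATE / 10, train_config.LEARNING_MOMENTUM)
model.keras_model._make_train_function()
with open("optimizer_state.pkl", "rb") as f:
    model.keras_model.optimizer.set_weights(pickle.load(f))
```

The catch is that model.train() calls compile() again internally, which creates a brand-new optimizer, so the restore above only sticks if training is then driven through keras_model.fit_generator directly (or train() is patched to skip recompiling).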

@akrsrivastava @ZhenghanFang Did you manage to figure out how to deal with this problem? Is there a way to save the optimizer state at the end of each epoch?

I reverted to using the Keras LearningRateScheduler callback to update the learning rate without restarting model training (sketched below).
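
For reference, a minimal sketch of that approach, assuming a version of the repo whose model.train() accepts a custom_callbacks argument (current master does); lr_schedule mirrors the manual LEARNING_RATE / 10 drop from the question:

```python
from keras.callbacks import LearningRateScheduler

def lr_schedule(epoch):
    # Keep the base rate for the first 30 epochs, then drop it 10x,
    # mirroring the manual LEARNING_RATE / 10 restart from the question.
    base_lr = train_config.LEARNING_RATE
    return base_lr if epoch < 30 else base_lr / 10

# One uninterrupted run from epoch 0 to 60: the optimizer is created once,
# so its momentum state is never thrown away between the two LR "stages".
model.train(train, test,
            learning_rate=train_config.LEARNING_RATE,
            epochs=60,
            layers='all',
            custom_callbacks=[LearningRateScheduler(lr_schedule)])
```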

Can confirm the training loss definitely jumps up after restarting training with model.find_last()
