Deepspeech: Save only the best checkpoint

Created on 24 Jul 2018 · 8Comments · Source: mozilla/DeepSpeech

Instead of using EarlyStop to avoid overfitting, I would like to save the model at the time it had the best (lowest) validation loss. In other words, I would like to keep track only for the best checkpoint (based on val/dev set).

I tried to use a wrapper for tf.train.Saver (this one: https://github.com/vonclites/checkmate), but couldn't make it works with DeepSpeech. Is there a easy way to do tha (maybe using the MonitoredTrainingSession as you are using)?

Source

bernardohenz

Most helpful comment

Updated version of the patch is here: https://gist.github.com/reuben/dcc2deaf85568591e34ce363bc3bac2a

reuben on 17 Nov 2018

🎉1 👍1

All 8 comments

I looked into this briefly but couldn't find a clean way to implement it with MonitoredTrainingSession (which is IMO a terrible API). I ended up just writing a hack that works, but isn't really code we can land. I'm attaching the patch.

save_best_val.patch.txt

reuben on 24 Jul 2018

Thanks @reuben , I had to change the name of the MonitoredTrainingSession to train_session for this to work. Now it is working perfectly fine.

bernardohenz on 25 Jul 2018

@bernardohenz What's the status here, is the issue fixed, do you have a workaround ? Should we close this ?

lissyx on 2 Oct 2018

@lissyx yes, the patch from @reuben worked just fine.

bernardohenz on 2 Oct 2018

👍1

We should have a proper solution for this in-tree. This would be too much work with the current training setup, but would probably be very simple if we used TF Eager, for example. Reopening so we don't forget.

reuben on 2 Oct 2018

Updated version of the patch is here: https://gist.github.com/reuben/dcc2deaf85568591e34ce363bc3bac2a

reuben on 17 Nov 2018

🎉1 👍1