Ml-agents: Rewind a Model that Unlearned Something

Created on 1 Aug 2018  路  3Comments  路  Source: Unity-Technologies/ml-agents

I'm having an issue where, when I leave an agent learning overnight, I return and it has gotten significantly worse and the cumulative reward never returns to an upward trend. What can I be doing wrong and can I "rewind" the model to its peak state?
screen shot 2018-08-01 at 9 01 01 am

help-wanted

Most helpful comment

@GuntherFox You can set the --keep-checkpoints option to a high enough value, so that older cptk files don't get deleted. After training, create a backup of your models folder, then edit the "checkpoint" file and set model_checkpoint_path to the cptk file associated with the best results. Now, when you rerun learn.py with the --load option, training should resume from that checkpoint.

All 3 comments

@GuntherFox You can set the --keep-checkpoints option to a high enough value, so that older cptk files don't get deleted. After training, create a backup of your models folder, then edit the "checkpoint" file and set model_checkpoint_path to the cptk file associated with the best results. Now, when you rerun learn.py with the --load option, training should resume from that checkpoint.

Awesome, thanks a bunch! This is exactly what I was lookin' for.

Thanks @mbaske, you just saved me a few hours of retraining after pyton threw a wobbly. 馃憤

Was this page helpful?
0 / 5 - 0 ratings

Related issues

GeriBP picture GeriBP  路  3Comments

tensorgpu picture tensorgpu  路  3Comments

DVonk picture DVonk  路  3Comments

Sohojoe picture Sohojoe  路  3Comments

MrGitGo picture MrGitGo  路  4Comments