Ml-agents: Imitation Learning slows over time

Created on 24 Apr 2018 · 5Comments · Source: Unity-Technologies/ml-agents

While I was doing the work for my video tutorial on Imitation Learning, I found that in long training sessions the game would slow down, sometimes significantly. The scenario being used is pretty straightforward (I am using the Physics engine though). Any thoughts as to why the training might get slower over time?

Video demoing scenario is here: https://youtu.be/WzuVZuGT-rw
Code is here: https://github.com/dracolytch/ML-Simplest-Scenario

help-wanted

Source

dracolytch

Most helpful comment

@superjayman
The learned behavior is stored within the weights of the neural network. This is not getting erased/cleared/reset.

The learning algorithm samples training examples from the experience buffer. So if the buffer is large, then there is a potential that old or useless data is utilized during training and thus it is not of help for the training progress.

MarcoMeter on 25 Apr 2018

👍3

All 5 comments

Hi,
The slowdown happens because the experience buffer used to train the agent becomes very large and it becomes harder to sample efficiently from it. If you have a BCTeacherHelper component attached to your teacher in your environment, you can press C to clear the experience buffer. Another thing that could cause this is when the batch_size and batches_per_epoch parameters are too large, the amount of calculation needed to update the model (at every step) slows the communication down. I think you could decrease these two values and see if it solves your problem.
I hope this helps.