Stable-baselines: Question about masks for predict method

Created on 18 Aug 2018  路  1Comment  路  Source: hill-a/stable-baselines

Are masks (dones) only useful when using recurrent policies ?

question

Most helpful comment

When using the predict method of an algorithm, yes it should only be used in recurrent policies. It allows the LSTMs to reset their internal values when the environment resets.

>All comments

When using the predict method of an algorithm, yes it should only be used in recurrent policies. It allows the LSTMs to reset their internal values when the environment resets.

Was this page helpful?
0 / 5 - 0 ratings