Stable-baselines: Question about masks for predict method

Created on 18 Aug 2018 · 1 Comment · Source: hill-a/stable-baselines

Are masks (dones) only useful when using recurrent policies?

question

araffin

Most helpful comment

Yes. In an algorithm's predict method, the mask is only used by recurrent policies. It lets the LSTM reset its internal state when the environment resets.

hill-a on 18 Aug 2018

👍2
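
To illustrate the answer above, here is a minimal sketch of what a mask does to a recurrent state. It does not use stable-baselines itself: ToyRecurrentPolicy and its tanh recurrence are hypothetical stand-ins, and only the argument names (observation, state, mask) are meant to echo the predict call discussed here. The assumption is that a mask entry set to True for an environment zeroes that environment's hidden state before the next step.

```python
import math


class ToyRecurrentPolicy:
    """Hypothetical stand-in for a recurrent policy; not stable-baselines code."""

    def predict(self, observation, state=None, mask=None):
        # One hidden value per environment; start from zeros if no state is given.
        if state is None:
            state = [0.0] * len(observation)
        # No mask means no environment has just reset.
        if mask is None:
            mask = [False] * len(observation)
        # Assumed behaviour: a True mask entry clears that env's hidden state.
        state = [0.0 if done else h for h, done in zip(state, mask)]
        # Toy recurrence: mix the (possibly cleared) state with the observation.
        new_state = [math.tanh(h + obs) for h, obs in zip(state, observation)]
        # Toy action: 1 if the hidden value is positive, else 0.
        actions = [1 if h > 0 else 0 for h in new_state]
        return actions, new_state


policy = ToyRecurrentPolicy()
obs = [1.0, 1.0]

# Step 1: no previous state, nothing to reset.
_, state = policy.predict(obs)

# Step 2: env 0 continues its episode, env 1 has just reset (done=True).
_, state = policy.predict(obs, state=state, mask=[False, True])

# Env 1 behaves as if it were on its first step again; env 0 kept its memory.
print(state)
```

With a non-recurrent policy there is no hidden state to carry between calls, so there is nothing for the mask to reset.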
