Stable-baselines: Dueling and Double DQN

Created on 15 Jul 2019 · 1Comment · Source: hill-a/stable-baselines

Hi,
I would like to be able to compare the different variations of DQN, the vanilla DQN(original paper) and double DQN and dueling extensions but I don't know how to disable the dueling and double extensions. I notice that the double option is within the build_training method but the parameter is not exposed. Likewise the dueling option is part of the policy parameters but the DQN instatiation doesn't allow to modify the policy parameters, for instance like this:

policy = LnMlpPolicy(train_env.observation_space ,train_env.action_space, 1, sz_train, sz_train, reuse=False, dueling=False)

But I can try to pass the modified policy to the DQN class instantiation, it doesn't seem to work.
Any advice?

Thanks in advance

enhancement question

Source

edwinlima

👍2

Most helpful comment

Hello,

That's a good question.
For disabling the dueling, you can use net_kwargs argument like that:

from stable_baselines import DQN

model = DQN('MlpPolicy', 'CartPole-v1', policy_kwargs=dict(dueling=False))

For double q learning, we should expose that.
We would appreciate PR that adds that feature ;)

araffin on 16 Jul 2019

👍4 ❤1

>All comments

Hello,

That's a good question.
For disabling the dueling, you can use net_kwargs argument like that:

from stable_baselines import DQN

model = DQN('MlpPolicy', 'CartPole-v1', policy_kwargs=dict(dueling=False))

For double q learning, we should expose that.
We would appreciate PR that adds that feature ;)

araffin on 16 Jul 2019

👍4 ❤1

Was this page helpful?

0 / 5 - 0 ratings

Related issues

[question] Is PPO2 with multiple workers the same as DPPO?

acyclics · 3Comments

What is the default network architecture for MlpLnLstmPolicy?

ktattan · 3Comments

[question] What does .action_probability mean for continuous spaces?

shwang · 3Comments

Trying to run the GAIL Example Code

JankyOo · 3Comments

Tensorboard add summary image

maystroh · 3Comments