Gym: "Taxi-v2" done = True after 200 steps

Created on 4 Feb 2017 · 5Comments · Source: openai/gym

I like using the Taxi environment for educational purposes but was kind of upset to see Taxi-v1 removed completely and that in Taxi-v2 is considered "done" after 200 steps. Is there any workaround for this?

Source

wagonhelm

Most helpful comment

My work around for this was creating a loop that finishes when reward == 20 rather than when done == True

wagonhelm on 8 Feb 2017

👍3

All 5 comments

Hello @wagonhelm . You never said why is this a problem.

olegklimov on 5 Feb 2017

It's kind of nice to show that a completely random policy will eventually solve the environment in a varying number of steps. It sometimes will solve in <200 steps using random actions, but not often.

wagonhelm on 5 Feb 2017

So it sometimes solves problem within 200 steps, right. You can calculate mean score.

olegklimov on 5 Feb 2017

That's not really my main concern. I'm guessing there is no way to use Taxi-v1 using the master branch nor a workaround for v2 considering the environment done after 200 steps? Ultimately it would be nice to see v1 in the master branch as it's on the gym website as well.

wagonhelm on 5 Feb 2017

My work around for this was creating a loop that finishes when reward == 20 rather than when done == True

wagonhelm on 8 Feb 2017

👍3

Was this page helpful?

0 / 5 - 0 ratings