Hello, i am working on simple racing game project.
For now, i manage to achieve some results based on rewards system that is taking into consideration position, speed and collision. If agent agent is not hitting any obstacles and move forward during step i rewarded him basing on his speed. I punish agent when he is colliding with walls, obstacles or he is off the race track.
But this method is far from ideal and will not encourage agent to minimise lap times (agent will focus on keeping just high speed and stopping and hitting obstacles).
I would love to acess the agent basing on lap times, but this would mean that i need to set reward only after finnishing lap and will probably not give enough feedback information on start of the training and during the lap.
Do You have any suggestion on how to implement reinforcement learning rewards for racing type of game?
You could implement checkpoints, which reward the agent relatively to the taken time.
@MarcoMeter suggests, you can implement a time-based reward. A popular technique we use in our example environments is to assign a small negative reward at each timestep, so the agent is incentivized to finish as quickly as possible.
@MarcoMeter checkpoints are good idea. I can reward agent when he reach next checkpoint (+1) and assign small negative reward in meantime (in way to checkpoint) (i.e. -0.001 each step).
Thank you for the discussion. We are closing this issue due to inactivity. Feel free to reopen it if you鈥檇 like to continue the discussion though.
This thread has been automatically locked since there has not been any recent activity after it was closed. Please open a new issue for related bugs.
This thread has been automatically locked since there has not been any recent activity after it was closed. Please open a new issue for related bugs.
Most helpful comment
You could implement checkpoints, which reward the agent relatively to the taken time.