Reward Function in MIT Deep Traffic Challenge?

Question

After getting a general understanding of the architecture, I was wondering what exactly the reward function given by the environment is.

I also found this JavaScript codebase, which does not really help my understanding either.

mljack · Accepted Answer · 2018-07-20T18:12:55.077

The reward is the average speed, shifted and scaled so that average speeds between 0 and 120 map to rewards in the interval [-3, 3].

The deeptraffic environment is implemented in this file: https://selfdrivingcars.mit.edu/deeptraffic/gameopt.js

    var reward = (avgSpeedMeasurement - 60) / 20;
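To see what that line does to concrete numbers, here is a minimal sketch that wraps the same expression in a function. The name `computeReward` and the sample speeds are my own assumptions for illustration; only the constants 60 and 20 come from the line quoted above.

```javascript
// Hypothetical helper wrapping the expression quoted from gameopt.js.
// Only the constants 60 and 20 come from that line; the name is an assumption.
function computeReward(avgSpeedMeasurement) {
  // Shift so that an average speed of 60 gives 0, then scale by 20.
  return (avgSpeedMeasurement - 60) / 20;
}

// Illustrative average speeds, not values taken from the simulator.
const sampleSpeeds = [0, 40, 60, 80, 120, 140];

for (const speed of sampleSpeeds) {
  console.log(speed, computeReward(speed));
}
```

An average speed of 60 gives a reward of 0, and 0 and 120 give -3 and 3. The expression itself applies no clamping, so in this sketch an input of 140 yields 4.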

edited Jul 20 '18 at 18:12

answered Jul 18 '18 at 18:53


Just to make this complete: there is no reward clipping involved, as you can see in the above-mentioned equation. – mrk Jul 19 '18 at 21:10
