How to rescale the output of PPO model to the range of the action space in PPO?

Asked Feb 07 '23 at 06:08

Active Feb 07 '23 at 06:08

Viewed 40 times

Stable baselines3 PPO implementation using MLP policy uses two hidden layers with 64 nodes each. On setting my gym environment, I had set my action space in the range [-50,50]. However, there seem to be no such bounds on the model output in the PPO MLP policy implementation.

How does one scale the model output to the scale of the action space, especially on stable baselines3?

asked Feb 07 '23 at 06:08

Manav Mishra

How to rescale the output of PPO model to the range of the action space in PPO?

0 Answers0