Stable Baselines is a library with implementations of various reinforcement learning algorithms in Python, developed by OpenAI. Please mention the exact version of Stable Baselines that is being used in the body of the question.
Questions tagged [stable-baselines]
277 questions
0
votes
0 answers
For DQN with prioritized experience replay, what is the TD error for terminal states?
While calculating TD error of the target network in Prioritized Experience Replay, we have from the paper equation 2) in Appendix B:
$$\delta_t := R_t + \gamma max_a Q(S_t, a) - Q(S_{t-1}, A_{t-1})$$
It seems unnecessary / incorrect to me that the…

Srikiran
- 309
- 1
- 3
- 9
-1
votes
0 answers
Frozenlake Reward shaping
I'm trying to use a neurosymbolic approach to solve the Frozenlake enviroment, using also stable baselines 3.
I used the TransformReward on the enviroment, and seems that it's working (changing the reward values).
So here it is how it works the…

Joaquin
- 139
- 1
- 3
- 12
-1
votes
1 answer
How to evaluate q-value network of sac agent in stable baselines (on a state-action pair)?
I am implementing SAC agent in stable baselines and need to evaluate q-value network inside my custom environment.I have tried to get the q-values from SAC class object, but failed.
Any method or function like the one with PPO (.value) will be very…
-2
votes
1 answer
Reinforcement learning agent picking same action for every state - OpenAI gym environment with stable_baselines3
I have been training a reinforcement learning agent to play ultimate-tictactoe (an expanded version of tic-tac-toe with a 9x9 board and additional rules).
I've created an openai gym environment, and have been trying to train the agent using…

Samatva K
- 1
- 1