Questions tagged [stable-baselines]

Stable Baselines is a library with implementations of various reinforcement learning algorithms in Python, developed by OpenAI. Please mention the exact version of Stable Baselines that is being used in the body of the question.

277 questions

votes

0 answers

For DQN with prioritized experience replay, what is the TD error for terminal states?

While calculating TD error of the target network in Prioritized Experience Replay, we have from the paper equation 2) in Appendix B: $$\delta_t := R_t + \gamma max_a Q(S_t, a) - Q(S_{t-1}, A_{t-1})$$ It seems unnecessary / incorrect to me that the…

asked Jul 17 '20 at 21:20

Srikiran

-1

votes

0 answers

Frozenlake Reward shaping

I'm trying to use a neurosymbolic approach to solve the Frozenlake enviroment, using also stable baselines 3. I used the TransformReward on the enviroment, and seems that it's working (changing the reward values). So here it is how it works the…

python artificial-intelligence openai-gym stable-baselines reward

asked Aug 09 '23 at 18:51

Joaquin

-1

votes

1 answer

How to evaluate q-value network of sac agent in stable baselines (on a state-action pair)?

I am implementing SAC agent in stable baselines and need to evaluate q-value network inside my custom environment.I have tried to get the q-values from SAC class object, but failed. Any method or function like the one with PPO (.value) will be very…

machine-learning reinforcement-learning stable-baselines

asked Jul 16 '22 at 15:34

Moiz Ahmad Muhammad Khawar Sae

-2

votes

1 answer

Reinforcement learning agent picking same action for every state - OpenAI gym environment with stable_baselines3

I have been training a reinforcement learning agent to play ultimate-tictactoe (an expanded version of tic-tac-toe with a 9x9 board and additional rules). I've created an openai gym environment, and have been trying to train the agent using…

python reinforcement-learning openai-gym stable-baselines

asked May 04 '23 at 08:13

Samatva K

Prev 1 2 3

…