Questions tagged [stable-baselines]

Stable Baselines is a Python library of reinforcement learning algorithm implementations, forked from OpenAI Baselines. Please mention the exact version of Stable Baselines being used in the body of the question.

277 questions
0
votes
1 answer

Unable to see stable-baselines output?

Greetings! I am new to stable-baselines3, but I have watched numerous tutorials on its implementation and on custom environment formulation. After developing my model using gym and the stable-baselines3 SAC algorithm, I applied the check_env function to…
0
votes
1 answer

I'd like to get the episodic rewards in CSV format in Stable Baselines 3

I want to retrieve the data after every episode. I've read in the documentation that you can use stable_baselines3.common.monitor.ResultsWriter, but I don't know how to implement it in my code. import gym import numpy as np import…
0
votes
1 answer

How do I specify model.learn() to end within a certain number of episodes in Stable Baselines 3?

I know that total_timesteps= is a required parameter, but how do I end model.learn() within a certain number of episodes? Forgive me, I'm still new to stable_baselines3 and PyTorch and not yet sure how to implement it in code. import gym import numpy…
0
votes
1 answer

Stable Baselines3 PPO model train() freezes?

I'm trying to have my RL model play a game, but I've encountered a peculiar problem. I am kind of new to all this, so maybe it's stupid, but: my environment and everything are set up nicely, and when testing it works like a charm. I can see the inputs…
0
votes
2 answers

The right way to install stable-baselines?

I am trying to install stable-baselines and run the first two lines from the Getting Started section of the online manual, but no option is working. I started with pip install stable-baselines. Now when I run: import gym from…
Simd
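For reference, a typical install sequence — a sketch, since exact pins depend on your Python version: the original stable-baselines (v2) requires TensorFlow 1.x, which only supports Python 3.7 or older, while stable-baselines3 is its maintained PyTorch-based successor.

```shell
# stable-baselines (v2) depends on TensorFlow 1.x (Python <= 3.7 only)
pip install "tensorflow==1.15.*"
pip install "stable-baselines[mpi]"

# on current Python versions, prefer the maintained successor:
pip install stable-baselines3
```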
0
votes
0 answers

How to use NumPy version 1.19.5 with TensorFlow and not get a numpy.ndarray size changed error?

I have been trying to use stable_baselines on my new M1-chip computer; however, after installing all of the packages from some sample code I found, I kept getting this error: ValueError: numpy.ndarray size changed, may indicate binary…
0
votes
1 answer

Reinforcement Learning - Custom environment implementation in Java for Python RL framework

I have a bunch of Java code that constitutes an environment and an agent. I want to use one of the Python reinforcement learning libraries (stable-baselines, tf-agents, rllib, etc.) to train a policy for the Java agent/environment. And then deploy…
ak.
0
votes
1 answer

How do I get a gym.spaces.Dict state updated?

"AttributeError: 'dict' object has no attribute 'flatten'". I get this error when I run the following code: import math from gym import Env from gym.spaces import Discrete, Box, Dict, Tuple, MultiBinary, MultiDiscrete from stable_baselines3 import…
0
votes
1 answer

GNN with Stable Baselines

I am looking to use DGL or PyTorch Geometric for building my policy and value networks in Stable Baselines; however, I am struggling to figure out how to send over observations. The observations must be one of the gym spaces classes, but I am not sure…
0
votes
1 answer

stable_baselines3 doesn't store the tensorboard_log

I am just getting into reinforcement learning. My model doesn't create any files in the given directory. What am I doing wrong? def train(): model = PPO('MlpPolicy', env, verbose=1, tensorboard_log=log_path) …
msba
0
votes
1 answer

Stable Baselines MultiInputPolicy

I tried to use the MultiInputPolicy with: model = PPO("MultiInputPolicy", env, verbose = 1) But I get an error: KeyError: "Error: unknown policy type MultiInputPolicy,the only registed policy type are: ['MlpPolicy', 'CnnPolicy']!" Please help.…
CMOS-Y
0
votes
1 answer

How to make the model learn in a loop using stable-baselines3?

In the sample code on the Stable Baselines3 website (https://stable-baselines3.readthedocs.io/en/master/modules/ppo.html), the model first learns via the model.learn(total_timesteps=25000) line, and then it can be used in the playing loop. Now, as I…
mac179
0
votes
0 answers

Use LSTM in stable baselines

I'm using PPO2 from Stable Baselines for RL. My observation space has a shape of (100, 10). I would like to replace the network used in the policy with an LSTM; do you know if that's possible? Thanks
0
votes
1 answer

Custom OpenAI Gym Environment with Stable Baselines

I am trying to create a simple 2D grid-world OpenAI Gym environment in which the agent heads to the terminal cell from anywhere in the grid. For example, in the 5x5 grid world, X is the current agent location and O is the terminal cell where…
HW Siew
0
votes
1 answer

MlpPolicy only returns 1 and -1 with action space [-1, 1]

I am trying to use Stable Baselines to train PPO2 with MlpPolicy. After 100k timesteps, I only get 1 and -1 as actions. I restricted the action space to [-1, 1] and use the action directly as the control. I don't know if it is because I directly use the action as…