I'm stuck. I came across Karpathy's post, Training a Neural Network ATARI Pong agent with Policy Gradients from raw pixels, and copied all of the code from it. It runs fine, with one problem: the results are printed, but the Pong gameplay (gym[atari] in that post) is never displayed.
Meanwhile, if I only test the Atari environment with...
import gym
env = gym.make('SpaceInvaders-v0')
env.reset()
for _ in range(1000):
    env.step(env.action_space.sample())
    env.render('human')
...it just runs for roughly 30 seconds and then crashes, and I don't know why. My OS is Windows 10 32-bit, and I have Python 2.7.6 installed.
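For comparison, here is a minimal sketch of the same test loop that also resets the environment whenever an episode ends and closes the window when it is done. It assumes the classic gym API, where `env.step` returns `(observation, reward, done, info)`. The function name `run_random_steps` and its parameters are made up for illustration, and nothing here confirms that a missing reset is what causes the crash.

```python
def run_random_steps(env, max_steps=1000, render=True):
    """Take random actions in `env`, resetting whenever an episode ends.

    Returns the number of episodes that finished within `max_steps`.
    `env` is assumed to follow the classic gym interface:
    reset(), step(action), render(), close(), action_space.sample().
    """
    env.reset()
    episodes_finished = 0
    for _ in range(max_steps):
        if render:
            env.render()
        result = env.step(env.action_space.sample())
        # Assumption: the third element of the step result is the `done` flag.
        done = result[2]
        if done:
            # Start a fresh episode instead of stepping a finished one.
            episodes_finished += 1
            env.reset()
    # Release the render window and emulator resources.
    env.close()
    return episodes_finished
```

With gym[atari] installed, this could be called as `run_random_steps(gym.make('SpaceInvaders-v0'))` after `import gym`.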
And if I run the entire code from Karpathy's post, it only prints the results without showing any gameplay. Here is the source code: Karpathy's code
Thanks for viewing my question. Please help me.