Questions tagged [q-learning]

Q-learning is a model-free reinforcement learning technique.

Q-learning is a model-free, on-policy reinforcement learning technique that aims to find an action-value function that gives the expected utility (reinforcement) of taking a given action in a given state and following a fixed policy thereafter.

One of the strengths of Q-learning is that it needs only a reinforcement function to be given (i.e. a function which tells how well, or how bad the agent is performing). During the learning process, the agent needs to balance exploitation (acting greedily in terms of current action-value function) vs exploration (action randomly to discover new states or better actions then currently estimated). A common simple example for handling this issue is using an epsilon-greedy policy.

447 questions
-1
votes
1 answer

Knowledge from Past Experiences in Q-Learning

I had been going through numerous articles of Reinforcement Learning - more specifically Q-Learning. The area where I'm stuck is how does it learns from past experiences? I came across a concept called experience-replay where it actually learns from…
-1
votes
1 answer

Q Learning Techniuqe for not falling in fires

Please take a look at picture below : My Objective is that the agent rotating and moving in the environment and not falling in fire holes, I have think like this : Do for 1000 episodes: An Episode : start to traverse the environment; if falls into…
S.A.Parkhid
  • 2,772
  • 6
  • 28
  • 58
-1
votes
1 answer

Line Following Robot using JavaScript, Arduino and Q-Learning

I'm hoping to create a line following robot that uses Q-Learning. My intention is to use/build a robot based upon Arduino parts, while using JavaScript for the programming side. At the time of writing I've been googling around a bit, and think…
Mark
  • 455
  • 1
  • 5
  • 11
-2
votes
0 answers

Problem with training Deep Q-learning model on local machine

I am attempting to solve an OpenAI Gym environment using Deep Q Learning with TensorFlow on an M1 Mac. The model seems to start fitting well but ends up performing worse every time. In contrast, when I used the same code in Google Colab, it worked…
-2
votes
1 answer

How can I Find Walking Paths for Different People in a Graph With Reinforcement Learning?

I don't know it is possible or not with reinforcement learning but my question is about finding walking paths for different people in a graph. A sample image that I draw to help the problem description is here: The problem is that: There is a…
-2
votes
2 answers

Variable updating wrong in loop - Python (Q-learning)

Why does the position and newposition give the same output and update together in the next loop? for game in range(nr_of_games): # Initialize the player at the start position and store the current position in position …
GAUSS
  • 3
  • 6
-2
votes
1 answer

Get state of TicTacToe board in Q-Learning

I'm just getting into reinforcement learning and q-learning, and I wanted to try and create a Tic-Tac-Toe AI. With a Q-Table, I need to find the "state" of the board, and I was having trouble finding a way to do this. For extra clarification, a…
CircuitSacul
  • 1,594
  • 12
  • 32
-3
votes
1 answer

Deep Q learning Replay method Memory Vanishing

In the Q-learning algorithm used in Reinforcement Learning with replay, one would use a data structure in which it stores previous experience that is used in training (a basic example would be a tuple in Python). For a complex state space, I would…
-3
votes
1 answer

Q-learning R has length zero

I am trying to inplement a simulation of a simplified blackjack game that will return the best policy at each state s. The blackjack simulation seems to work properly, but i somehow get an error when trying to apply the Q learning algorithm to…
-3
votes
2 answers

C++ Problems Store array in hashtable

I am currently doing a research about implement a C++ programe by using Q-learning algorithm to help the agent get the reward. I am trying to use the Hashtable to store my states and Actions. I am not familiar with the C++ programming... What i am…
Junwen Xie
  • 15
  • 4
-5
votes
1 answer

train a neural network on real subject input/output to have it behave similarly to subject

The goal is to create an AI to play a simple game, tracking a horizontally moving dot across the screen which increases speed until no longer tracked. I would like to create an AI to behave similarly to a real test subject. I have a large amount of…
1 2 3
29
30