Questions tagged [q-learning]

Q-learning is a model-free reinforcement learning technique.

Q-learning is a model-free, on-policy reinforcement learning technique that aims to find an action-value function that gives the expected utility (reinforcement) of taking a given action in a given state and following a fixed policy thereafter.

One of the strengths of Q-learning is that it needs only a reinforcement function to be given (i.e. a function which tells how well, or how bad the agent is performing). During the learning process, the agent needs to balance exploitation (acting greedily in terms of current action-value function) vs exploration (action randomly to discover new states or better actions then currently estimated). A common simple example for handling this issue is using an epsilon-greedy policy.

447 questions

-1

votes

1 answer

Knowledge from Past Experiences in Q-Learning

I had been going through numerous articles of Reinforcement Learning - more specifically Q-Learning. The area where I'm stuck is how does it learns from past experiences? I came across a concept called experience-replay where it actually learns from…

asked Jan 17 '18 at 04:16

pythonic_autometeor

-1

votes

1 answer

Q Learning Techniuqe for not falling in fires

Please take a look at picture below : My Objective is that the agent rotating and moving in the environment and not falling in fire holes, I have think like this : Do for 1000 episodes: An Episode : start to traverse the environment; if falls into…

machine-learning reinforcement-learning q-learning

asked Nov 09 '15 at 19:52

S.A.Parkhid

2,772
6
28
58

-1

votes

1 answer

Line Following Robot using JavaScript, Arduino and Q-Learning

I'm hoping to create a line following robot that uses Q-Learning. My intention is to use/build a robot based upon Arduino parts, while using JavaScript for the programming side. At the time of writing I've been googling around a bit, and think…

javascript arduino robot q-learning

asked Oct 05 '14 at 18:03

Mark

-2

votes

0 answers

Problem with training Deep Q-learning model on local machine

I am attempting to solve an OpenAI Gym environment using Deep Q Learning with TensorFlow on an M1 Mac. The model seems to start fitting well but ends up performing worse every time. In contrast, when I used the same code in Google Colab, it worked…

tensorflow google-colaboratory apple-m1 openai-gym q-learning

asked Aug 12 '23 at 09:10

Sid

-2

votes

1 answer

How can I Find Walking Paths for Different People in a Graph With Reinforcement Learning?

I don't know it is possible or not with reinforcement learning but my question is about finding walking paths for different people in a graph. A sample image that I draw to help the problem description is here: The problem is that: There is a…

deep-learning artificial-intelligence reinforcement-learning path-finding q-learning

asked Jul 23 '21 at 18:47

Gamzedeyim

-2

votes

2 answers

Variable updating wrong in loop - Python (Q-learning)

Why does the position and newposition give the same output and update together in the next loop? for game in range(nr_of_games): # Initialize the player at the start position and store the current position in position …

python q-learning

asked May 18 '21 at 18:31

GAUSS

-2

votes

1 answer

Get state of TicTacToe board in Q-Learning

I'm just getting into reinforcement learning and q-learning, and I wanted to try and create a Tic-Tac-Toe AI. With a Q-Table, I need to find the "state" of the board, and I was having trouble finding a way to do this. For extra clarification, a…

python arrays numpy reinforcement-learning q-learning

asked Jun 11 '20 at 13:41

CircuitSacul

1,594
12
32

-3

votes

1 answer

Deep Q learning Replay method Memory Vanishing

In the Q-learning algorithm used in Reinforcement Learning with replay, one would use a data structure in which it stores previous experience that is used in training (a basic example would be a tuple in Python). For a complex state space, I would…

python machine-learning deep-learning reinforcement-learning q-learning

asked Jun 27 '18 at 18:33

codroneci

-3

votes

1 answer

Q-learning R has length zero

I am trying to inplement a simulation of a simplified blackjack game that will return the best policy at each state s. The blackjack simulation seems to work properly, but i somehow get an error when trying to apply the Q learning algorithm to…

r machine-learning reinforcement-learning q-learning

asked May 18 '18 at 20:35

Lucien Ledune

-3

votes

2 answers

C++ Problems Store array in hashtable

I am currently doing a research about implement a C++ programe by using Q-learning algorithm to help the agent get the reward. I am trying to use the Hashtable to store my states and Actions. I am not familiar with the C++ programming... What i am…

c++ arrays dictionary hashtable q-learning

asked May 01 '17 at 06:20

Junwen Xie

-5

votes

1 answer

train a neural network on real subject input/output to have it behave similarly to subject

The goal is to create an AI to play a simple game, tracking a horizontally moving dot across the screen which increases speed until no longer tracked. I would like to create an AI to behave similarly to a real test subject. I have a large amount of…

machine-learning neural-network deep-learning artificial-intelligence q-learning

asked Jun 04 '19 at 14:57

Michael1997

Prev 1 2 3

…