Is MaxQ' sum of all possible rewards or highest possible reward?

Question

I'm coding a simple Q-learning example, and updating the Q-values requires a maxQ' term.

I'm not sure if maxQ' is referring to the sum of all possible rewards or the highest possible reward:

Please put *all* relevant information *in* the question. *Not* behind external links. — Jesper Juhl, Jul 01 '19 at 15:57

score 1 · Accepted Answer · answered Jul 01 '19 at 21:39

Neither: maxQ' is the maximum Q-value among all possible actions in the next state s'. It is not a sum, and it is taken over Q-values rather than raw rewards. In other words, you take the max of Q(s', a') over all valid actions a' in state s'.
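As a rough illustration, here is a minimal sketch of how that max could be computed inside a tabular update. It assumes the Q-table is a nested dict `Q[state][action]`; the names `max_q_next`, `q_update`, `alpha`, and `gamma` are made up for this example and are not taken from the question's code.

```python
def max_q_next(Q, next_state, valid_actions):
    """Return max over a' of Q(s', a').

    Assumption: Q is a nested dict Q[state][action] -> float.
    If there are no valid actions (e.g. s' is terminal), fall back to 0.0.
    """
    return max((Q[next_state][a] for a in valid_actions), default=0.0)


def q_update(Q, state, action, reward, next_state, valid_actions,
             alpha=0.1, gamma=0.9):
    """Apply one tabular Q-learning update in place and return the new value."""
    # The target uses the single highest Q-value in s', not a sum.
    target = reward + gamma * max_q_next(Q, next_state, valid_actions)
    Q[state][action] += alpha * (target - Q[state][action])
    return Q[state][action]


# Hypothetical two-state, two-action table for illustration.
Q = {0: {0: 0.0, 1: 0.0}, 1: {0: 1.0, 1: 3.0}}
new_value = q_update(Q, state=0, action=1, reward=1.0,
                     next_state=1, valid_actions=[0, 1],
                     alpha=0.5, gamma=0.5)
```

In this toy table the next state has Q-values 1.0 and 3.0, so maxQ' is 3.0 (not 4.0, which is what summing would give).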

Afshin Oroojlooy
