I'm just getting into reinforcement learning and q-learning, and I wanted to try and create a Tic-Tac-Toe AI. With a Q-Table, I need to find the "state" of the board, and I was having trouble finding a way to do this.
For extra clarification, a state is a number that represents the current board, including the value of each of the nine squares.
A board that looks like:
[[0, 0, 0],
[0, 0, 0],
[0, 0, 0]]
would be state 0, as it is the first board. Beyond this, I am not sure how to calculate the state of the board based on the array.
[EDIT] I'm coming here because I honestly don't know where to start; I can't find anything on the web, and if you dislike my question you could at least tell me why.