Suppose the action space is a game with 5 doors and you can choose 2 and only 2 at each step. How could that be represented as an action_space?
self.action_space = spaces.Box( np.array([0,0,0,0,0]), np.array([+1,+1,+1,+1,+1])) #
Using the above method the action_space could be none [0 0 0 0 0] or all [1 1 1 1 1] or anything in between. I am trying to force the action to choose only 2 doors.
Examples of correct actions:
[1 1 0 0 0]
[1 0 1 0 0]
etc.