I need to train an RL agent, that have to control some switches. Lets imagine, that we have n switches, that could be turned on (1) or turned off (0). my agent have to decide in each step which one of the to torn of, and turn of, so i want action look like this: [1,0,1,....1] - vector that consists of n binary elements.
How can i train an agent? Something beside DQN with converting given action space to discreet one with 2^n actions possible?