I was wondering if it was possible to implement a recurrent network in Theano in the case where inputs are not known initially. Specifically, I have in mind the 'Recurrent Models of Visual Attention' paper (http://arxiv.org/abs/1406.6247) and the part concerning game playing. In this case, each game image is only available after the network has outputted an action.
As I understand, RNN's in Theano are implemented using theano.scan function, which expects a sequence as an input. Obviously, I can't produce such a sequence of game images without running the full recurrent loop and recording the actions that would be generated. And I can't run the loop and generate the sequence of actions, since I don't have the sequence of game images to pass as an input.
So, it would seem that under those conditions I can't use a proper back propagation and train the network correctly. I could run each iteration of the loop manually, but then there would be no BPTT.
What am I missing here? Is it possible to implement the algorithm in the paper describing the game playing part in Theano (I've seen implementations of digit classification part, but it's easier, since the input never changes)?
Thanks.