I have been able to train and test a shared RL model that produces a Actor.h5 and Critic.h5 file, as well as a json containing the parameters for training.
I am now at the stage where I would like to make a prediction on the next best action for the model to take, based on the 50 previous time series data points. One was I have been doing this is to use a modified version of my test function and reducing the data frame a little larger than my 50 lookback window, then taking the final prediction/action in the episode of testing.
I think this is a work around and was looking for some advice on the methodology to approach this.
Thanks.