I need to solve a forecasting problem I have in my company. I am going to use the example of racing cars because it very closely matches the problem I am trying to solve:
Let's say I have data from hundreds of races and the distribution of time taken per car to complete the race is gamma distributed. I guess this is my prior. (The assumption is based on the same cars racing on the same track over and over).
In today's race all cars are required to use a slightly different fuel. Ie all vehicles will be impacted equally making them all slightly slower or faster but the outcome is still expected to be a gamma distribution (shifted to the left or right).
All cars start the race together. ie this is not an independent dataset!
As each car crosses the finish line, I want to be able to estimate the new gamma distribution parameters. Ie when the 5th and 6th and 7th car cross the line slightly faster than the 5th, 6th and 7th car in the previous races, we can predict that the race as a whole will be slightly faster.
What is the approach I should take?
We have a very similar real-world problem and I need to be able to forecast when 50%, 75% and 95% of the events will be completed. Some batches are slightly faster than others (we don't fully understand why) but the predicted outcome is fairly evident early in the batch. I need to write some code to automate the prediction.
Many thanks for your help!