I want to implement a Kalman filter to predict the actual daily number of dine-in customers in a restaurant with the two sets of time series data below.
- Daily total number of people entering the restaurant: This is not exactly the same as the number of dine-in customers because it also includes the number of staffs and take-away customers entering the restaurant.
- Daily total number of main dishes sold (dine-in): This is not exactly the same as the number of dine-in customers because some customers may order more than one main dish while some customers may not order any main dish at all.
With the above, how should I set the equations for implementing a Kalman filter to make the best guess of the actual number of dine-in customers?