Why are we using ARMA model that mixes AR and MA model. Isn't AR or MA sufficient?
I know that AR model is a function of previous readings and MA model is a function of previous errors, also know that identifying AR model is best done with PACF and know that identifying MA model is best done with ACF (and identifing AR model is best done with PACF)... But what is the intiuition behind using mixed model between the two