My question is related to the Backward algorithm.
The recursive formula for the algorithm is as follows:
sigma j = 1 to N (t+1(j) * aij * bj(Ot+1))
Where t+1(j) is the recursive element, aij is the transition probability from state i to j, and bj(Ot+1) is the omission probability of the observation O at time t+1.
Given the above, when I start calculating the backward probabilities, it seems that it does not matter what the first observation is, since the observation at time t is not taken into consideration for calculation of the corresponding backward probability.
For Example, for a sequence of observations A,T,G,A, it does not matter what the first observation , i.e, A in this case, is since its emission probability is not taken into consideration in the backward algorithm.
I just want to know if my intuition is correct. Otherwise please point me to some reference or explanation which will clarify my doubt.