I have a large Markov chain and a sample, for which I want to calculate the likelihood. The problem is that some obervations or transitions in the sample don't occur in the Markov chain, which makes the total likelihood 0 (or the log-likelihood - infinity). It is not possible to use more data to construct the Markov chain. I was wondering if there's a way to still have a meaningfull likelihood.
I tried already to filter out these "unknown" observations in the sample and report them seperately. But the problem with that is that I want to compare the likelihood of the sample with the likelihood of the same sample, but after a transformation. The transformed sample has a different amount of "unknown" observations. So I don't think I can compare these two likelihoods, seeing as they have been calculated with a different amount of observations.
Is there a way to still calculate a meaningfull likelihood that can be compared? I was thinking about averaging the probabilities of the observations in the sample, but I can't find anything about that being correct.
Thanks in advance!