I want to estimate the Transition Probability Matrix for a first order Markov chain from a given set of data sequences (i.e. clickstream data). Possibly in java, otherwise Matlab is ok.
I have each sequence in a different file (but of course I can merge everything in a single one) and one of the issues is that I don't have a standard length for the sequences. I Know the state space and I'm only interested in the state transitions.
I've read this: Estimate Markov Chain Transition Matrix in MATLAB With Different State Sequence Lengths but i'm not sure it fits to my problem. I was also wondering if there are Java libraries that handle this issues. If so, I wasn't able to find them.