I am currently working with Scikit Learn and have been running into the following issue while trying to train a Gaussian HMM:
File "/Library/Python/2.7/site-packages/sklearn/hmm.py", line 443, in fit
self._do_mstep(stats, self.params)
File "/Library/Python/2.7/site-packages/sklearn/hmm.py", line 798, in _do_mstep
super(GaussianHMM, self)._do_mstep(stats, params)
File "/Library/Python/2.7/site-packages/sklearn/hmm.py", line 580, in _do_mstep
np.maximum(self.startprob_prior - 1.0 + stats['start'], 1e-20))
File "/Library/Python/2.7/site-packages/sklearn/hmm.py", line 476, in _set_startprob
raise ValueError('startprob must sum to 1.0')
ValueError: startprob must sum to 1.0
If I eliminate some of the features (fewer than 13 features per observation), it still works. I have checked that all of the input is valid and consists of only 2d-arrays of numpy.float64s for each training example. Any ideas on what is going wrong? Thanks!