I am trying to write down the MAP updates for the EM in case of mixtures of Bernoulli distributions.
I know that for ML estimates, we have:
E-step: compute P(Z|X,p,t)
M-Step: (p,t)<-argmax sum(over Z): p(Z|X,p,t)log p(X,Z|p,t)
where p are the vectors parameters for each class (K of them, each of size D, where K is the number of classes and D is the number of features) and t are the multinomial parameters for each class.
But how do I get MAP estimates? What would p(X) be...?