How to evaluate the best K for LDA using Mallet?

Asked Jul 30 '15 at 16:26

Active Aug 03 '15 at 12:10

Viewed 1,904 times

I am using Mallet api to extract topic from twitter data and I have already extracted topics which are seems good topic. But I am facing problem to estimating K.

For example I fixed K value from 10 to 100. So, I have taken different number of topics from the data. But, now I would like to estimate which K is best. There are some algorithm I know as

Perplexity
Empirical likelihood
Marginal likelihood (Harmonic mean method)
Silhouette

I found a method model.estimate() which may be used to estimate with different value of K. But I am not getting any idea to show the value of K is best for the model. Does anyone give some idea about it with some sample code? Thanks.

asked Jul 30 '15 at 16:26

Khaled

consider gensim Hdpmodel (Hierarchical Dirichlet Processes). – sachinruk Feb 09 '17 at 03:40

How to evaluate the best K for LDA using Mallet?

0 Answers0

Linked