0

I created an LDA visualization using pyLDAvis and was wondering what "token" means in the title of the bar graph (i.e., Topic 3 (14% of tokens)). I read the pyLDAvis documentation, but could not find an explanation. Does this mean that 14% of all the documents in the corpus fit into this topic or does it have to do with the distribution of words?

Thank you for the help.

Example bar graph that I generated using pyLDAvis

ac12
  • 1
  • What have you tried so far? – U13-Forward Jan 03 '22 at 07:45
  • Hi there - I read the pyLDAvis documentation and did a Google search, but was not getting a consistent answer. Now that I think about it, I do not think the percentage represents the percent of documents in the corpus that are in the topic, but rather the percentage of words in the bag of words. Just wanted to get confirmation if I am interpreting that correctly. Thank you. – ac12 Jan 03 '22 at 15:20
  • Any insight you have here is appreciated. Thanks. – ac12 Jan 05 '22 at 22:56

0 Answers0