Naive Bayes Classification: Understanding example correctly?

Asked May 29 '16 at 18:11

I am currently looking into the multinomial model for Naive Bayes classification, and have come across the following example:

I think I understand everything, but I have developed the following reasoning I would like confirmed:

For a given class c and a document d consisting of terms t1, t2, ..., tn, here is how I would calculate P(c|d):

P(class | doc) ∝ (prior[c]) * (prob[t1 in c]) * (prob[t2 in c]) * ... * (prob[tn in c])
P(!class | doc) ∝ (prior[!c]) * (prob[t1 in !c]) * (prob[t2 in !c]) * ... * (prob[tn in !c])

Is this correct? If so, is this why the exponent 3 appears on both (3/7) and (2/9), which denote P(Chinese|c) and P(Chinese|!c): because 'Chinese' appears three times in d5, so its probability is multiplied in once per occurrence?
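
To make the reasoning concrete, here is a minimal Python sketch of the calculation as I understand it. Only the two probabilities for 'Chinese' (3/7 and 2/9) and the fact that 'Chinese' occurs three times in d5 come from the example above; the priors, the other two terms in d5, and their probabilities are assumed values for illustration, and `unnormalized_score` is a hypothetical helper name.

```python
from collections import Counter
from fractions import Fraction


def unnormalized_score(prior, cond_prob, tokens):
    """Return prior * product over tokens of P(term | class).

    The result is proportional to P(class | doc), not equal to it: the
    shared denominator P(doc) is left out because it is the same for
    every class and does not change which class scores higher.
    """
    score = prior
    # Counter groups repeated terms, so a term seen k times contributes p**k.
    for term, count in Counter(tokens).items():
        score *= cond_prob[term] ** count
    return score


# Assumed test document: three occurrences of 'Chinese' (from the question)
# plus two further terms that are placeholders for illustration.
d5 = ["Chinese", "Chinese", "Chinese", "Tokyo", "Japan"]

# Assumed priors; the question does not state them.
prior_c, prior_not_c = Fraction(3, 4), Fraction(1, 4)

# 3/7 and 2/9 come from the question; the other entries are assumptions.
prob_c = {"Chinese": Fraction(3, 7), "Tokyo": Fraction(1, 14), "Japan": Fraction(1, 14)}
prob_not_c = {"Chinese": Fraction(2, 9), "Tokyo": Fraction(2, 9), "Japan": Fraction(2, 9)}

score_c = unnormalized_score(prior_c, prob_c, d5)
score_not_c = unnormalized_score(prior_not_c, prob_not_c, d5)

# Dividing by the sum of both scores recovers the actual posterior P(c | d5).
posterior_c = score_c / (score_c + score_not_c)

print(float(score_c), float(score_not_c), float(posterior_c))
```

Under these assumed numbers the factor for 'Chinese' is (3/7)**3 for c and (2/9)**3 for !c, which is exactly where the power of 3 would come from. For longer documents the product underflows quickly, so summing logarithms of the same factors is the usual substitute.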

Thank you in advance.

yulai

You should probably ask this on stats.stackexchange.com – Pavel May 29 '16 at 18:41
Your understanding is correct (both how you calculate P(c|d) and why you need the power of 3). :) – greeness May 30 '16 at 22:25
