
I am planning to implement a spam filter using a Naive Bayesian classification model.

Online I see a lot of information on Naive Bayesian classification, but the problem is that it is mostly mathematics rather than a clear statement of how it's done. I am more of a programmer than a mathematician. (Yes, I learnt probability and Bayes' theorem back in school, but I have been out of touch for a long time, and I don't have the luxury of relearning it now: I have nearly 3 weeks to come up with a working prototype.)

So if someone can explain it, or point me to a place where it is explained for programmers rather than mathematicians, it would be a great help.

PS: By the way, I have to implement it in C, if you want to know. :(

Regards, Microkernel

merv
Microkernel
  • Suck it up and learn the math. If you don't, you'll never have a hope of improving yourself; you will always merely be a pawn in a larger game. – ldog Apr 12 '10 at 17:47
  • @gmatt: Yes, I agree with you. But it's not that I am not good at math or that I hate math. Incidentally, my favourite subject at school used to be math. But right now I don't have time to relearn and implement it (consider: I have at most 3 weeks to do this, it has to be implemented in C, it is a project for a weekend class, and apart from this I have a full-time job which, needless to say, is sucking the vitality out of me). But I still strongly agree with you and I really want to learn. Thanks :) – Microkernel Apr 12 '10 at 18:55

5 Answers


The book Programming Collective Intelligence has a chapter that covers this and other methods. The chapter (#6) can be understood without reference to previous chapters, is written clearly, and discusses only the minimal mathematics necessary to get the job done.

tom10

You could try this website. It's got some source code.

abatishchev
aduric

I would highly recommend Andrew Moore's tutorials, and I think you should start with this one.

Jacob

You could also take a look at POPFile, an open source spam filter engine.

Peter Lillevold

Have you looked at dspam?

http://dspam.irontec.com/faq.shtml#1.0

http://www.nuclearelephant.com/

NinjaCat