11

A quick Google search reveals that there are a good number of Bayesian classifiers implemented as Python modules. If I want wrapped, high-level functionality similar to dbacl, which of those modules is right for me?

Training

% dbacl -l one sample1.txt
% dbacl -l two sample2.txt

Classification

% dbacl -c one -c two sample3.txt -v
one
lejlot
Dan

3 Answers

9

I think you'll find NLTK helpful, specifically its classify module.
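As a rough sketch of how the dbacl workflow from the question might map onto NLTK, the snippet below trains `nltk.NaiveBayesClassifier` on two categories and classifies a new text. The training strings are hypothetical stand-ins for the contents of sample1.txt and sample2.txt (one example per line), and the bag-of-words `features` helper is my own assumption, not something NLTK or dbacl prescribes.

```python
from nltk import NaiveBayesClassifier


def features(text):
    """Bag-of-words featureset: each lowercased word maps to True."""
    return {word: True for word in text.lower().split()}


# Hypothetical stand-ins for sample1.txt and sample2.txt,
# with one training example per line.
training = {
    "one": ["the cat sat on the mat", "dogs and cats are pets"],
    "two": ["stocks fell on the market today", "the bank raised interest rates"],
}

# Rough analogue of `dbacl -l one ...` and `dbacl -l two ...`:
# build (featureset, label) pairs and train on them.
labeled = [
    (features(line), label)
    for label, lines in training.items()
    for line in lines
]
classifier = NaiveBayesClassifier.train(labeled)

# Rough analogue of `dbacl -c one -c two sample3.txt -v`:
# print the most likely category for a new text.
print(classifier.classify(features("my cat likes the mat")))
```

Splitting on whitespace is the simplest possible tokenizer; for real text you would probably want something more careful, and the feature extraction is where most of the tuning tends to happen.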

theycallmemorty
  • I don't believe NLTK includes a feature selection algorithm, but it has frequency distribution tools and the like, so you can write your own and feed the result to the classifier of your choice. – apexdodge Dec 28 '10 at 05:55
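One way to read the comment above is frequency-based feature selection: count words across the training texts, keep only the most common ones, and build featuresets from that vocabulary. The sketch below uses `collections.Counter` from the standard library (NLTK's `FreqDist` offers a similar counting interface). The helper names `select_vocabulary` and `features`, the sample documents, and the cutoff of 3 are all assumptions made for illustration.

```python
from collections import Counter


def select_vocabulary(documents, size):
    """Return the `size` most frequent lowercased words across all documents."""
    counts = Counter(word for doc in documents for word in doc.lower().split())
    return {word for word, _ in counts.most_common(size)}


def features(text, vocabulary):
    """Featureset restricted to the selected vocabulary."""
    return {word: True for word in text.lower().split() if word in vocabulary}


# Hypothetical training texts, for illustration only.
documents = [
    "spam spam offer now",
    "offer ends now",
    "meeting notes attached",
    "spam offer",
]

vocabulary = select_vocabulary(documents, 3)
print(sorted(vocabulary))
print(features("spam meeting now", vocabulary))
```

The resulting dictionaries have the shape that featureset-based classifiers generally expect, so they can be paired with labels and handed to whichever classifier you pick. Raw frequency is a crude criterion; it is used here only because it is the simplest thing a frequency distribution gives you.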
0

If you're trying to detect language, this works fine even with pretty short texts.

The API is pretty close to yours, but I don't know whether it counts as a Bayesian classifier.

fulmicoton
-1

Try Mallet and LingPipe. They provide more models for the classifier.

Vikrant Sagar