Questions tagged [mallet]

MALLET is a Java-based package for statistical natural language processing, document classification, clustering, topic modeling, information extraction, and other machine learning applications to text.

From Mallet's website:

MALLET includes sophisticated tools for document classification: efficient routines for converting text to "features", a wide variety of algorithms (including Naïve Bayes, Maximum Entropy, and Decision Trees), and code for evaluating classifier performance using several commonly used metrics.

In addition to classification, MALLET includes tools for sequence tagging for applications such as named-entity extraction from text. Algorithms include Hidden Markov Models, Maximum Entropy Markov Models, and Conditional Random Fields. These methods are implemented in an extensible system for finite state transducers.

In addition to sophisticated Machine Learning applications, MALLET includes routines for transforming text documents into numerical representations that can then be processed efficiently. This process is implemented through a flexible system of "pipes", which handle distinct tasks such as tokenizing strings, removing stopwords, and converting sequences into count vectors.

321 questions
0
votes
1 answer

Mallet CRF SimpleTagger phrases/multi words

I am a newbies to Mallet, I am trying use mallet Simple tagger/CRF and experimenting with phrases - I tried lookup the documentation on mallet site and also went through the user archives - nothing helped. I tried training mallet for simple tagging,…
rtuser
  • 33
  • 1
  • 5
0
votes
1 answer

Error: Could not load or find main class language

I am new to Mallet I have installed Mallet in my computer. I have set the class path correctly (I verified it by printing the class path) I want to execute the following command mallet import-smvlight file1 file2.mallet Everytime I try running a…
Rahul Dhamecha
  • 23
  • 1
  • 1
  • 7
0
votes
1 answer

Changing Target Alphabet in Mallet: Alphabets Don't Match

I am having difficulty getting help from the Mallet dev list, so I am trying here. I have an InstancesList with a target alphabet of {A, B, C} and I need to change the target alphabet for another analysis to {A, NOT_A}. So far, I have the following…
Ryan R. Rosario
  • 5,114
  • 9
  • 41
  • 56
-1
votes
1 answer

Latent Dirichlet Allocation and Analyzing Two Data Sets using MALLET

I am currently analyzing two datasets. Dataset A has about 600000+ documents whereas Dataset B has about 7000+ documents. Does this mean that the topic outputs will be more about Dataset A because it has a larger N? The output of mallet in…
-1
votes
1 answer

Mallet download issues

I am really new to this type of computer work and am having trouble creating the path. I looked up a tutorial on how to do this online and followed everything to a tee, but when I try to change the directory path it is not working. Here's what I did…
-1
votes
1 answer

Classification using Mallet and MaxEntropy

I want to do preprocessing of docs(wsdl files) using mallet in Eclipse. I want to generate feature vectors and perform classification using mallet and MaxEntropy. I am new in using mallet, Can anyone guide me in this regard. Thanks
sid
  • 9
  • 5
1 2 3
21
22