7

Does anyone know where I can find an example of how to use the SimpleLemmatizer() class in the OpenNLP library, and where I can find a sample english dictionary? It appears to be missing from the documentation.

pYr0
  • 159
  • 2
  • 9

2 Answers2

10

You can download Dictionary from here - en-lemmatizer.dict

Example :

import opennlp.tools.lemmatizer.SimpleLemmatizer;

private static SimpleLemmatizer lemmatizer;

private String lemmatize(String word, String postag) throws IOException {
    if (lemmatizer == null) {
        InputStream is = getClass().getResourceAsStream("/models/en-lemmatizer.dict");
        lemmatizer = new SimpleLemmatizer(is);
        is.close();
    }
    String lemma = lemmatizer.lemmatize(word, postag);
    return lemma;
}

Sample code taken from here - DocumentTaggerService

RAVI
  • 3,143
  • 4
  • 25
  • 38
3

RAVI did not fully answer the question:

The reason you aren't seeing the SimpleLemmatizer in the autodoc is because it actually doesn't exist.

Use DictionaryLemmatizer instead.

opennlp.tools.lemmatizer.DictionaryLemmatizer

Collin Bell
  • 585
  • 4
  • 12