Questions tagged [opennlp]

Apache's libraries for natural language processing (NLP).

The Apache OpenNLP library is a machine learning based toolkit for the processing of natural language text. It supports the most common NLP tasks, such as tokenization, sentence segmentation, part-of-speech tagging, named entity extraction, chunking, parsing, and coreference resolution. These tasks are usually required to build more advanced text processing services. OpenNLP also included maximum entropy and perceptron based machine learning.

More about Natural Language Processing :

Natural language processing (NLP) is the ability of a computer program to understand human speech as it is spoken.

Apache OpenNlp is often used with Apache Flink(a document query library).

Relevant Links,

http://searchcontentmanagement.techtarget.com/definition/natural-language-processing-NLP https://opennlp.apache.org/docs/.

Cornerstone books: https://www.manning.com/books/taming-text

684 questions
-1
votes
1 answer

error in trainingmodel in openNLP

Error in code: If that train is removed then also show compiler error as below. I tried a lot to remove that and classify sentence as positive or negative.
-1
votes
1 answer

How to extract meaningful noun phrases based on probability using OpenNlp's chunking parser

I am newbie to Natural Language processing. I need to extract meaningful noun and noun phrases based on their probability (eg. 75% and above) to make a auto-suggest dictionary. I have been reading on-line posts, articles for a couple of days, but…
Wendy
  • 1
  • 1
-1
votes
1 answer

Error while training opennlp with indian names

I am getting the following response while I am trying to train the opennlp for Indian names. $ opennlp TokenNameFinderTrainer -model en-ner-person.bin -lang en -data en-ner-person.train -encoding UTF-8 Indexing events using cutoff of…
Jin Krish
  • 41
  • 7
-1
votes
3 answers

How to pass the ArrayList between methods and classes?

I have this method which returns an arrayList tokens : public static String[] Tokenize(String input) throws InvalidFormatException, IOException { InputStream is = new FileInputStream("en-token.bin"); TokenizerModel model = new…
smoothsipai
  • 43
  • 10
-1
votes
1 answer

How to identify Coreference set and representative mentions in Stanford CoreNLP Coreferences?

I am using Stanford CoreNLP. I need to detect and identify the "Coreference set"s and "representative mention"s for each CorefChain in my input text: For example: Input: Obama was elected to the Illinois state senate in 1996 and served there for…
Ismail
  • 1
  • 1
-1
votes
2 answers

Name Extraction - CV/Resume - Stanford NER/OpenNLP

I'm currently on a learning project to extract an individuals name from their CV/Resume. Currently I'm working with Stanford-NER and OpenNLP which both perform with a degree of success out of the box on, tending to struggle on "non-western" type…
-1
votes
1 answer

POS Tagging too slow - using OpenNLP

I am just playing around with Part-of-speech Tagging, and started using OpenNLP. I am using the following code to load the model (Java): m_modelFile = new FileInputStream("c:\\DATA\\en-parser-chunking.bin"); m_model = new…
Phoeniyx
  • 542
  • 4
  • 15
-1
votes
2 answers

What are some good tools/practises for aspect level sentiment analysis?

I am planning to get some review data from tripadvisor and I want to be able to extract hotel related aspects and assign polarity to them and classify them as negative or positive. What tools can I use for this purpose and how and where do I start?…
Kripa Jayakumar
  • 891
  • 9
  • 16
-1
votes
1 answer

How to do semantic analysis from POS tag?

suppose sentence is, "Vehicle does not start in cold weather and need to change windshield blades." I'm interested to find out what part of car is affected, and what is the reason for that. From above sentence, we can not infer that windshield…
Nitin
  • 43
  • 1
  • 6
-1
votes
2 answers

Is there a way matching two profiles based on the profile data

I need to measure similarity between two profiles, in which we will have description about them in words. Now using the profile data, i need to find the similarity between them. Can you suggest me a way.?
Srini
  • 3,334
  • 6
  • 29
  • 64
-1
votes
1 answer

opennlp with netbeans is not giving output

How to use opennlp with netbeans. I made a small program as given in apache document but it is not working. I have set path to the opennlp bin as stated in the apache document but still i m not geting an output. it is not able to find .bin and hence…
-1
votes
3 answers

How to correlate similar messages using NLP

I have couple of tweets which needs to be processed. I am trying to find occurrences of messages where it mean some harm to a person. How do I go about achieving this via NLP I bought my son a toy gun I shot my neighbor with a gun I don't like this…
AVM
  • 243
  • 3
  • 8
-1
votes
1 answer

OpenNLp training data for organisation

I am trainng my data for opennlp organisation entity finder from command line but it's showing a null pointer exception I have used: opennlp TokenNameFinderTrainer -model en-ner-person.bin -lang en -data >en-ner->person.train -encoding UTF-8
-1
votes
1 answer

How to create training data

Can anybody tell me how to create training data for categorization. I am using OpenNLP for categorization. Is there any tool to create training data or if i have to create it manually then how it should be done? I am a complete noob in this field.…
Vishal Thakur
  • 141
  • 4
  • 16
-1
votes
2 answers

Classifier for diff reports

I am new to ML. I have a diff report with annotations indicating good diff and bad diff. Example - OLD STRING NEW STRING DIFF ANNOTATION abc AbC good pqr xyz bad lmn wxy good .... Given this training set, is it…
user379151
  • 1,289
  • 1
  • 16
  • 25
1 2 3
45
46