Questions tagged [nlp]

Natural language processing (NLP) is a subfield of artificial intelligence that involves transforming or extracting useful information from natural language data. Methods include machine-learning and rule-based approaches. It is often regarded as the engineering arm of Computational Linguistics.

NOTE: If you want to use this tag for a question that does not directly concern implementation, consider posting on Data Science or Artificial Intelligence instead; otherwise you're probably off-topic. Please choose one site only and do not cross-post to more than one - see "Is cross-posting a question on multiple Stack Exchange sites permitted if the question is on-topic for each site?" (tl;dr: no).

NLP tasks

Beginner books on Natural Language Processing

Popular software packages

20185 questions
6
votes
4 answers

How to Convert English to Cron?

I did some searching but haven't found anything that looks useful yet. Does anyone know of something (tool, lib, etc.) that can parse English phrases and translate them into a cron string? For example: Every Tuesday at 15:00 converts…
Tom Duckering
  • 2,727
  • 1
  • 23
  • 27
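To make the task in the question above concrete, here is a minimal sketch in Python. It assumes the standard five-field cron layout (minute, hour, day of month, month, day of week, with 0 = Sunday) and handles only phrases of the form "every <weekday> at HH:MM". The names `phrase_to_cron`, `DAYS`, and `PATTERN` are hypothetical and invented for illustration; this is not an existing tool or library.

```python
import re

# Weekday names mapped to cron day-of-week numbers (0 = Sunday).
DAYS = {
    "sunday": 0, "monday": 1, "tuesday": 2, "wednesday": 3,
    "thursday": 4, "friday": 5, "saturday": 6,
}

# Hypothetical pattern: accepts only "every <weekday> at HH:MM".
PATTERN = re.compile(r"^every\s+(\w+)\s+at\s+(\d{1,2}):(\d{2})$", re.IGNORECASE)


def phrase_to_cron(phrase):
    """Translate a narrow class of English phrases into a cron string."""
    match = PATTERN.match(phrase.strip())
    if match is None:
        raise ValueError(f"unsupported phrase: {phrase!r}")
    day = match.group(1).lower()
    hour = int(match.group(2))
    minute = int(match.group(3))
    if day not in DAYS or hour > 23 or minute > 59:
        raise ValueError(f"unsupported phrase: {phrase!r}")
    # Fields: minute, hour, day of month, month, day of week.
    return f"{minute} {hour} * * {DAYS[day]}"


print(phrase_to_cron("Every Tuesday at 15:00"))
```

A fixed pattern like this covers only one phrasing; anything looser ("weekdays at noon", "twice a month") would need a real grammar or a dedicated parser.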
6
votes
2 answers

Why doesn't the WordNet dictionary contain the word 'she'?

Does anyone know why WordNet doesn't contain the word 'she'? Thanks. See this link
Xin
  • 575
  • 4
  • 20
6
votes
5 answers

Calculating context-sensitive text correlation

Suppose I want to match address records (or person names or whatever) against each other to merge records that are most likely referring to the same address. Basically, I guess I would like to calculate some kind of correlation between the text…
Anders Fjeldstad
  • 10,724
  • 2
  • 33
  • 50
6
votes
4 answers

Natural Language Understanding API

I don't know whether such an API or service currently exists, so this is a vague question; my apologies. I have a PHP script that works with Freebase, and I was wondering if I can enable it so a user can ask a question on my site which will be…
6
votes
11 answers

Natural Language Processing: Find obscenities in English?

Given a set of words tagged for part of speech, I want to find those that are obscenities in mainstream English. How might I do this? Should I just make a huge list, and check for the presence of anything in the list? Should I try to use a regex to…
Nick Heiner
  • 119,074
  • 188
  • 476
  • 699
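The first option raised in the question above (keep a list and check each word against it) can be sketched as follows. The input is assumed to be a list of `(word, tag)` pairs; `BLOCKLIST` and `find_blocked` are hypothetical names, and the two entries are mild placeholders standing in for a curated list.

```python
# Placeholder entries; a real list would be much larger and curated.
BLOCKLIST = frozenset({"darn", "heck"})


def find_blocked(tagged_words, blocklist=BLOCKLIST):
    """Return the (word, tag) pairs whose lowercased word is in the blocklist."""
    return [(word, tag) for word, tag in tagged_words if word.lower() in blocklist]


tagged = [("Well", "UH"), ("Darn", "UH"), ("it", "PRP")]
print(find_blocked(tagged))
```

Exact lookup misses inflected forms and deliberate misspellings, which is the usual argument for adding stemming or pattern matching on top of a plain list.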
6
votes
1 answer

Gensim Dictionary Implementation

I was just curious about the gensim dictionary implementation. I have the following code: def build_dictionary(documents): dictionary = corpora.Dictionary(documents) dictionary.save('/tmp/deerwester.dict') # store the dictionary …
dmil
  • 119
  • 1
  • 9
6
votes
2 answers

Analyse the sentences and extract person name, organization and location with the help of NLP

I need to solve the following using NLP. Can you give me pointers on how to achieve this using the OpenNLP API? a. How to find out if a sentence implies a certain action in the past, present or future. For example: I was very sad last week - past; I feel…
SST
  • 2,054
  • 5
  • 35
  • 65
6
votes
1 answer

How to normalize similarity measures from WordNet

I am trying to calculate semantic similarity between two words. I am using WordNet-based similarity measures, i.e. the Resnik measure (RES), Lin measure (LIN), Jiang and Conrath measure (JNC) and Banerjee and Pedersen measure (BNP). To do that, I am using…
nish
  • 325
  • 4
  • 9
6
votes
2 answers

Open-source rule-based pattern matching / information extraction frameworks?

I'm shopping for an open-source framework for writing natural language grammar rules for pattern matching over annotations. You could think of it like regexps but matching at the token rather than character level. Such a framework should enable the…
John Lehmann
  • 7,975
  • 4
  • 58
  • 71
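The "regexps at the token level" idea in the question above can be illustrated with a small sketch. It assumes tokens are `(text, tag)` pairs and a pattern is a fixed-length list of predicates, one per token; there are no quantifiers or alternation. `match_token_pattern` is a hypothetical helper, not part of any framework.

```python
def match_token_pattern(tokens, pattern):
    """Return (start, end) spans where each predicate accepts the aligned token."""
    spans = []
    width = len(pattern)
    for start in range(len(tokens) - width + 1):
        window = tokens[start:start + width]
        if all(pred(tok) for pred, tok in zip(pattern, window)):
            spans.append((start, start + width))
    return spans


tokens = [
    ("The", "DT"), ("quick", "JJ"), ("fox", "NN"), ("saw", "VBD"),
    ("a", "DT"), ("lazy", "JJ"), ("dog", "NN"),
]

# Determiner, adjective, noun: a simple noun-phrase pattern over POS tags.
pattern = [
    lambda tok: tok[1] == "DT",
    lambda tok: tok[1] == "JJ",
    lambda tok: tok[1] == "NN",
]

print(match_token_pattern(tokens, pattern))
```

Each predicate can test any annotation on the token (text, tag, entity label), which is what separates this from a character-level regex.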
6
votes
2 answers

How do I use the book functions (e.g. concordance) in NLTK?

I am going through this wonderful tutorial. I downloaded a collection called book: >>> import nltk >>> nltk.download() and imported texts: >>> from nltk.book import * *** Introductory Examples for the NLTK Book *** Loading text1, ..., text9 and…
Alex Gordon
  • 57,446
  • 287
  • 670
  • 1,062
6
votes
5 answers

Does WordNet have "levels"? (NLP)

For example... Chicken is an animal. Burrito is a food. WordNet allows you to do "is-a"...the hierarchy feature. However, how do I know when to stop travelling up the tree? I want a LEVEL that is consistent. For example, if presented with a bunch of…
TIMEX
  • 259,804
  • 351
  • 777
  • 1,080
6
votes
2 answers

SQL queries to their natural language description

Are there any open source tools that can generate a natural language description of a given SQL query? If not, some general pointers would be appreciated. I don't know much about NLP, so I am not sure how difficult this is, although I saw from some…
zpavlinovic
  • 1,507
  • 1
  • 17
  • 36
6
votes
1 answer

Natural Language Processing - Converting Text Features Into Feature Vectors

So I've been working on a natural language processing project in which I need to classify different styles of writing. Assuming that semantic features from texts have already been extracted for me, I plan to use Weka in Java to train SVM classifiers…
myrocks2
  • 305
  • 3
  • 14
6
votes
0 answers

Multi-column layout handling with pdfminer pdf2txt.py module

So far I have been using the pdfminer pdf2txt.py module with success. But a problem arises with PDF files formatted in two columns. The module retrieves the text as a single column, which results in many split words at the ends of lines. Example: and…
user1834437
6
votes
1 answer

Should I use LingPipe or NLTK for extracting names and places?

I'm looking to extract names and places from very short bursts of text. Examples: "cardinals vs jays in toronto", "Daniel Nestor and Nenad Zimonjic play Jonas Bjorkman w/ Kevin Ullyett, paris time to be announced", "jenson button - pole position,…
pedalpete
  • 21,076
  • 45
  • 128
  • 239