Questions tagged [linguistics]

Linguistics is the scientific study of language and its structure, including the study of morphology, syntax, phonetics, and semantics.

Linguistics is the scientific study of language and its structure, including the study of morphology, syntax, phonetics, and semantics.

Specific branches of linguistics include sociolinguistics, dialectology, psycholinguistics, computational linguistics, historical-comparative linguistics, and applied linguistics.

323 questions
13
votes
6 answers

Monitor brands with common words

Let's say you should monitor the brand "ONE" online. What algorithms can be used to separate pages about the brand ONE from pages containing the common word ONE? I'm thinking maybe Bayes could work, but are there other ways to do this?
Christian Davén
  • 16,713
  • 12
  • 64
  • 77
12
votes
2 answers

Probability tree for sentences in nltk employing both lookahead and lookback dependencies

Does nltk or any other NLP tool allow to construct probability trees based on input sentences thus storing the language model of the input text in a dictionary tree, the following example gives the rough idea, but I need the same functionality such…
stackit
  • 3,036
  • 9
  • 34
  • 62
12
votes
5 answers

Where can I find a list of English phrases?

I'm tasked with searching for the use of cliches and common phrases in text. The phrases are similar to the phrases you might see for the phrase puzzles on Wheel of Fortune. Here are a few examples: Easy Come Easy Go Too Good To be True Winning…
Marcus Adams
  • 53,009
  • 9
  • 91
  • 143
11
votes
1 answer

Tabulating characters with diacritics in R

I'm trying to tabulate phones (characters) occurrences in a string, but diacritics are tabulated as characters on their own. Ideally, I have a wordlist in International Phonetic Alphabet, with a fair amount of diacritics and several combinations of…
Stefano
  • 1,405
  • 11
  • 21
11
votes
2 answers

Estimate Phonemic Similarity Between Two Words

I am working on detecting rhymes in Python using the Carnegie Mellon University dictionary of pronunciation, and would like to know: How can I estimate the phonemic similarity between two words? In other words, is there an algorithm that can…
duhaime
  • 25,611
  • 17
  • 169
  • 224
11
votes
4 answers

Generating the plural form of a noun

Given a word, which may or may not be a singular-form noun, how would you generate its plural form? Based on this NLTK tutorial and this informal list on pluralization rules, I wrote this simple function: def plural(word): """ Converts a…
Cerin
  • 60,957
  • 96
  • 316
  • 522
11
votes
5 answers

Implementing a rhyme finder

Was wondering if anyone had any tips or could point me in the right direction to finding/creating some sort of algorithm to find rhyming words. I specifically do not want to use an API, as creating the algorithm just to create it is my end…
Gagan Singh
  • 988
  • 12
  • 22
10
votes
2 answers

Render linguistic syntax tree in browser

The input is either: (1) a bracketed representation of a tree with labeled internal nodes such as: (S (N John) (VP (V hit) (NP (D the) (N ball)))) with output: (Whether the lines are dashed and whether the caption is present are not…
dsg
  • 12,924
  • 21
  • 67
  • 111
10
votes
4 answers

Python NLP British English vs American English

I'm currently working on NLP in python. However, in my corpus, there are both British and American English(realize/realise) I'm thinking to convert British to American. However, I did not find a good tool/package to do that. Any suggestions?
Mr.cysl
  • 1,494
  • 6
  • 23
  • 37
10
votes
1 answer

Qt5 on Linux - where is Linguist?

I'm using Linux Mint 16 and I got a problem. I installed Qt Creator 5.2.1 from binary package, downloaded from Qt site. I don't have Qt Linguist... Also, I have installed qtcreator and qt5-dev-tools (or similar name) from repository, still no…
aso
  • 1,331
  • 4
  • 14
  • 29
10
votes
4 answers

Finding Tense of A sentence using stanford nlp

Q1.I am trying to get tense of a complete sentence,just don't know how to do it using nlp. Any help appreciated. Q2 .What all information can be extracted from a sentence using nlp? Currently I can, I get : 1.Voice of sentence 2.subject…
swapyonubuntu
  • 1,952
  • 3
  • 25
  • 34
10
votes
2 answers

Which word stemmer should I use in nltk?

My goal is to analyze some corpus (twitter for the now) for emotional content. Just today I realized it would make a bit of sense to search for word stems as opposed to having an exhaustive list of emotional word stems. And so I've been exploring…
speciousfool
  • 2,620
  • 5
  • 28
  • 33
9
votes
1 answer

How To Parse Verbs Using Spacy

I'm trying to parse verbs in a corpus and list them in dictionaries and count how many times each verb appears as a transitive, intransitive and ditransitive. I was wondering how I could use spacy to parse through the verbs and notate them as…
emanuel tru
  • 111
  • 1
  • 3
9
votes
3 answers

How to filter word permutations to only find semantically correct ngrams? (Python 3, NLTK)

I want to create a number of permutations from a list of 200 words -- this obviously creates a problem because this leads to some ridiculously gigantic number of possible permutations (up to 5 words in a phrase). In order to effectively limit this…
user3682157
  • 1,625
  • 8
  • 29
  • 55
9
votes
4 answers

Natural Language Processing - Word Alignment

I am looking for word alignment tools and algorithms. I am dealing with bilingual English - Hindi text, and currently working on DTW (Dynamic Time Warping) algorithm CLA (Competitive Linking Algorithm) NATools Giza++ Could you please suggest any…
boddhisattva
  • 6,908
  • 11
  • 48
  • 72
1
2
3
21 22