Questions tagged [linguistics]

Linguistics is the scientific study of language and its structure, including the study of morphology, syntax, phonetics, and semantics.

Specific branches of linguistics include sociolinguistics, dialectology, psycholinguistics, computational linguistics, historical-comparative linguistics, and applied linguistics.

323 questions
0
votes
1 answer

Software to detect grammar constructions in a sentence

I have the following sentence: She usually walks three miles a day. There are several grammar elements here, like present simple, personal pronouns, numerals, etc. Is there software that can detect these?
Max Koretskyi
  • 101,079
  • 60
  • 333
  • 488
0
votes
2 answers

Is there a downloadable corpus (dictionary/lexicon) for informal, playful words such as 'gonna', 'LOL', 'wanna' in English?

Please suggest a downloadable English corpus that contains informal, playful words such as 'gonna', 'LOL' and 'wanna'.
user5232014
0
votes
1 answer

Tree for 'unelectability'

How would I go about making a derivational tree for this word? I am at a loss: I can't find the rules for making trees online, and it is very confusing. I got that the stem is elect as a verb, then un-elect, then un-elect-able, and finally the full…
Tim
  • 11
  • 4
0
votes
0 answers

Data structure / data model for multi-language phrasebook

We want to create a multi-language phrasebook / dictionary for a specific area, and now I'm thinking about the best data structure / data model for that. Since it should be more of a phrasebook than a dictionary, we want to keep the data model / structure…
Tima
  • 12,765
  • 23
  • 82
  • 125
0
votes
1 answer

Semantic Distance/Similarity between two terms in a sentence

I was wondering if anyone has a reference that discusses semantic distance/similarity between two words in a sentence based on counting the words that separate them. Example: "The student went to the university to…
Nirke
  • 173
  • 1
  • 5
  • 12
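
The word-counting idea in this question can be sketched as follows. This is only an illustration, not an established similarity measure: the function name `token_distance`, the regex tokenizer, and the choice to use the first occurrence of each word are all assumptions made for the example.

```python
import re


def token_distance(sentence, first, second):
    """Count the tokens that sit between `first` and `second`.

    Returns None when either word is absent. Only the first occurrence
    of each word is considered (an assumption of this sketch).
    """
    tokens = re.findall(r"\w+", sentence.lower())
    try:
        i = tokens.index(first.lower())
        j = tokens.index(second.lower())
    except ValueError:
        return None
    # Adjacent words (or the same word) have zero separating tokens.
    return max(abs(i - j) - 1, 0)


# Uses only the visible part of the example sentence from the question.
print(token_distance("The student went to the university", "student", "university"))
```

A raw count like this measures position only; whether it says anything about meaning is exactly what the question is asking.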
0
votes
1 answer

Type/Token Ratio in R

I'm working with a new corpus and want to get the type/token ratio. Does anyone know of a standard way to do this? I've been trawling around the internet and didn't find anything relevant. Even the tm package doesn't seem to have an easy way to do…
hooliaquoolia
  • 61
  • 1
  • 3
  • 12
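
For reference, a type/token ratio is the number of distinct word forms divided by the total number of tokens. The sketch below is in Python rather than R and makes its own assumptions: the function name `type_token_ratio` is hypothetical, text is lowercased, and a token is any run of letters or apostrophes. A real corpus would likely need a more careful tokenizer.

```python
import re


def type_token_ratio(text):
    """Return distinct tokens (types) divided by total tokens."""
    # Assumption: lowercase everything and treat runs of letters or
    # apostrophes as tokens.
    tokens = re.findall(r"[a-z']+", text.lower())
    if not tokens:
        return 0.0
    return len(set(tokens)) / len(tokens)


print(type_token_ratio("the cat saw the dog"))  # 4 types over 5 tokens
```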
0
votes
1 answer

How to pass in an estimator to NLTK's NgramModel?

I am using NLTK to train a bigram model using a Laplace estimator. The constructor for the NgramModel is: def __init__(self, n, train, pad_left=True, pad_right=False, estimator=None, *estimator_args, **estimator_kwargs): After some…
unicorn_poet
  • 475
  • 1
  • 5
  • 14
0
votes
1 answer

Hyphenating arbitrary text automatically

What kinds of challenges are there facing automatic hyphenation? It seems that you could just draw word by word, breaking when the length of the line exceeds the length of the viewport (or whatever we're wrapping our text in), placing hyphens after…
Kat
  • 4,645
  • 4
  • 29
  • 81
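
The greedy approach described in this question can be sketched roughly as below. The excerpt is cut off before it says where hyphens would go, so this sketch assumes the simplest rule: break the overflowing word wherever the line runs out. The name `naive_wrap` and the minimum of two letters before a hyphen are assumptions for illustration.

```python
def naive_wrap(text, width):
    """Greedy wrap that splits an overflowing word at the margin."""
    if width < 3:
        raise ValueError("width must leave room for two letters and a hyphen")
    lines = []
    current = ""
    for word in text.split():
        while True:
            candidate = word if not current else current + " " + word
            if len(candidate) <= width:
                current = candidate
                break
            # Space left on this line, after the separating blank if any.
            room = width - len(current) - (1 if current else 0)
            if room >= 3:
                # Cut the word at the margin, ignoring syllable boundaries.
                head, word = word[:room - 1], word[room - 1:]
                current = (current + " " if current else "") + head + "-"
            lines.append(current)
            current = ""
    if current:
        lines.append(current)
    return lines


print(naive_wrap("automatic hyphenation", 14))
```

With a width of 14 this produces the break "hyp-" / "henation", which does not fall on a syllable boundary. That is one of the challenges: a length check alone cannot tell where a word may legitimately be broken.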
0
votes
1 answer

Novel Query/Hypothesis Formulation in AI Systems

Anyone know of material addressing question formulation in AI systems? I believe there are a few architectures that can produce predictive or creative output based on patterns of input like Boltzmann machines, but I'm skeptical that they can address…
0
votes
2 answers

How to linguistically parse English Text?

Is there a way to linguistically parse English text? I mean get something like this? I{I,pronoun} am{to be, verb, Present Simple} late{late, adverb}. Or even better with dependencies, like: I -> am -> (what?) -> late. Better in Java, but it…
Denis Kulagin
  • 8,472
  • 17
  • 60
  • 129
0
votes
2 answers

What's the end on 'print "%d" % (5)' called?

A linguistic question I guess, but what's the (5) called in the code print "%d" % (5) ? I call the %d an integer representation, but I'm not sure what to call the stuff it actually represents, regardless of it being a number, a variable, a…
Dune
  • 5
  • 1
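
A small sketch may help frame this question. The question's code uses the Python 2 print statement; the version below is written for Python 3, and the variable names are made up for illustration. It shows that the parentheses in (5) only group an expression, so the right-hand operand of % is a plain integer, while a trailing comma or several values make a tuple.

```python
single = (5)        # parentheses only group; this is the int 5
one_tuple = (5,)    # the trailing comma makes a one-element tuple

formatted_int = "%d" % single        # one value, passed directly
formatted_tuple = "%d" % one_tuple   # one value, passed in a tuple
formatted_pair = "%d and %d" % (5, 6)  # several values need a tuple

print(type(single).__name__, type(one_tuple).__name__)
print(formatted_int, formatted_tuple, formatted_pair)
```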
0
votes
1 answer

Database with syllable counts

Is there a database somewhere with files that contain words tagged by syllable count? I'm looking to implement a haiku generator for fun.
goodcow
  • 4,495
  • 6
  • 33
  • 52
0
votes
1 answer

enlarging a text corpus with classes

I have a text corpus of many sentences, with some named entities marked within it. For example, the sentence: what is the best restaurant in wichita texas? which is tagged as: what is the best restaurant in ? I want to expand this corpus,…
Beka
  • 725
  • 6
  • 22
0
votes
1 answer

How to use bootcov on a glm model (R)

I'm using R to analyze linguistic data. I have a dataframe that looks something like this:

phoneme onset voice ident
b       TRUE  TRUE  TRUE
b       TRUE  TRUE  FALSE
b       TRUE  TRUE  FALSE
b       TRUE  TRUE  FALSE
b       TRUE…
Clara B
  • 461
  • 4
  • 14
0
votes
1 answer

Searching and segmentation in python

My list is : search=[1രാമന്‍,2സീതയെ,7പൂവ്‌,16കോട്ടയത്ത്‌,22പരീക്ഷ,28രാമന്‍,29ലക്ഷ്മനനെ,33രാമനോടു,36ലക്ഷ്മണന്‍,37സീതയെ,45വഴ] My inputfile contains: 1രാമന്‍ N_NNP_S_M_SG 1 2സീതയെ N_NNP_O_F_SG 1 4. RD_PUNC 0 1രാമന്‍,5അവന്‍ PR_PRP_S_M_SG…
user3251664
  • 441
  • 2
  • 7
  • 11