Questions tagged [linguistics]

Linguistics is the scientific study of language and its structure, including the study of morphology, syntax, phonetics, and semantics.

Specific branches of linguistics include sociolinguistics, dialectology, psycholinguistics, computational linguistics, historical-comparative linguistics, and applied linguistics.

323 questions
0
votes
0 answers

NLTK - Access Brown corpus as one file

I have recently started working with NLTK, and I have downloaded the Brown corpus to analyze its data. However, I see that the corpus is split into around 500 files. I need to run a script on the content of the whole corpus, though. Is there a…
The Philgrim
  • 21
  • 1
  • 4
0
votes
1 answer

Why are mono sounds preferred over stereo sounds when analysing acoustic parameters like intensity, RMS amplitude, fundamental frequency, etc.?

In phonetic analyses using software like Praat, I have seen that people prefer mono sounds over stereo sounds. Even if the recordings were made with a stereo microphone, the audio files are often converted to mono. Any reason besides saving…
Pranav_b
  • 19
  • 8
0
votes
2 answers

Finding the rhyming words in a corpus with R, regex

I have the following corpus: corpus_rhyme <- c("helter-skelter", "lovey-dovey", "riff-raff", "hunter-gatherer", "day-to-day", "second-hand", "chock-a-block") Out of all of these words I only need words like "helter-skelter",…
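The excerpt is cut off before it states the exact selection rule, so the following is only a minimal sketch under one assumption: a compound counts as "rhyming" when it has exactly two hyphen-separated halves that are spelled identically from their first vowel onward. The question asks about R; the sketch is in Python, and the name `rhyming_compounds` is hypothetical. The same backreference idea should carry over to R's regex functions.

```python
import re

# Corpus copied from the question excerpt.
corpus_rhyme = ["helter-skelter", "lovey-dovey", "riff-raff",
                "hunter-gatherer", "day-to-day", "second-hand",
                "chock-a-block"]

# Assumption (not stated in the excerpt): two halves rhyme when they differ
# only in their leading consonants, i.e. the spelling from the first vowel
# to the end of each half is identical. The backreference \1 enforces that.
PATTERN = re.compile(r"^[^aeiou-]*([aeiou][a-z]*)-[^aeiou-]*\1$")

def rhyming_compounds(words):
    """Return the words whose two halves share the same spelled ending."""
    return [w for w in words if PATTERN.match(w)]

print(rhyming_compounds(corpus_rhyme))
```

This is a spelling-based check, not a phonetic one. It also accepts exact repeats of the same half and ignores three-part compounds such as "chock-a-block", so it would need adjusting if the intended rule differs.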
0
votes
2 answers

Tatar language and UTF-8

I'm working on a project that includes transforming Latin symbols into the corresponding Cyrillic ones. I'm talking about the Tatar language, which is used by one of the many nations in Russia. I tried to find these symbols in UTF-8 but have failed so far. The only…
Rodj
  • 3
  • 1
0
votes
1 answer

Get rows from MySQL table that have a column in common but slightly different value

Might not even be a programming question, but the problem I'm having is I have a big MySQL table with products from different sources (stores) and in many cases it's the same product with different sources. For example an "iPhone…
himmip
  • 1,360
  • 2
  • 12
  • 24
0
votes
1 answer

How to remove non-pair entries in a dataframe?

A column 'Consonant' in my dataframe 'df' has two levels with different counts: 250 for 'Geminate' and 214 for 'Singleton'. Because there are more Geminates, how can I remove the extra Geminate entries so that their number equals that of the 'Singleton' entries? The output…
Pranav_b
  • 19
  • 8
0
votes
0 answers

Is this the correct way to sort a dataframe to get sensible results in ANOVA (aov)?

I have (theoretically) understood how to interpret results from ANOVA. I am also aware that the format presented herein is okay while performing LMER tests but since the sample size is small, I am restricted to Anova. Basically, I want to see if…
Pranav_b
  • 19
  • 8
0
votes
1 answer

Where is the same error coming from, LMER test?

I am trying to perform an LMER test on a dataset (original data attached), the number of rows for all columns is the same (153). However, it gives me an error when I try to fit the formula Error: number of levels of each grouping factor must be <…
Pranav_b
  • 19
  • 8
0
votes
1 answer

Extracting verbs from German sentences

I'm trying to extract verbs from German sentences. The problem is that, for example, in the sentence "Ich rufe noch einmal an." I'm getting rufe as the verb, but it's anrufe. I'm using TextBlob and don't really know anything about linguistics. and using…
0
votes
1 answer

How to find the root of a word from its present participle or other variations in Python?

I'm working on an NLP project, and right now, I'm stuck on detecting antonyms for certain phrases that aren't in their "standard" forms (like verbs, adjectives, nouns) but are instead present participles, past tense, or something to that effect. For…
Joshua Crotts
  • 69
  • 1
  • 8
0
votes
1 answer

Extracting Related Date and Location from a sentence

I'm working with written text (paragraphs of articles and books) that includes both locations and dates. I want to extract from the texts pairs that contain locations and dates that are associated with one another. For example, given the following…
ITz
  • 35
  • 6
0
votes
1 answer

NLP - linguistic consistency analysis

I hope you can help me :). I am working for a translation company. As you know, every translation consists in splitting the original text into small segments and then re-joining them into the final product. In other words, the segments are…
Mauro
  • 189
  • 2
  • 14
0
votes
1 answer

How to fix error in Linguistics library in Rails 3.1 and Ruby 1.9.2

My application has had the following lines without any trouble in recent versions of rails: gem 'linguistics' # in Gemfile Linguistics::use(:en, :installProxy => :en) # In environment.rb Recently I upgraded from Ruby 1.8.7 to Ruby 1.9.2 and am…
tanman
  • 1,379
  • 1
  • 10
  • 19
0
votes
2 answers

Use spaCy models to find modal verbs for languages fr, es, ru

I am using spaCy models to find modal verbs (MD) in the following languages: en, de, fr, es, ru. From tag_map.py of en and de it is clear that "VerbType": "mod" is a modal verb. But tag_map.py for fr, es and ru do not have any such property. How can I find…
Hammad Hassan
  • 1,192
  • 17
  • 29
0
votes
1 answer

Finding total count for word form when many possible POS tags

I feel like I have a dumb question, but here goes anyway. I'm trying to go from data that looks something like this: a word form lemma POS count of occurrence same word form lemma Not the same POS another count same…