Questions tagged [lexicon]

69 questions
1
vote
1 answer

Training corpus for Brill Tagger in other languages than English

Does anyone know any tagged corpus or lexicon for using the Brill Part-of-Speech (POS) tagger in other languages than English? Thanks!
0
votes
0 answers

package ‘textdata’ is in use and will not be installed while installing install_github("EmilHvitfeldt/textdata")

If you use this lexicon, then please cite it. 1: Yes 2: No Selection: 1 Error: 'C:/Users/rithv/AppData/Local/textdata/textdata/Cache/nrc/NRC-Emotion-Lexicon/NRC-Emotion-Lexicon-v0.92/NRC-Emotion-Lexicon-Wordlevel-v0.92.txt' does not exist. I have…
0
votes
0 answers

How to add words manually to nrc emotions lexicon?

Can someone help me on how to add new words with matching emotion on NRCLex Python? Given emotions output is accurate but can be much better. Find my code here: https://github.com/buetvictoria/Emotion_Detection Thank you! Code is already done but…
0
votes
0 answers

Function error when counting positive words row

I'm fairly new to R. I'm trying to do a sentiment analysis using the bing lexicon. I've tried to find a solution for my problem, but I wasn't able to apply it to my data. I'm trying to count the positive words in my data, which consists of 401 hotel…
ponyo
  • 1
  • 1
0
votes
1 answer

R: Counting frequency of words in a character column

I'm trying to count the number of times that some pre-specified words appear in a character column, Post. This is what my dataset looks like: Now, I want to count all green/sustainable words in each of the posts and add this number as an extra…
Sarah K.
  • 1
  • 1
0
votes
0 answers

Logistic Regression and Sentiment Analysis

I need help with the following code everywhere that says "YOUR CODE HERE". Any help is appreciated. Thank you! #Use the lexicons to create two lexicon features. A feature 'POSLEX' whose value indicates how many tokens belong to the positive…
0
votes
0 answers

"'utf-8' codec can't decode byte 0xf3" while performing the sentiment lexicon

I am getting an error in the following code while performing the sentiment lexicon. If someone could guide me here, that'd be really helpful. "'utf-8' codec can't decode byte 0xf3" from textblob import TextBlob pos_count = 0 pos_correct = 0 with…
Radhika Singh
  • 21
  • 1
  • 2
0
votes
2 answers

R: find words from tweets in Lexicon, count them and save number in dataframe with tweets

I have a data set of 50,176 tweets (tweets_data: 50176 obs. of 1 variable). Now, I have created a self-made lexicon (formal_lexicon), which consists of around 1 million words, which are all formal language style. Now, I want to create a small code…
Ja123
  • 73
  • 6
0
votes
1 answer

Does sentimentr package account for number of words in sentence and number of sentence in paragraph?

Can anyone help explain whether sentimentr package accounts for word number? I am trying to work out how the number of words affects the sentiment score. Does it take into account if people have more sentences in an answer or more words per…
Gabriella
  • 421
  • 3
  • 11
0
votes
1 answer

Issues with using lexicon on Azure Cognitive services (text-to-speach) from python

I am using Azure cognitive TTS from python for quite some time now, using their examples from the web and it works just fine. I had an issues and had to introduce external lexicons, so I made them and added them to commands. It looks like…
Branko Radojevic
  • 660
  • 1
  • 5
  • 14
0
votes
1 answer

R sentiment analysis; 'lexicon' not found;

i am very new to sentiment analysis. I am running the codes based on the tutorial here It is using a tidytext package. But I encountered the problem when I run the code AFINN <- sentiments %>% filter(lexicon == "AFINN") %>% select(word,…
0
votes
2 answers

Change value of words in bing lexicon

I'm analyzing a survey using R Studio. I'm using Bing Sentiment lexicon from tidytext package to do so. Some words don't have the right meaning for my survey, specifically 'tender' is coded as positive, but my respondents mean 'tender' as a negative…
Gabriella
  • 421
  • 3
  • 11
0
votes
0 answers

NRC Emolex (Lexicon) in R

im a newbie and im trying to get to work with Rstudio for my final project. As i know that NRC Emolex is available in 40+ languages including Indonesian language. My problem is, I still dont know how to get the sentiment score with NRC Emolex in…
0
votes
1 answer

Is it possible to add terms to the dictionaries of `lexicon` package?

The package sentimentr provides tools to calculate text polarity sentiment at the sentence level and optionally aggregate by rows or grouping variable. One of its functions, sentiment, approximates the sentiment (polarity) of text by sentence. In…
Mark
  • 1,577
  • 16
  • 43
0
votes
0 answers

Sentiment Dictionary for Business

I am looking for a sentiment dictionary/lexicon for business context to perform sentiment analysis. I have already built the analysis and calculation portion, but simply need the classes with contents. Here's an example of what I want to build…