Questions tagged [qdap]

qdap (Quantitative Discourse Analysis Package) is an R package designed to assist in quantitative discourse analysis. The package stands as a bridge between qualitative transcripts of dialogue and statistical analysis & visualization.

110 questions
0
votes
2 answers

Why is my R script using all the computer's memory?

I wrote a simple script which opens each .docx document from my folders and looks for certain words. If any of the words exist, it extracts an ID number and then moves on to the next document. The problem is that after ~1500 documents, it consumes…
0
votes
1 answer

Package ("qdap") fails to load with multiple errors

While installing qdap I did not encounter any errors. But when loading the library, I get the following errors. > library(qdap) Loading required package: qdapDictionaries Loading required package: qdapRegex Loading required package: qdapTools Loading…
Salih
0
votes
1 answer

Strange behavior in dplyr when mapping language vector on tm::stopwords

I want to extract stop words for several languages in one dplyr pipeline using this code: library(tidyverse) library(qdap) library(tm) map_dfr(tibble(language=c("english", "italian")), tm::stopwords) Which gives me uninformative…
Alexander Borochkin
0
votes
1 answer

How to filter out all short strings (2 characters or fewer) in a corpus?

Given a simple string: t <- "hello world ww ff a wr gj dkjffdkn kuku" VCorpus(VectorSource(t)) I want to filter out all substrings of length 2 or less. How can I do this using the qdap or tm packages? I know I can use regex for this but is there a…
SteveS
0
votes
1 answer

Combine tidy text with synonyms to create dataframe

I have a sample data frame as below: quoteiD <- c("q1","q2","q3","q4", "q5") quote <- c("Unthinking respect for authority is the greatest enemy of truth.", "In the middle of difficulty lies opportunity.", "Intelligence is the ability to…
R noob
0
votes
1 answer

R qdap Search exclude syntax

I have the following output from data that I have downloaded from the Wall Street Journal. > Search(MySymList, " Net Income") Fiscal year is July-June. All values AUD Millions. 2018 2017 2016 2015 2014 5-year trend 82 …
Stephen
0
votes
1 answer

Apply a custom (weighted) dictionary to text based on sentiment analysis

I am looking to adjust this code so that I can assign each of these modal verbs a different weight. The idea is to use something similar to the NRC library, where the "numbers" 1-5 represent categories, rather than…
0
votes
1 answer

Prevent the rm_stopwords function creating a list

I used the rm_stopwords function from the qdap package to remove the stopwords and the punctuation from a text column in my data frame. library(qdap) library(dplyr) library(tm) glimpse(dat_full) Observations: 500 Variables: 9 $ reviewerID …
Banjo
0
votes
2 answers

Keep special characters in a word-frequency matrix

I analyze some brands in text to find KPIs like ad recognition. However, brands that contain special characters are destroyed by my code so far. library(qdap) library(stringr) test <- c("H&M", "C&A", "Zalando", "Zalando", "Amazon",…
user7353167
0
votes
1 answer

R - Text Analysis - Print specific text that contains a bi-gram

Text analysis with R. My dataset is 2000 comments from 2000 different surveys. I have created bi-grams. I have checked the frequency of words, then word cluster analysis with hclust(), then word association with findAssocs, for example,…
Robbie
0
votes
1 answer

R - Text Analysis - Misleading results

I am doing some text analysis of comments from bank customers related to mortgages, and I have found a couple of things I do not understand. 1) After cleaning the data without applying stemming and checking the dimensions of the TDM, the number of terms (2173)…
Robbie
0
votes
1 answer

Converting a text corpus of character string to character vector before using the stringi package

I have a corpus containing two text files that I imported as: temp = list.files(pattern = ".txt") mydata = lapply(temp, read.delim, sep ="\t", quote = "") mydata the output class was list but I converted it to character as: class(mydata) …
0
votes
1 answer

Update qdap Dictionary for Sentiment Analysis

I am using the polarity function from qdap. There are a few words that I want to add to the dictionary as negative when said in combination, for instance "Pretty Bad". The polarity score becomes neutral when this is sent into the polarity function. >…
Rana Usman
0
votes
2 answers

R3.5.0 and data.tables not working

edited 2018-04-26: Changed title as per comments, this is a broader issue with R3.5.0 original: I'm pretty new to R, and not savvy with all of its joys. I've just had my work computer upgraded (result!) to Windows 10 with Java version 8.1 64bit;…
AJP
0
votes
0 answers

Using qdap::check_spelling is taking a very long time; can I make this more efficient?

I'm trying to use qdap::check_spelling() on 7M very short sentences (e.g. 1 - 4 word sentences). I'm running the script via ssh/linux and after about 6 hours of running I'm getting a "killed" message which I think means I'm using up a lot of memory?…
Doug Fir