Questions tagged [qdap]

qdap (Quantitative Discourse Analysis Package) is an R package designed to assist in quantitative discourse analysis. The package stands as a bridge between qualitative transcripts of dialogue and statistical analysis & visualization.

110 questions
1
vote
3 answers

word_stats function from qdap package application on a dataframe

I have a dataframe, where one column contains strings. q = data.frame(number=1:2,text=c("The surcingle hung in ribands from my body.", "But a glance will show the fallacy of this idea.")) I want to use the word_stats function for each individual…
Akhil
  • 165
  • 1
  • 1
  • 8
1
vote
1 answer

Compare the bag of words in two document and find the matching word and their frequency in second document

I have calculated the bag of words for 'yelp.csv', 'yelpp.csv', 'yelpn.csv' and created the matrix of individuals dataset's word frequency. Now, I want to compare the bag of words of yelp with yelpn and check how many words in yelp appears in yelpn…
Ash
  • 23
  • 1
  • 5
1
vote
1 answer

Within the context of tm::content_transformer() how would I use mgsub?

qdap::mgsub takes the following parameters: mgsub(x, pattern, replacement) Within library(tm) corpus transformation you can wrap non tm functions within content_transformer(), e.g. corpus <- tm_map(corpus, content_transformer(tolower)) Here is a…
Doug Fir
  • 19,971
  • 47
  • 169
  • 299
1
vote
0 answers

adding words to qdap dictionary in R

I am using qdap package for spell checking, but the default dictionary doesn't have many words such as cities,states, countries, or languages. I know I can add words to the dictionary like below dict <- c(qdapDictionaries::GradyAugmented,…
1
vote
1 answer

dplyr filter using qdap::which_misspelt OR dplyr filter with a nested function

A small data frame: words <- data.frame(terms = c("qhick brown fox", "tom dick harry", "cats dgs")) If I use qdap::which_misspelled I can find out missspelled words: > which_misspelled(words) 1 8 "qhick" "dgs"…
Doug Fir
  • 19,971
  • 47
  • 169
  • 299
1
vote
0 answers

Can you install packages in R without imports or dependencies?

I work on a computer that doesn't have internet access. I download all of my R packages and install them from .zip files. One issue, however, is that when I install a package, it will require other packages because I load them into the library. …
Alex
  • 77
  • 1
  • 10
1
vote
1 answer

Can't seem to get vectorized gsub to work with library "qdap" and mgsub

I was looking at this question: R: gsub, pattern = vector and replacement = vector I am trying a simple example and I can't get it to work. testList <- c("apple", "orange", "banana1", "apple4", "orange 8", "banana 10") repl <- c("apple", "orange",…
1
vote
4 answers

Issues with installing rJava or qdap in R

when trying to install qdap or rJava, it's always returning image not found error library(rJava) Error: package or namespace load failed for 'rJava': .onLoad failed in loadNamespace() for 'rJava', details: call: dyn.load(file,…
santoku
  • 3,297
  • 7
  • 48
  • 76
1
vote
0 answers

Omitting Words from Spellcheck in qdap

This is my first post with StackOverflow, I apologize if I violate any rules. I am working with the R package qdap on spellchecking very messy medical record text. The goal of this work is to identify misspellings of drug side effects in order to…
Scott
  • 15
  • 3
1
vote
0 answers

Finding association among all terms in a term document matrix - Huge size

I have a requirement to find correlation among every term in a term document matrix. The number of terms in the matrix is 181841 and docs are 191431. I need to get the correlation coefficient for every term with the other term. I have used a for…
NinjaR
  • 621
  • 6
  • 22
1
vote
2 answers

Replace the string value with value in the find list in R

I have a dataset that has a column like string<-c('lib1_Rstudio_case1','lib2_Rstudio_case1and2','lib5_python_notthe correct_language','lib3_Jupyter_really_good','lib1_spyder_nice','lib1_R_the_core') …
nityansh seth
  • 31
  • 2
  • 9
1
vote
0 answers

Creating list of data frames for transcriptions in R using lapply or a for loop

I'm trying to create a list of all my transcriptions that I would like to run text mining analyses on. I'm using qdap to read in the transcriptions using the code below: read.transcript(transcript1_filename,col.names = c("Person","Dialogue"),skip =…
Gerard
  • 159
  • 1
  • 2
  • 11
1
vote
2 answers

Extract only words containing ASCII characters from vector of strings

I'm stuck with it, so, please, any advice is welcome. b <- str_extract_all(c('hello ringпрг','trust'), regex("[a-z]+", TRUE)) Returns a list: List of 2 $ : chr [1:2] "hello" "ring" $ : chr "trust" But I want to have a vector with strings of…
Shin
  • 251
  • 1
  • 3
  • 8
1
vote
1 answer

Custom words in Package 'qdap' in R

I am using the qdap package in R to do a spell check. I run the below code and gives an output like this which_misspelled("I use a 50Gb broadband connection") > 4 5 >"gb" "broadband" The words make sense but the corrections for these…
Jil Jung Juk
  • 690
  • 2
  • 9
  • 21
1
vote
0 answers

Is it possible to use a German dictionary in qdap check_spelling_interactive?

I'm analyzing German comments of an app and I want to use check_spelling_interactive of the qdap package. Is it possible to use a German dictionary instead of the qdap dictionary?
Hans_Bell
  • 11
  • 1