Creating list of data frames for transcriptions in R using lapply or a for loop

Asked Feb 10 '17 at 10:36

Active Feb 10 '17 at 10:52


I'm trying to create a list of all my transcriptions that I would like to run text mining analyses on.

I'm using qdap to read in the transcriptions using the code below:

read.transcript(transcript1_filename,col.names = c("Person","Dialogue"),skip = 5)

This produces a data frame with two columns, one identifying the speaker and the other containing a string of dialogue.

I have a lot of transcriptions, so I want to create a list of these data frames to run further analyses on.

I've tried using lapply like so:

transcript_files = list.files("~/Transcripts",full.names = TRUE)
my_list = list()
my_list= lapply(transcript_files,read.transcript(),col.names = c("Person","Dialogue"),skip = 5)

But this produces the following error:

Error in regexpr("\\.([[:alnum:]]+)$", x) : argument "file" is missing, with no default

I have also tried a for loop like so:

for(i in length(transcript_files)){
my_list[[i]] = read.transcript(transcript_files[i],col.names = c("Person","Dialogue"),skip = 5)
}

But for some reason this only reads in the last file; all other entries in the list are NULL.

No idea what is going wrong here.

edited Feb 10 '17 at 10:52

asked Feb 10 '17 at 10:36

Gerard

What does `transcript_files` look like? – Tonio Liebrand Feb 10 '17 at 10:39
Try with `read.transcript` (no parens) instead of `read.transcript()` in `lapply()` – Aurèle Feb 10 '17 at 10:40
@apom Awesome, removing the parentheses worked! Guess I was just too used to always adding those in automatically! – Gerard Feb 10 '17 at 10:50
You need to enclose the file path in quotes in `list.files` – Jake Kaupp Feb 10 '17 at 10:51
@JakeKaupp ah yes, no that was just a typo in the question here, it is correctly done in my code. Will edit the question. – Gerard Feb 10 '17 at 10:52
@BigDataScientist transcript_files is a character vector of file paths for all of the .docx transcription files. – Gerard Feb 10 '17 at 10:54
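
Following up on the comments, here is a minimal sketch of both approaches. The `lapply` error most likely arises because `read.transcript()`, written with parentheses, is called immediately with no `file` argument, whereas `lapply` expects the function itself. The for loop fills only the last slot because `length(transcript_files)` is a single number, so the body runs once with `i` equal to that number; `seq_along(transcript_files)` yields every index instead. Note that `read_stub` and the file names below are made-up stand-ins so the snippet runs without qdap or real files; in real code `read.transcript` would take the place of `read_stub`.

```r
# Hypothetical stand-in for qdap's read.transcript (assumption, not qdap code):
# it accepts the same three arguments used in the question and returns a
# one-row data frame so the sketch can run without any .docx files.
read_stub <- function(file, col.names, skip = 0) {
  df <- data.frame(basename(file), "...", stringsAsFactors = FALSE)
  names(df) <- col.names
  df
}

# Made-up file paths; in the question these come from list.files().
transcript_files <- file.path("~/Transcripts", c("a.docx", "b.docx", "c.docx"))

# lapply: pass the function itself (no parentheses). The extra named
# arguments are forwarded to every call.
my_list <- lapply(transcript_files, read_stub,
                  col.names = c("Person", "Dialogue"), skip = 5)

# for loop: seq_along() gives 1, 2, ..., n, whereas length() gives only n.
my_list2 <- vector("list", length(transcript_files))
for (i in seq_along(transcript_files)) {
  my_list2[[i]] <- read_stub(transcript_files[i],
                             col.names = c("Person", "Dialogue"), skip = 5)
}
```

Both versions should produce a list with one data frame per file. The `lapply` form is shorter and avoids managing the index by hand, which is where the original loop went wrong.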
