I have a data.frame of 30k records (company name and other attributes). dba_nm
is the company name field with longest element < 60 characters.
The R session's memory usage goes up from 100MB to 3GB and hangs when I try the code in ?tm::VectorSource
:
ds <- VectorSource(dat$dba_nm)
inspect(Corpus(ds))