How, to operate following standard operations, on a character vector?
(Need a dictionary for a DTM (classification). So in order to match the text entries, where this operations were already been made, i have to change the my dictionary terms accoringly.)
library(tm)
dicBin <- c("rosa", "rosig", "grĂ¼n ", "Blau", "gelb", "lila", "orange", "pink", "%", "mm", "mp", "*", "monat")
dicBin.corp <- tm_map(dicBin.corp, stemDocument, language = "german") # Initially I hoped that tm_map would work on a vector. Since it doesn't I tried to transform it to Corpus
dicBin.corp <- tm_map(dicBin.corp, stripWhitespace)
dicBin.corp <- tm_map(dicBin.corp, tolower)
Here the dicBin.corp
has only "%"
inside EDITED
## transform back to a vector
dicBin <- dicBin.corp # How to do also this properly?