I'm trying to analyse qualitative survey responses using tidy text mining in R. I have tokenised my data into sentences. In some cases, participants have reported multiple behaviours within a single sentence (e.g. "apples and oranges"), and I want to analyse these separately. Is it possible to recode the initial data so that the behaviours are separated at the tokenisation stage? I tried inserting a full stop between the behaviours using the following code, but it did not work:
data <- data %>%
  mutate(behaviour = recode(column, "apples and oranges" = "apples. Oranges"))

tidy_text_data <- data %>%
  unnest_tokens(output = "sentences", input = behaviour, token = "sentences")
Any suggestions?
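
One possible direction, offered as a sketch rather than a confirmed fix: `recode()` only replaces values that equal the given string exactly, so it leaves a longer response that merely contains "apples and oranges" unchanged. Replacing the substring with `stringr::str_replace_all()` before tokenising may behave as intended. The example data below is hypothetical, `column` stands in for the free-text survey column, and the capital "O" is kept on the assumption that the sentence tokeniser is more likely to split before an uppercase letter.

```r
library(dplyr)
library(stringr)
library(tidytext)

# Hypothetical example data; `column` stands in for the free-text survey column
data <- tibble(
  id = 1:2,
  column = c("I ate apples and oranges every day", "I went for a walk")
)

tidy_text_data <- data %>%
  # str_replace_all() rewrites matching substrings, whereas recode()
  # only replaces values that match the whole string exactly
  mutate(behaviour = str_replace_all(
    column,
    fixed("apples and oranges"),
    "apples. Oranges"
  )) %>%
  # Tokenise the modified text into sentences, one row per sentence
  unnest_tokens(output = "sentences", input = behaviour, token = "sentences")

print(tidy_text_data)
```

If there are many such phrases, `str_replace_all()` also accepts a named character vector of pattern/replacement pairs, which might be easier to maintain than one call per phrase.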