Let's say I have a very large dataset with results from wine tastings with tasting descriptors stored in one of the variables.
data.frame(c("red","white","rose"),c("grapefruit, raspberry", "sweet, bold", "tannins, long finish"))
The number of possible descriptors is massive. I want to unpack them in a way such that it becomes usable for analysis with machine learning techniques. Should I put each possible descriptor in its own variable, or is there a more efficient and compact way to store such data?
Thank you in advance!