I have a pandas data frame that has account information and a reason for canceling. I have cleaned the data/lemmatized/removed my own stop words to come up with n grams and frequency. How do I add all of the ngrams back to the original data set so the frequencies are with the account level information? Ideally I want to take this and output a file that I can give to the business.
Is there a way I can use the sparse matrix to accomplish this? Not sure if this is possible or even scalable to larger data sets.
Below is a picture of the frequencies I want to attach to the original data frame some how.