Aster user here who is trying to move completely over to Python for basic text analytics. I am trying to replicate the output of the Aster ngram function in Python using nltk or some other module. I need to be able to do this for n-grams of length 1 through 4, with the output written to a CSV file.
DATA:
Unique_ID, Text_Narrative
OUTPUT NEEDED:
Unique_ID, ngram(token), ngram(frequency)
Example output:
- 023345 "I" 1
- 023345 "Love" 1
- 023345 "Python" 1
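
For reference, here is a minimal sketch of the kind of output I'm after, using only the standard library. It assumes plain whitespace tokenization with no lower-casing or punctuation stripping, which may not match what Aster does, and the function names `ngram_counts` and `write_ngrams` are just placeholders I made up.

```python
import csv
import io
from collections import Counter


def ngram_counts(text, max_n=4):
    """Count n-grams of length 1..max_n in one narrative (whitespace tokens)."""
    tokens = text.split()
    counts = Counter()
    for n in range(1, max_n + 1):
        # Zipping n shifted copies of the token list yields each n-gram in order.
        for gram in zip(*(tokens[i:] for i in range(n))):
            counts[" ".join(gram)] += 1
    return counts


def write_ngrams(rows, out, max_n=4):
    """rows: iterable of (unique_id, text_narrative); out: writable text stream."""
    writer = csv.writer(out)
    writer.writerow(["Unique_ID", "ngram", "frequency"])
    for unique_id, text in rows:
        for gram, freq in ngram_counts(text, max_n).items():
            writer.writerow([unique_id, gram, freq])


# Hypothetical sample row matching the example output above.
# The ID is kept as a string so the leading zero survives.
buffer = io.StringIO()
write_ngrams([("023345", "I Love Python")], buffer)
print(buffer.getvalue())
```

To write to disk instead of a string buffer, pass a file opened with `open("ngrams.csv", "w", newline="")`. If nltk is preferred, `text.split()` could presumably be swapped for `nltk.word_tokenize` and the zip trick for `nltk.util.ngrams`, but I have not checked that against Aster's output.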