I am trying to find top 100/1000 words based on tfidfVectorizer output of Python's scikit-learn library. Is there a way to do it using a function from the scikit libraries?
Thanks for help
I am trying to find top 100/1000 words based on tfidfVectorizer output of Python's scikit-learn library. Is there a way to do it using a function from the scikit libraries?
Thanks for help
What do you mean by top 100/1000 words? The most frequent words in a dataset? You can use the Counter class of the Python standard library to do that. No need for scikit-learn.