I have a somewhat large document and want to do stop-word elimination and stemming on the words of this document with Python. Does anyone know an of the shelf package for these? If not a code which is fast enough for large documents is also welcome. Thanks
Asked
Active
Viewed 2,286 times
2 Answers
4
If for some reason you don't want to use NLTK, you can try PyStemmer. For stop words just download a list (google it) and filter them out.

Miki Tebeka
- 13,428
- 4
- 37
- 49