I want to classify text using sklearn. I started with a bag-of-words representation, but it produces a very large feature space (more than 10,000 features), so I used SVD to reduce it to 100 features.
Now I want to add some other features, such as the number of words, the number of positive words, the number of pronouns, etc. There are 10 or fewer of these additional features, which is very few compared with the 100 bag-of-words features.
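For concreteness, the setup described above might look roughly like the following sketch. The toy corpus, the `POSITIVE_WORDS` and `PRONOUNS` sets, and the `handcrafted_features` helper are hypothetical stand-ins, and `n_components=3` is used only because the toy vocabulary is tiny (the real setup uses 100).

```python
import numpy as np
from sklearn.decomposition import TruncatedSVD
from sklearn.feature_extraction.text import CountVectorizer

# Toy corpus standing in for the real data (hypothetical examples).
docs = [
    "I love this movie, it is great",
    "This film was terrible and boring",
    "She said the plot was good and the acting was great",
    "We hated the ending, it was bad",
    "They think it is a good and happy story",
    "He found it sad, boring and bad",
]

POSITIVE_WORDS = {"love", "great", "good", "happy"}  # hypothetical lexicon
PRONOUNS = {"i", "she", "he", "we", "they", "it"}    # hypothetical list


def handcrafted_features(texts):
    """Return one row per text: [# words, # positive words, # pronouns]."""
    rows = []
    for text in texts:
        tokens = [t.strip(".,!?").lower() for t in text.split()]
        rows.append([
            len(tokens),
            sum(t in POSITIVE_WORDS for t in tokens),
            sum(t in PRONOUNS for t in tokens),
        ])
    return np.array(rows, dtype=float)


# Bag of words, then SVD. Only 3 components here because the toy
# vocabulary is tiny; the setup in the question uses 100.
bow = CountVectorizer().fit_transform(docs)
svd_features = TruncatedSVD(n_components=3, random_state=0).fit_transform(bow)

extra_features = handcrafted_features(docs)

# Final design matrix: SVD columns followed by the hand-crafted columns.
X = np.hstack([svd_features, extra_features])
print(X.shape)
```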
This raises two questions:
- Is there a function in sklearn that can change the weight of the additional features to make them more important?
- How do I check whether the additional features are important to the classifier?
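
As a possible starting point for both questions, here is a minimal sketch, not a definitive answer. It assumes `FeatureUnion` with its `transformer_weights` parameter for the weighting and `sklearn.inspection.permutation_importance` for checking importance. The toy corpus, labels, word lists, `count_features` helper, and the weight of 3.0 are all hypothetical, and `N_SVD = 3` stands in for the 100 components in the real setup.

```python
import numpy as np
from sklearn.decomposition import TruncatedSVD
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.inspection import permutation_importance
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import FeatureUnion, make_pipeline
from sklearn.preprocessing import FunctionTransformer, StandardScaler

# Hypothetical toy data: 1 = positive, 0 = negative.
docs = [
    "I love this movie, it is great",
    "This film was terrible and boring",
    "She said the plot was good",
    "We hated the ending",
    "They think it is a good and happy story",
    "He found it sad and bad",
    "A great cast and a happy ending",
    "The script was bad",
]
y = np.array([1, 0, 1, 0, 1, 0, 1, 0])

POSITIVE_WORDS = {"love", "great", "good", "happy"}  # hypothetical lexicon
PRONOUNS = {"i", "she", "he", "we", "they", "it"}    # hypothetical list


def count_features(texts):
    """Return one row per text: [# words, # positive words, # pronouns]."""
    rows = []
    for text in texts:
        tokens = [t.strip(".,!?").lower() for t in text.split()]
        rows.append([
            len(tokens),
            sum(t in POSITIVE_WORDS for t in tokens),
            sum(t in PRONOUNS for t in tokens),
        ])
    return np.array(rows, dtype=float)


N_SVD = 3  # would be 100 in the setup from the question

# transformer_weights multiplies each block's output by a constant,
# so the "extra" block is scaled up relative to the SVD block.
union = FeatureUnion(
    [
        ("svd", make_pipeline(
            CountVectorizer(),
            TruncatedSVD(n_components=N_SVD, random_state=0),
        )),
        ("extra", make_pipeline(
            FunctionTransformer(count_features),
            StandardScaler(),
        )),
    ],
    transformer_weights={"svd": 1.0, "extra": 3.0},  # arbitrary example weights
)
X = union.fit_transform(docs)

clf = LogisticRegression(max_iter=1000).fit(X, y)

# Permutation importance: shuffle one column at a time and measure
# how much the score drops. Ideally this is run on held-out data.
result = permutation_importance(clf, X, y, n_repeats=10, random_state=0)

names = [f"svd_{i}" for i in range(N_SVD)] + ["n_words", "n_positive", "n_pronouns"]
for name, mean, std in zip(names, result.importances_mean, result.importances_std):
    print(f"{name}: {mean:.3f} +/- {std:.3f}")
```

Rescaling a block of features like this mainly matters for models that are sensitive to feature scale, such as regularized linear models or distance-based methods. Tree-based models are unaffected by multiplying a feature by a positive constant, so the weights would make no difference there.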