For my SVM, I am using tf-idf on the documents for feature extraction. The tf-idf values are calculated over the whole set of training documents.
Now, when I get a test document that I want to classify, how do I generate the vector for it?
I applied stemming before calculating tf-idf, and I can apply the same stemming to the test document. I also have count_of_words for the training documents.
Should I increment the counts in the training count_of_words with the words of the test document before calculating its tf-idf, or should I use the training counts directly, unchanged?
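
To make the second option concrete, here is a minimal sketch of what "use it directly" could look like: the document frequencies come only from the training documents, and the test document is mapped onto that fixed vocabulary without updating any counts. This is the usual fit-on-train, transform-on-test convention, but the details are assumptions on my part. The function names `fit_idf` and `tfidf_vector`, the toy corpus, and the smoothed IDF formula are hypothetical and not my actual setup, and the tokens are assumed to be already stemmed with the same stemmer.

```python
import math
from collections import Counter


def fit_idf(train_docs):
    """Compute IDF weights from the (already stemmed) training documents only."""
    n_docs = len(train_docs)
    doc_freq = Counter()
    for tokens in train_docs:
        # Count each term once per document (document frequency).
        doc_freq.update(set(tokens))
    # Assumption: one common smoothed IDF variant; other formulas work the same way.
    return {term: math.log((1 + n_docs) / (1 + df)) + 1.0
            for term, df in doc_freq.items()}


def tfidf_vector(tokens, idf, vocabulary):
    """Build a tf-idf vector for one document using frozen training IDF weights."""
    # Terms never seen in training have no dimension in the vector, so drop them.
    counts = Counter(t for t in tokens if t in idf)
    total = sum(counts.values())
    if total == 0:
        return [0.0] * len(vocabulary)
    # Term frequency is relative to the in-vocabulary tokens of this document.
    return [counts[term] / total * idf[term] for term in vocabulary]


# Hypothetical toy data standing in for the stemmed training documents.
train_docs = [
    ["cat", "sit", "mat"],
    ["dog", "sit", "log"],
    ["cat", "chase", "dog"],
]
idf = fit_idf(train_docs)
vocabulary = sorted(idf)  # fixed feature order shared by train and test vectors

# The test document is stemmed the same way; "sofa" is unseen and is ignored.
test_vector = tfidf_vector(["cat", "sit", "sofa"], idf, vocabulary)
```

In this sketch the training statistics are never modified by the test document, so the test vector has exactly the same dimensions as the training vectors the SVM was fit on.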