What's the difference between the types of feature scaling?

Question

Wikipedia states three methods of feature scaling. Which should be used when? (what are the considerations?)

To be specific, I need it for Sentiment Analysis on phrases, implemented with SVM.

(NOTE: I've seen this post. It explains the different methods quite well, but doesn't say anything about when each should be used).

Thank you :)

Which application you are targeting is by and large irrelevant. The answer depends entirely on what your feature model looks like. — tripleee, Jan 16 '14 at 09:14

score 0 · Accepted Answer · answered Jan 15 '14 at 19:23

Actually this is quite hard to give any reasonable rules for selecting scaling over standarization. Standarization of your data has a good theoretical justification and is less influenced by outliers than scaling. As the result the most commonly used method of preprocessing is standarization.

In particular, if you ask about standarization than you use some kind of bag of words representation of your data. In such a case tf-idf is the most obvious selection of the data representation, which in fact is hardly influenced by any scaling/standarization as it is quite well standarized itself (due to the internal normalization and log-scaling).

What's the difference between the types of feature scaling?

1 Answers1