I am trying to scale my feature vector for an algorithm. I have 3 features and 1 target variable. Feature 1: Has a Gaussian-like distribution Feature 2: Skewed, y-value decreases as x increases. Feature 3: Highly skewed. Almost all values are the same. Target Variable: Highly skewed.
My question is: I want to do a MinMaxScaling for the feature and target variables. Is it okay to scale all the features or the skewed ones.
Target Variable looks like feature 3. Since my data for Feature3 and target variable is mostly sparse, is there any alternative to MinMaxScaling?
Also is it okay to use different scalers on training data depending on the distribution of the respective columns ? I am happy to provide more info if the question is not clear :)