Questions tagged [feature-scaling]
51 questions
1
vote
2 answers
Normalizing test set but higher range
I'm normalizing and rescaling my training set with:
# zero mean
feat = (feat - feat.mean()) / feat.std()
# scale between -1, 1
feat = ((feat - feat.min()) / (feat.max() - feat.min())) * 2 - 1
This works great. I transform the test set in the exact…

rodrigo-silveira
- 12,607
- 11
- 69
- 123
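The excerpt above is cut off, so the following is only a minimal sketch of one common reading of the title: the mean, standard deviation, minimum and maximum are learned from the training set and then reused on the test set. Under that assumption, test values that fall outside the training range map outside [-1, 1]. The helper names `fit_scaler` and `transform` and the sample numbers are hypothetical, not taken from the question.

```python
from statistics import mean, pstdev

def fit_scaler(train):
    """Learn standardization and min-max parameters from the training values only."""
    mu = mean(train)
    sigma = pstdev(train)
    standardized = [(x - mu) / sigma for x in train]
    return mu, sigma, min(standardized), max(standardized)

def transform(values, params):
    """Apply the training-set parameters: standardize, then map to [-1, 1]."""
    mu, sigma, lo, hi = params
    scaled = []
    for x in values:
        z = (x - mu) / sigma
        scaled.append((z - lo) / (hi - lo) * 2 - 1)
    return scaled

# Hypothetical data: the test set contains values below and above the training range.
train = [10.0, 20.0, 30.0, 40.0]
test = [5.0, 25.0, 50.0]

params = fit_scaler(train)
train_scaled = transform(train, params)  # spans exactly -1 to 1
test_scaled = transform(test, params)    # 5.0 maps below -1, 50.0 maps above 1

print(train_scaled)
print(test_scaled)
```

Reusing the training parameters keeps both sets on the same scale; the out-of-range test values are a consequence of the data, not of the transform.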
1
vote
1 answer
Normalization (Feature scaling) of Point Cloud Dataset
I have a point cloud dataset in which each sample is an N × 3 array, where N is the number of points, and the dataset contains M such point clouds. The ranges of these point clouds vary widely. Some have very large values (e.g., in terms of…

Rajat Sharma
- 47
- 1
- 15
1
vote
2 answers
Feature scaling converts different values in columns to the same scale
Scaling (for example, with StandardScaler) brings columns with different value ranges onto a common scale. But when building a model on the scaled data, values that were different earlier are converted to the same scale with mean = 0 and std = 1, so it should affect the model…

BlackEagle
- 143
- 2
- 11
1
vote
1 answer
Linear Regression - Implementing Feature Scaling
I was trying to implement Linear Regression in Octave 5.1.0 on a data set relating the GRE score to the probability of Admission.
The data set is of the sort,
337 0.92
324 0.76
316 0.72
322 0.8
. . .
My main Program.m file looks…

sayantank
- 119
- 1
- 1
- 6
0
votes
1 answer
Getting a negative prediction after min-max scaling the price in a linear regression
I was trying to lower my mean squared error cost by scaling the target feature, primarily because it reaches the order of 1e10.
I use this dataset from Kaggle to calculate land price, with X = LT and Y = Harga…
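As a rough illustration of how a negative prediction can arise here, the sketch below fits a one-feature least-squares line to a min-max scaled target and predicts for an input below the training range. The numbers are hypothetical and are not the Kaggle data from the question; `fit_line` is an illustrative helper. Under these assumptions, a negative scaled prediction just means "below the training minimum", and inverting the scaling returns it to price units.

```python
def fit_line(xs, ys):
    """Ordinary least squares for y = a * x + b with a single feature."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    a = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / sum((x - mx) ** 2 for x in xs)
    return a, my - a * mx

# Hypothetical land sizes and prices (assumed values for illustration only).
area = [100.0, 200.0, 300.0, 400.0]
price = [2.0e9, 4.0e9, 6.0e9, 8.0e9]

# Min-max scale the target to [0, 1] using the training minimum and maximum.
p_min, p_max = min(price), max(price)
price_scaled = [(p - p_min) / (p_max - p_min) for p in price]

a, b = fit_line(area, price_scaled)

# Predict for an area smaller than anything in the training data.
pred_scaled = a * 50.0 + b
# Invert the min-max scaling to get back to the original price units.
pred_price = pred_scaled * (p_max - p_min) + p_min

print(pred_scaled, pred_price)
```

The model is unconstrained in the scaled space, so it can extrapolate below 0; whether the inverted value is sensible depends on the data and the model.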
0
votes
0 answers
How can I see progress of all Features with a Progress bar
On Azure DevOps board, progress bars are available only at the user story level, but I want to see the progress at higher levels.
Do you have a suggestion?
I didn't find a way to reflect the progress of features

Shirel Golan
- 1
- 1
0
votes
0 answers
Is there any function to append values after one-hot encoding to x and y values?
I am working on forest fire data and my task is to predict the fires based on some features.
I've done the encoding part but still cannot use the numeric data in my algorithm, because it shows "could not convert string to float: 'nov'" when I…
0
votes
0 answers
In linear regression, is the coefficient of untransformed (0,1) categorical data on the same scale as the coefficients of transformed features?
I want to perform a linear regression on a house price dataset to rank the features that most impact the price. I handled categorical data by one-hot encoding. Then, I log-transformed all the features except the categorical data (0,1). After I…

izzat shazwan
- 11
- 2
0
votes
0 answers
Feature scaling/normalization on data which has the same unit of measurement (e.g., cm)
In my understanding, feature scaling/normalization is needed because features have different measurement units and must be brought to a uniform scale. And in my opinion this only applies to data with different measurement units (e.g.,…

mlaus
- 15
- 3
0
votes
0 answers
Feature Scaling in Hierarchical Clustering
I know that feature scaling is always a requirement for clustering algorithms. Currently I am implementing hierarchical clustering on this dataset, using only the annual income and the spending rate features. Now I am confused about whether to use…

AAA
- 305
- 1
- 7
0
votes
0 answers
Effect of Feature Scaling in XGBoost
A long time ago, I participated in this Kaggle competition: https://www.kaggle.com/competitions/bike-sharing-demand/data.
Check the code on Kaggle: https://www.kaggle.com/code/tchamna/bike-ride-sharing-prediction-xgboost-final:
Problem statement: You…

Shck Tchamna
- 127
- 7
0
votes
0 answers
Scaling with K-means Clustering
I have a clustering problem I'd like to solve and I'm wondering if scaling is recommended for the way my data is structured.
Below is a hypothetical problem which should be relatable to my actual use case. Say we're looking at data from a grocery,…

D Note
- 1
- 1
0
votes
0 answers
Calculate single weight based on variables of differing importance and thresholds
Seeking guidance please. I have a dataset of street-view image_ids and their detected road surface features. The feature values represent the percentage of pixels covering the road in each image.
I want to calculate a trafficability weight (between…

taylort139
- 11
- 2
0
votes
0 answers
Feature rescaling for k-means clustering
Is it correct to rescale only a large integer feature and leave the percentage features as they are when running the k-means algorithm for segmentation analysis?
E.g., one feature is the population of the city (150 000), and the remaining features are percentages (e.g. 0.46) of energy…

Proddie
- 1
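To make the concern in this question concrete, here is a small sketch of how Euclidean distance (which k-means relies on) behaves before and after rescaling only the population column. The three cities and the helper names are hypothetical; only the example magnitudes (150 000 and 0.46) echo the excerpt. It does not settle whether the percentage features need further scaling, which depends on their spread.

```python
import math

# Hypothetical cities: (population, share expressed as a fraction).
cities = [(150_000, 0.46), (152_000, 0.10), (400_000, 0.45)]

def euclidean(a, b):
    """Plain Euclidean distance between two feature tuples."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def rescale_population(rows):
    """Min-max scale the population column to [0, 1]; leave the share untouched."""
    pops = [p for p, _ in rows]
    lo, hi = min(pops), max(pops)
    return [((p - lo) / (hi - lo), share) for p, share in rows]

# Raw distance between the first two cities: the 0.36 gap in share is
# invisible next to the 2 000 gap in population.
raw_d01 = euclidean(cities[0], cities[1])

# After rescaling population only, the share gap dominates this distance.
scaled = rescale_population(cities)
scaled_d01 = euclidean(scaled[0], scaled[1])

print(raw_d01, scaled_d01)
```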
0
votes
2 answers
Why don't tree- and ensemble-based algorithms need feature scaling?
Recently, I've become interested in data analysis.
So I researched how to do a machine learning project and did one by myself.
I learned that scaling is important when handling features.
So I scaled every feature while using a tree model like Decision…

yoon-seul
- 13
- 3
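One way to see the idea behind this question is a sketch of a single threshold split, the building block of a decision tree: the split depends only on the ordering of feature values, so an order-preserving rescaling such as standardization leaves the resulting partition unchanged. The functions and data below are hypothetical illustrations, not code from the question or from any library.

```python
from statistics import mean, pstdev

def sse(vals):
    """Sum of squared deviations from the mean."""
    m = mean(vals)
    return sum((v - m) ** 2 for v in vals)

def best_split_partition(xs, ys):
    """Indices sent left by the threshold split that minimizes squared error."""
    order = sorted(range(len(xs)), key=lambda i: xs[i])
    best_err, best_left = float("inf"), None
    for k in range(1, len(order)):
        if xs[order[k - 1]] == xs[order[k]]:
            continue  # no threshold can separate equal values
        left, right = order[:k], order[k:]
        err = sse([ys[i] for i in left]) + sse([ys[i] for i in right])
        if err < best_err:
            best_err, best_left = err, frozenset(left)
    return best_left

# Hypothetical feature on a large scale, with a target to predict.
xs = [1200.0, 300.0, 4500.0, 800.0, 5200.0]
ys = [1.0, 0.9, 3.1, 1.1, 3.0]

# Standardize the feature (zero mean, unit standard deviation).
mu, sigma = mean(xs), pstdev(xs)
xs_scaled = [(x - mu) / sigma for x in xs]

raw_partition = best_split_partition(xs, ys)
scaled_partition = best_split_partition(xs_scaled, ys)
print(raw_partition == scaled_partition)
```

Only the threshold value changes with the scale; which samples go left and right stays the same in this sketch.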