Questions tagged [feature-scaling]

51 questions
1
vote
2 answers

Normalizing test set but higher range

I'm normalizing and rescaling my training set with:

    # zero mean
    feat = (feat - feat.mean()) / feat.std()
    # scale between -1, 1
    feat = ((feat - feat.min()) / (feat.max() - feat.min())) * 2 - 1

This works great. I transform the test set in the exact…
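A minimal sketch of one common convention for this situation, not necessarily what the asker does: estimate the mean, standard deviation, and min/max on the training set only, then reuse those stored values for the test set. The helper names `fit_scaler` and `transform` and the sample values are hypothetical. Test values that fall outside the training range then land outside [-1, 1].

```python
from statistics import mean, pstdev

def fit_scaler(train):
    """Learn standardisation and min-max parameters from training values only."""
    mu, sigma = mean(train), pstdev(train)
    z = [(x - mu) / sigma for x in train]
    return {"mu": mu, "sigma": sigma, "z_min": min(z), "z_max": max(z)}

def transform(values, p):
    """Apply the stored training parameters; nothing is re-estimated here."""
    z = [(x - p["mu"]) / p["sigma"] for x in values]
    return [(v - p["z_min"]) / (p["z_max"] - p["z_min"]) * 2 - 1 for v in z]

train = [1.0, 2.0, 3.0, 4.0, 5.0]  # hypothetical training feature
test = [0.0, 3.0, 7.0]             # 0.0 and 7.0 lie outside the training range

params = fit_scaler(train)
train_scaled = transform(train, params)  # spans exactly [-1, 1]
test_scaled = transform(test, params)    # may fall below -1 or above 1
```

Whether out-of-range test values are a problem depends on the model; clipping them is one option, but it discards information.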
1
vote
1 answer

Normalization (Feature scaling) of Point Cloud Dataset

I have a point cloud dataset where a single sample is represented by an N * 3 array, where N is the number of points. The dataset contains "M" such point clouds. The range of these point clouds varies widely. Some have very large values (e.g., in terms of…
1
vote
2 answers

Feature scaling converts different values in columns to the same scale

Scaling, for example with Standard Scaler, brings columns with different values onto a common scale. But when building a model from the result, the values which were different earlier are converted to the same distribution with mean = 0 and std = 1, so it should affect the model…
BlackEagle
1
vote
1 answer

Linear Regression - Implementing Feature Scaling

I was trying to implement Linear Regression in Octave 5.1.0 on a data set relating the GRE score to the probability of Admission. The data set is of the sort, 337 0.92 324 0.76 316 0.72 322 0.8 . . . My main Program.m file looks…
0
votes
1 answer

Getting a negative prediction after min-max scaling the price in a linear regression

I was trying to lower my mean squared error cost by scaling the target feature, primarily because its values reach the order of 1e10. I use this dataset from Kaggle to calculate land price: X = LT, Y = Harga…
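As a hedged illustration of the arithmetic involved (the prices and model outputs below are made up, and the helper names are hypothetical): a linear model trained on a min-max scaled target is not constrained to [0, 1], so a slightly negative prediction in scaled space simply maps back to a value below the training minimum once the scaling is inverted.

```python
def minmax_scale(values, lo, hi):
    """Map values so that lo -> 0 and hi -> 1."""
    return [(v - lo) / (hi - lo) for v in values]

def minmax_inverse(scaled, lo, hi):
    """Undo minmax_scale using the same lo and hi."""
    return [s * (hi - lo) + lo for s in scaled]

prices = [2.0e9, 5.0e9, 1.1e10]   # hypothetical training targets
lo, hi = min(prices), max(prices)
scaled_prices = minmax_scale(prices, lo, hi)

scaled_pred = [-0.05, 0.5]        # hypothetical model outputs in scaled space
restored = minmax_inverse(scaled_pred, lo, hi)
# The negative scaled prediction maps back to a positive price
# that is lower than the smallest training price.
```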
0
votes
0 answers

How can I see progress of all Features with a Progress bar

On Azure DevOps board, progress bars are available only at the user story level, but I want to see the progress at higher levels. Do you have a suggestion? I didn't find a way to reflect the progress of features
0
votes
0 answers

Is there any function to append values after one-hot encoding to x and y values?

I am working on forest fire data and my task is to predict the fires based on some features. I've done the encoding part but still cannot use the data in my algorithm, as it's showing "could not convert string to float: 'nov'" when I…
0
votes
0 answers

In linear regression, is the coefficient of untransformed (0,1) categorical data on the same scale as the coefficients of transformed features?

I want to perform a linear regression on a house price dataset to rank the features that most impact the price. I handled categorical data by one-hot encoding. Then I log-transformed all the features except the categorical (0,1) data. After I…
0
votes
0 answers

Feature scaling/normalization on data which has the same unit of measurement (e.g. cm)

As I understand feature scaling/normalization, you need to do it because the features have different measurement units and must be brought to a uniform scale. In my opinion, this applies only to data with different measurement units (e.g.…
0
votes
0 answers

Feature Scaling in Hierarchical Clustering

I know that feature scaling is always a requirement for clustering algorithms. Currently I am implementing hierarchical clustering on this dataset, and I will use only the annual income and the spending rate features. Now I am confused about whether to use…
0
votes
0 answers

Effect of Feature Scaling in XGBoost

A long time ago, I participated in this Kaggle competition: https://www.kaggle.com/competitions/bike-sharing-demand/data. The code is on Kaggle: https://www.kaggle.com/code/tchamna/bike-ride-sharing-prediction-xgboost-final. Problem statement: You…
0
votes
0 answers

Scaling with Kmeans Clustering

I have a clustering problem I'd like to solve and I'm wondering if scaling is recommended for the way my data is structured. Below is a hypothetical problem which should be relatable to my actual use case. Say we're looking at data from a grocery,…
D Note
0
votes
0 answers

Calculate single weight based on variables of differing importance and thresholds

Seeking guidance please. I have a dataset of street-view image_ids and their detected road surface features. The feature values represent the percentage of pixels covering the road in each image. I want to calculate a trafficability weight (between…
0
votes
0 answers

Feature rescaling for k-means clustering

Is it correct to rescale only a large integer feature and leave the percentage features as they are when running the k-means algorithm for segmentation analysis? E.g. one feature is the population of the city (150 000), and the remaining features are percentages (e.g. 0.46) of energy…
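A small sketch of why the question arises, using made-up cities and a hypothetical helper `scale_first_column`: k-means relies on Euclidean distance, so an unscaled population column dominates the distance entirely. Min-max scaling only that column puts it on the same [0, 1] footing as the percentage feature. This shows the effect on distances only; it does not settle which scaling is appropriate for a given analysis.

```python
from math import dist

# Hypothetical rows: (population, share of some energy source)
cities = [(150_000, 0.46), (152_000, 0.10), (900_000, 0.45)]

def scale_first_column(rows):
    """Min-max scale column 0 to [0, 1]; leave the other columns untouched."""
    first = [r[0] for r in rows]
    lo, span = min(first), max(first) - min(first)
    return [((r[0] - lo) / span,) + tuple(r[1:]) for r in rows]

# Raw distances: the percentage column contributes almost nothing.
raw_d01 = dist(cities[0], cities[1])
raw_d02 = dist(cities[0], cities[2])

# After scaling the population column, both features influence the distance.
scaled = scale_first_column(cities)
sc_d01 = dist(scaled[0], scaled[1])
sc_d02 = dist(scaled[0], scaled[2])
```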
0
votes
2 answers

Why don't tree- and ensemble-based algorithms need feature scaling?

Recently, I've become interested in data analysis, so I researched how to do a machine-learning project and did one myself. I learned that scaling is important when handling features, so I scaled every feature while using tree models like Decision…
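One way to see the usual argument, sketched with a toy single-feature split search rather than any particular library's tree: a threshold split depends only on the ordering of feature values, and min-max scaling (like any strictly increasing transform) preserves that ordering, so the same samples end up on each side. The data and the helper names `sse` and `best_split` are hypothetical.

```python
def sse(values):
    """Sum of squared deviations from the mean."""
    values = list(values)
    m = sum(values) / len(values)
    return sum((v - m) ** 2 for v in values)

def best_split(xs, ys):
    """Return the set of sample indices sent left by the best threshold on xs."""
    order = sorted(range(len(xs)), key=lambda i: xs[i])
    best_cost, best_left = None, None
    for k in range(1, len(order)):
        left, right = order[:k], order[k:]
        if xs[left[-1]] == xs[right[0]]:
            continue  # no threshold can separate equal feature values
        cost = sse(ys[i] for i in left) + sse(ys[i] for i in right)
        if best_cost is None or cost < best_cost:
            best_cost, best_left = cost, frozenset(left)
    return best_left

x_raw = [150000.0, 20000.0, 900000.0, 45000.0, 600000.0]  # hypothetical feature
y = [1.0, 0.2, 3.1, 0.4, 2.9]                             # hypothetical target

lo, hi = min(x_raw), max(x_raw)
x_scaled = [(v - lo) / (hi - lo) for v in x_raw]

left_raw = best_split(x_raw, y)
left_scaled = best_split(x_scaled, y)  # same partition as left_raw
```

The threshold value itself changes with the scale, but the partition of the samples, and therefore the fitted tree's predictions on the training data, does not.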