Questions tagged [standardization]

Standardization, or normalization, is a process used to make a vector of real number values have a mean of zero and a standard deviation of one. Also called standard scores or z-scores.

72 questions
0
votes
0 answers

Data standardisation from a dataset in matlab, why do i have this error?

This program has to standardise data from 'LoadLargeDataSetGranese' (it's a dataset). I have to compare different variables in the same dataset. That's my error: Index in position 2 exceeds array bounds. Index must not exceed 114. Error in…
0
votes
0 answers

Is there any method to standarize time series data?

I want to do time series cluster analysis by using hclust or tsclust. My data is GRDP data by year for a region. the data is look like area_name 2012 2013 2014 2015 2016 2017 2018 .. area _ A value value value value value value…
0
votes
0 answers

Why the performance of a Feedforward Neural Network after performing normalization on the training data?

I have a feedforward neural network with 3 hidden layers. I train it with a data that is not standardized and it have accuracy of 90% on the test data. Then I normalize the data and the accuracy of the model decreases to 55%? (I expected it to be…
0
votes
0 answers

Standardising data for shrinkage with categorical variables

I am analysing a data set race_dfin R to do with races and their times. Each row has a categorical variable race_df$raceID (so I will use a factor variable to simulate an indicator variable for each race) and has associated race_df$lap_time. I want…
0
votes
0 answers

What is the name of the table operation that standardizes the template structure?

I am struggling to find the proper term for the following table operation: I take the cross product of all years (2018-2021 in our example) and companies (in the left table) left join the original dataset to the newly created dataset (~template) I…
0
votes
0 answers

Error in the standardization of variables

I'm trying to standardize my variables by using a code that I found in a previous question R loop to standardize variables using data.table However, I got an error message, so I'm not sure what the reason is. standardise = function(x){(x-mean(x,…
N_H
  • 25
  • 4
0
votes
0 answers

Why does pandas DataFrame giving 1-Dimesional Error?

I'm doing standardization on dataset after applying standardization I'm converting the resulting array into dataframe and then I got this 1-dimensional error. please help me out. X_train_scaled = pd.DataFrame(X_train_scaled,…
0
votes
1 answer

Why is another term for StandardScaler is Z-Score Normalization if it uses Standardization?

I found this online: "StandardScaler or Z-Score Normalization is one of the feature scaling techniques, here the transformation of features is done by subtracting from the mean and dividing by standard deviation. This is often called Z-score…
user21150913
0
votes
0 answers

How to apply principal component analysis to standardised multicentric data?

I have a question about principal component analysis. I am working with a dataset with 2 cohorts from 2 different centres. From each center I have a control group and 2 patient subgroups (drug-resistant and drug-responsive). My objective is to…
0
votes
0 answers

R Calculating relative differences between values that contain 0

I have a dataframe dat, which shows the number of daily visitors during certain days (case) and other days (control). I want to create now a 4th column, in which the difference between the Case and Control columns is being calculated for each Var.…
Essi
  • 761
  • 3
  • 12
  • 22
0
votes
0 answers

Should I standardize the second datset with the same scaling as in the first dataset?

I am in very much confusion. I have two datasets. One dataset is considered a source domain (Dataset A) and other dataset is considered a target domain (Dataset B). First, I standardized each column of Dataset A using mean and standard deviation…
0
votes
1 answer

Standardize same-scale variables?

thinking about a problem… should you standardize two predictors that are already on the same scale (say kilograms) but may have different ranges? The model is a KNN I think you should because the model will give the predictor eith the higher range…
Zachss
  • 1
0
votes
0 answers

scikit learn: interpretation of coefficients of linear models: After standardization by StandardScaler() to data, coefficients have values > 1

I'm new to Scikit learn and I'm trying to interpret the coefficients of my logistic regression model. model = make_pipeline(StandardScaler(), LogisticRegression()) model.fit(X_train, y_train) df_coefs = pd.DataFrame(model[1].coef_.T, index =…
0
votes
0 answers

The scale of output doesn't match scale of target

I am trying to use ANN for modelling. while the target is in the range of (min = 1, max = 21), I get the output in the range of (min = 9, max = 13). the result of modelling is relatively good though (R2 = .9). I have used standardized the data but…
Saeedeh
  • 1
  • 2
0
votes
0 answers

Correct way to standardize some climate indices

I am working on the relationship between climate extremes and conflict from 2010 to 2020 and I was wondering what is the correct way to standardize the following variables : The Warm Spell Duration Index where percentile (90) for the threshold is…