Standardization, or normalization, is a process used to make a vector of real number values have a mean of zero and a standard deviation of one. Also called standard scores or z-scores.
Questions tagged [standardization]
72 questions
0
votes
0 answers
How to standardize reported factor loadings from EFA
I hope, anyone can help me with my question.
In a paper, factor loadings (inclusive uniqueness and complexity indices) are reported in an article. An oblique rotation was used..
To my knowledge, the factor loadings can be interpreted as…

mascha
- 1
- 1
0
votes
0 answers
How to search parts of a string in a dictionary key in python and replace the matched string with dictionary
I have a database table column address and want to search for abbreviations (Address like STE,ST,AVE,BVLD, etc., and replace them all State, Street Avenue, etc.). I have the Abbreviation and Long form in a table in Snowflake and also a table having…

Nirmal Kumar Kosuru
- 49
- 1
- 7
0
votes
1 answer
Getting error while doing Standardization after Window Partitioning of Pyspark Dataframe
Dataframe:
Above is my dataframe, I want to add a new column with value 1, if first transaction_date for an item is after 01.01.2022, else 0.
To do this i use the below window.partition code:
windowSpec =…

snigdha mohapatra
- 3
- 1
- 4
0
votes
0 answers
Do we need standardization in K-prototypes algorithm
I want to use the K-prototype algorithm (a type of KNN algorithm used for mixed data :numerical and categorical data) for a clustering problem.
The algorithm handles the categorical values without numerical encoding, so I don't need to encode them…

anotherUser
- 23
- 5
0
votes
1 answer
Standardization or Normalization for selected columns
**from sklearn.preprocessing import StandardScaler
sc = StandardScaler()
X_train[:,5:7] = sc.fit_transform(X_train[:,5:7])**
Here I want to select 5th, 6th and 9th columns.
Is there a way to select the 9th column here??

Vaibhav Yalla
- 35
- 1
- 4
0
votes
0 answers
How can I report the coefficients of my regression in a way they have a reasonable scale?
I am running an OLS regression in R studio but my variables have different scales. For example variables such as my dependent variable is the log return of a stock index(too small) and my covid and vaccination variables are too big. Is there any way…
0
votes
1 answer
Does XGBoost need standardization or normalization?
In the link below, I confirmed that normalization is not required in XGBoost.
However, in the dataset we are using now, we need to use standardization
to get high performance.
Is standardization necessary if the scales between features constituting…

JeongJoo Lee
- 1
- 1
- 2
0
votes
1 answer
Create a loop with mutate() in order to get percentage values by column
In order to do an ACP I just need to "percentize" the data below (dummy data).
A <- c(1,3,4,5,2,3,4,1,3)
B <- c(1,3,7,3,7,3,7,3,6)
df <- data.frame(A, B)
df_2 <- mutate(df, A_perc = A/sum(df$A)) %>%
mutate(df, B_PERC = B/sum(df$B))
I dont…

Mathias Lauber
- 41
- 7
0
votes
0 answers
For cycle for Z score standardization
i'm a beginner in R. I would create a for cycle to standardize my variables. I have a df with 8 columns. This is my code
df_Z=df[,2:9]
for (i in df_Z) {
dfprova=(df_Z[,i] - mean(df_Z[,i])) / sd(df_Z[,i])
}
The error is shown below :
Error: Must…
0
votes
1 answer
Image Standardization
I want to ask about image preprocessing for classification tasks with cnn. I standardized the input data (mean = 0, standard deviation = 1), I got results with a range of [-1.1]. How to use that technique with result range [0,1]? so that the image…

hilyap
- 1
- 2
0
votes
1 answer
Standardize only numerical features with StandardScaler
I have the following dataset :
df=pd.read_csv('https://raw.githubusercontent.com/michalis0/DataMining_and_MachineLearning/master/data/HR_comma_sep.csv')
I encoded salary first with a label encoder le_salary, and then with an ordinal encoder…

cped
- 67
- 7
0
votes
1 answer
Adding particular rows and changing their positions
I have an example(left one in image description).
There are several indexes in the first column. However, the third (not just for the third one because I have data from more than a thousand repeating intervals) of the repeating characters is…

LoganLee
- 145
- 1
- 10
0
votes
0 answers
Standardizing dataframe with using for loops
survey_B <- data.frame("q1" = c(5, 3, 2, 7, 1, 9),
"q2" = c(4, 2, 2, 5, 1, 10),
"q3" = c(2, 1, 4, 2, 9, 10),
"q4" = c(10, 5, 2, 10, 4, 2),
sd_B2 <- t(apply(survey_B, 2, function(x)…

Matthias Prommer
- 11
- 1
0
votes
1 answer
Data normalization in Python CNN model training
I am doing a model training of CNN in Python and I have a question. I know that data normalization is important to scale the data in my dataframe between 0 and 1, but let's say I perform z-score normalization on my dataframe VERTICALLY (which means…

lalala
- 15
- 7
0
votes
1 answer
"standardize = " option in glmnet package
I have one question regarding the standardize option in a glmnet package.
I understand that scaling or standardizing dataset is necessary for the regression analysis in order to make the coefficients meaningful.
Usually, for just a linear…

KLee
- 105
- 1
- 9