In machine learning and statistics, dimensionality reduction or dimension reduction is the process of reducing the number of random variables under consideration, and can be divided into feature selection and feature extraction.
Questions tagged [dimensionality-reduction]
422 questions
-2 votes, 1 answer
Applying PCA to a specific column of a pandas DataFrame
I'm trying to reduce the number of features in a dataset of images so that cosine similarity computes faster.
I have a pandas DataFrame with the columns ["url", "cluster_id", "features"] that contains 81 rows.
I would like to…

Copp
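
For the question above, one possible approach is sketched below. It is only an illustration under stated assumptions, not the asker's code or the accepted answer: it assumes each entry of the "features" column is a 1-D numeric vector of equal length (512 here is a made-up size), stacks the column into a 2-D array, and runs PCA through NumPy's SVD. The helper names `pca_reduce` and `cosine_similarity_matrix` are hypothetical.

```python
import numpy as np

# Hypothetical stand-in for the "features" column: 81 rows, one vector per row.
# With a real DataFrame the stacking step would be:
#     X = np.vstack(df["features"].to_numpy())
rng = np.random.default_rng(0)
features = [rng.normal(size=512) for _ in range(81)]  # 512 is an assumed length
X = np.vstack(features)  # shape (81, 512)


def pca_reduce(X, n_components):
    """Project the rows of X onto their top principal components via SVD."""
    X_centered = X - X.mean(axis=0)
    # Rows of Vt are the principal axes, ordered by decreasing variance.
    _, _, Vt = np.linalg.svd(X_centered, full_matrices=False)
    return X_centered @ Vt[:n_components].T


def cosine_similarity_matrix(Z):
    """Pairwise cosine similarity between the rows of Z."""
    Z_unit = Z / np.linalg.norm(Z, axis=1, keepdims=True)
    return Z_unit @ Z_unit.T


X_reduced = pca_reduce(X, n_components=32)         # shape (81, 32)
similarity = cosine_similarity_matrix(X_reduced)   # shape (81, 81)
```

With only 81 rows, centering leaves at most 80 informative components, so `n_components` should stay below that. The reduced vectors could be written back with something like `df["features"] = list(X_reduced)`, again assuming the column layout described in the question.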
-2 votes, 1 answer
Is there a method for reducing very high-dimensional data for a regression model?
I have a training data set that is so large, about 15 GB, that I can't fit it into my computer's memory to train any type of regression model. Each sample in the data set is a 4D matrix. You can flatten each sample into a vector, but the dimension of…

tanjing
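
The excerpt above is truncated, so the following is only a sketch of one option among several, not the method from the question or its answer. It uses a Gaussian random projection, which needs no fitting pass over the data, and applies it batch by batch so that only one batch is in memory at a time. The sample shape, batch sizes, and the helpers `make_projection`, `reduce_in_batches`, and `fake_batches` are all hypothetical; `fake_batches` stands in for reading chunks from disk.

```python
import numpy as np


def make_projection(n_features, n_components, seed=0):
    """Gaussian random projection matrix.

    Scaling by 1/sqrt(n_components) preserves squared norms in expectation.
    """
    rng = np.random.default_rng(seed)
    return rng.normal(size=(n_features, n_components)) / np.sqrt(n_components)


def reduce_in_batches(batches, projection):
    """Flatten each batch of 4D samples and project it, one batch at a time."""
    for batch in batches:
        flat = batch.reshape(batch.shape[0], -1)  # (batch_size, n_features)
        yield flat @ projection                   # (batch_size, n_components)


def fake_batches(n_batches, batch_size, sample_shape, seed=1):
    """Hypothetical data source; a real one would load chunks from disk."""
    rng = np.random.default_rng(seed)
    for _ in range(n_batches):
        yield rng.normal(size=(batch_size, *sample_shape))


sample_shape = (4, 5, 6, 7)                # assumed 4D sample shape
n_features = int(np.prod(sample_shape))    # 840 values per flattened sample
projection = make_projection(n_features, n_components=50)

reduced = np.vstack(
    list(reduce_in_batches(fake_batches(3, 10, sample_shape), projection))
)
```

The reduced batches are small enough to collect or to feed into a regression model that supports incremental training. If a data-dependent projection is preferred, scikit-learn's `IncrementalPCA` with `partial_fit` is another route, at the cost of an extra pass over the data.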