Questions tagged [feature-selection]

In machine learning, this is the process of selecting a subset of the most relevant features to use in constructing your data model.

Feature selection is an important step to remove irrelevant or redundant features from our data. For more details, see Wikipedia.
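
As a minimal illustration of removing an irrelevant feature, the sketch below drops columns whose variance does not exceed a threshold. It uses only the Python standard library. The function name drop_low_variance, the column names, and the sample values are hypothetical and chosen only for this example. It is one simple filter among many, not a recommended general method.

```python
from statistics import pvariance


def drop_low_variance(rows, names, threshold=0.0):
    """Keep only the columns whose population variance exceeds `threshold`.

    `rows` is a list of equal-length numeric lists (one list per sample),
    `names` holds one label per column. Both are hypothetical inputs.
    """
    columns = list(zip(*rows))  # transpose: one tuple per feature
    keep = [i for i, col in enumerate(columns) if pvariance(col) > threshold]
    kept_names = [names[i] for i in keep]
    kept_rows = [[row[i] for i in keep] for row in rows]
    return kept_names, kept_rows


# Made-up data: the second column is constant, so it carries no information.
rows = [
    [1.0, 5.0, 0.2],
    [2.0, 5.0, 0.1],
    [3.0, 5.0, 0.4],
]
names = ["area", "constant", "noise"]

kept_names, kept_rows = drop_low_variance(rows, names)
print(kept_names)
```

A variance filter ignores the target entirely, so it can only catch constant or near-constant columns. Features are usually scaled differently, so a single non-zero threshold should be chosen with care.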

1533 questions
-1
votes
1 answer

Adding cross validation to a random forest regressor to see feature importance

I have the following code that does a random forest regression to see feature importance. I would like to do cross validation or k-folds. Here is my code for doing the regression, which gives me the features and their ranks. I have attempted…
nic.o
  • 61
  • 8
-1
votes
1 answer

Run and rank all combinations of features for a machine learning model

I have a train and test data set which contains 30 independent features and 1 target feature. All the features are numerical variables. An example of the train data set looks like. The test data set also has the same…
-1
votes
1 answer

Should I convert ordered numerical data to categorical for encoding, or normalize it?

I'm working on a dataset that has both numerical and categorical columns. One of the numerical columns is fare rates ($) which has just 4 distinct values (200, 400, 600 and 800). I have done feature scaling on other numerical features but I'm stuck…
-1
votes
1 answer

How to select the best features when the data sets have no target variable and feature importance can change over time?

My data sets have 200 features and 500 rows. From these, I must select the best 30 features to use in the sales prediction model instead of all 200, but the feature importance can change over time. The interesting thing is…
-1
votes
1 answer

Is a query widget available in the ArcGIS JavaScript API?

I want to allow users to find features based on a query expression, so I want to add a query widget to my project like the one in the web app builder. Is a query widget available in the ArcGIS JavaScript API?
-1
votes
1 answer

New features in dataset

I'm now in the middle of the semester and trying to understand the background of the algorithms and features. I would like to understand some theory. Suppose I have a dataset with N samples, and each sample has, for example, 5 features. I have done 3 kinds of…
-1
votes
1 answer

How to select the best feature from each group

I have four predictor variables to model a soil property. Each of these predictor variables is generated by ten different methods (four groups of ten). Is there an algorithm (in R, Python, and ...) that selects the best type from each of these…
-1
votes
1 answer

Feature selection with Decision Tree

I'm supposed to perform feature selection on my dataset (independent variables: some aspects of a patient; target variable: patient ill or not) using a decision tree. After that, with the selected features, I have to implement a different ML model. My…
-1
votes
1 answer

What is causing this discrepancy between the metric displayed at Catboost.select_features's plot and the actual predictions of the fitted final model?

I'm performing feature selection with CatBoost. This is the training code: # Parameter grid params = { 'auto_class_weights': 'Balanced', 'boosting_type': 'Ordered', 'thread_count': -1, 'random_seed': 24, …
Miguel 2488
  • 1,410
  • 1
  • 20
  • 41
-1
votes
1 answer

Unable to downgrade/uninstall shap (windows)

I want to downgrade shap from version 0.39.0 to 0.34.0. I tried !pip uninstall shap, and I am unable to downgrade even with the syntax given below:
Gaurab Das
  • 21
  • 2
-1
votes
1 answer

Removing features based on variance

I am creating a model using an advanced regression house price dataset. It has 37 numerical features. I want to perform feature selection by removing features with zero or very low variance. I used Variance Threshold, and it didn't remove any…
Mohamed Abdullah
  • 129
  • 1
  • 1
  • 8
-1
votes
1 answer

How to plot a scatter plot to understand the general trend in data when we have multiple features

Here, the features are X_train and the target is y_train. When a dataset has n features, how do we select the one feature to plot against the target variable in a scatter plot to understand the general trend of the training data, to select a…
-1
votes
1 answer

Data pre-processing and feature engineering

I have been doing some reading on data pre-processing and feature engineering, including feature selection, feature importance and feature construction. My understanding is that feature engineering is applied in the data pre-processing stage. Additionally,…
-1
votes
1 answer

Feature Extraction Using Representation Learning

I'm new to machine learning, and I've been given a task where I'm asked to extract features from a data set with continuous data using representation learning (for example a stacked autoencoder). Then I'm to combine these extracted features with the…
-1
votes
1 answer

Keep Feature Definitions in Dictionary and Return the feature to the client

I have the dictionary below for keeping feature definitions as strings. features = { "journey_email_been_sent_flag": "F.when(F.col('email_14days') > 0,F.lit(1)).otherwise(F.lit(0))", "journey_opened_flag": "F.when(F.col('opened_14days') > 0,…
sbs
  • 43
  • 10