Questions tagged [feature-extraction]

In pattern recognition and in image processing, feature extraction is a special form of dimensionality reduction. Transforming the input data into the set of features is called feature extraction. If the features extracted are carefully chosen it is expected that the features set will extract the relevant information from the input data in order to perform the desired task using this reduced representation instead of the full size input.

Feature extraction involves simplifying the amount of resources required to describe a large set of data accurately. When performing analysis of complex data one of the major problems stems from the number of variables involved. Analysis with a large number of variables generally requires a large amount of memory and computation power or a classification algorithm which overfits the training sample and generalizes poorly to new samples. Feature extraction is a general term for methods of constructing combinations of the variables to get around these problems while still describing the data with sufficient accuracy.

Best results are achieved when an expert constructs a set of application-dependent features. Nevertheless, if no such expert knowledge is available general dimensionality reduction techniques may help.

Source: Wikipedia

1664 questions
6
votes
3 answers

How to extract feature vector from single image in Pytorch?

I am attempting to understand more about computer vision models, and I'm trying to do some exploring of how they work. In an attempt to understand how to interpret feature vectors more I'm trying to use Pytorch to extract a feature vector. Below is…
user2355903
  • 593
  • 2
  • 8
  • 29
6
votes
0 answers

Visualize 1D CNN Feature Importance for Time Series Sequences

I am trying to extract feature importance from my 1D CNN. Most of the online documentation refers to 2D, 3D, image data and classification problems. I have a multivariate time series that outputs time series sequences. I have tried Shaply and…
6
votes
1 answer

Explanation of feature descriptors in computer vision and machine learning

I've started working with computer vision techniques quite a bit, mainly deep learning but I want to try and get a good understanding of the more traditional techniques as well for a good grounding. I have been playing around with some manual…
6
votes
1 answer

Getting TypeError: '(slice(None, None, None), array([0, 1, 2, 3, 4]))' is an invalid key

Trying to use BorutaPy for feature selection. but getting a TypeError: '(slice(None, None, None), array([0, 1, 2, 3, 4]))' is an invalid key. from sklearn.ensemble import RandomForestClassifier from boruta import BorutaPy rf =…
6
votes
1 answer

Any workaround to find optimal threshold for filtering raw features based on correlation matrix in R?

I intended to extract highly correlated features by measuring its Pearson correlation, and I got a correlation matrix by doing that. However, for filtering high correlated features, I selected correlation coefficient arbitrarily, I don't know the…
Jerry07
  • 929
  • 1
  • 10
  • 28
6
votes
1 answer

How to handle text classification problems when multiple features are involved

I am working on a text classification problem where multiple text features and need to build a model to predict salary range. Please refer the Sample dataset Most of the resources/tutorials deal with feature extraction on only one column and then…
Chetan Ambi
  • 159
  • 3
  • 9
6
votes
2 answers

Selecting a Specific Number of Features via Sklearn's RFECV (Recursive Feature Elimination with Cross-validation)

I'm wondering if it is possible for Sklearn's RFECV to select a fixed number of the most important features. For example, working on a dataset with 617 features, I have been trying to use RFECV to see which 5 of those features are the most…
6
votes
2 answers

How to extract keypoints from Harris Corner Detector using Opencv

First I would use cv::cornerHarris() to detect the corners (which i could do easily). Second I want to extract keypoints from Harris detector and store them in std::vector (which i have no idea how to do). I will use this later to…
HM Moniruzzaman
  • 135
  • 1
  • 11
6
votes
1 answer

How to save resulted face landmark image in dlib?

I am using dlib's face_landmark_detection_ex.cpp which display the detected face image and all face landmarks on the original image. I want to save the original image with all 68 face face landmarks to my computer. I know it can be done by save_png…
user3751794
6
votes
1 answer

PYTHON: How to pass tokenizer with keyword arguments to scikit's CountVectorizer?

I have a custom tokenizer function with some keyword arguments: def tokenizer(text, stem=True, lemmatize=False, char_lower_limit=2, char_upper_limit=30): do things... return tokens Now, how can I can pass this tokenizer with all its…
JRun
  • 669
  • 1
  • 10
  • 17
6
votes
2 answers

Feature space reduction for tag prediction

I am writing a ML module (python) to predict tags for a stackoverflow question (tag + body). My corpus is of around 5 million questions with title, body and tags for each. I'm splitting this 3:2 for training and testing. I'm plagued by the curse of…
6
votes
1 answer

Does it make sense to use both countvectorizer and tfidfvectorizer as feature vectors for text clustering with KMeans?

I am trying to build out my feature vectors from my csv file which contain about 1000 comments. One of my feature vector is tfidf using scikit learn's tfidf vectorizer. Does it make sense to also use count as a feature vector or is there a better…
jxn
  • 7,685
  • 28
  • 90
  • 172
6
votes
1 answer

BRISK and FREAK descriptors in MATLAB

I'm using MATLAB 2014a for extracting BRISK and FREAK descriptors. I couldn't find any option to reduce the bit string size to 128 or 256, they both generated as 512 bits. I think FREAK does not allow this by algorithm but BRISK should allow. Does…
guneykayim
  • 5,210
  • 2
  • 29
  • 61
6
votes
1 answer

Can i find PCA of a single image? - MATLAB

I am doing face recognition using PCA and SVM. My training set has an array of 400 images on which i have performed PCA and mapped the data into the eigenspace. Now for testing i have only a single image whose principal components i need to extract…
Sid
  • 249
  • 5
  • 16
6
votes
4 answers

Match Sketch(Drawing) face photo to digital color photo

I'm going to match the sketch face (drawing photo) in to the color photo. so for the research i want to find out what are the challenges that matching sketch drawing in to color faces. for now i have find out that resolution pixel difference…