Questions tagged [feature-extraction]

In pattern recognition and in image processing, feature extraction is a special form of dimensionality reduction. Transforming the input data into the set of features is called feature extraction. If the features extracted are carefully chosen it is expected that the features set will extract the relevant information from the input data in order to perform the desired task using this reduced representation instead of the full size input.

Feature extraction involves simplifying the amount of resources required to describe a large set of data accurately. When performing analysis of complex data one of the major problems stems from the number of variables involved. Analysis with a large number of variables generally requires a large amount of memory and computation power or a classification algorithm which overfits the training sample and generalizes poorly to new samples. Feature extraction is a general term for methods of constructing combinations of the variables to get around these problems while still describing the data with sufficient accuracy.

Best results are achieved when an expert constructs a set of application-dependent features. Nevertheless, if no such expert knowledge is available general dimensionality reduction techniques may help.

Source: Wikipedia

1664 questions
0
votes
1 answer

Region merging by using superpixels

I implemented SLIC algorithm to find labels and I obtained the labels. I would like to compute a color feature vector that contains the average of the color features for each region. For each pair of neighboring regions, if the Euclidean distance…
0
votes
2 answers

Is it possible to adapt the sci-kit CountVectorizer for other features (not just n-grams)?

I'm new to scikit and working with text data in general, and I've been using the sci-kit CountVectorizer as a start to get used to basic features of text data (n-grams) but I want to extend this to analyze for other features. I would prefer to adapt…
Nice-kun
  • 81
  • 1
  • 2
  • 9
0
votes
2 answers

Using Opencv to calculate the haar feature

I try to calculate Haar feature using opencv (Given an image). Input: an image output: haar feature For that, I am using the FeatureEvaluator from OpenCV. But I got an exception when I try to calculate one feature. Here is how I am…
cyh24
  • 266
  • 1
  • 3
  • 13
0
votes
0 answers

??? Attempt to reference field of non-structure array

I am working on an image retrieval project. It requires The VLFeat open source library which implements popular computer vision algorithms specializing in image understanding and local features extraction and matching. Even after successful…
0
votes
1 answer

how to calculate the feature vectors approximately for storing in excel sheet?

In my GUI I am using this matlab code to store the values in excel sheet.This code is calculating the glcm six features. function [Contrast,cor,ener,homo,Var,Entropy] = glcm_feature_extraction(I1) Contrast =…
0
votes
2 answers

Image extraction: when entering image code stops executing

I have to extract features of an image using the image extraction from MATLAB. It works successfully with the image provided by The MathWorks in their tutorial, but when I enter another image it doesn't work My code for image extraction is: boxImage…
0
votes
1 answer

Comic Balloon Detection: How can I count white pixels inside a vector Ellipse in OpenCV?

I've been looking everywhere I can for the answer but I can't find one. I'm making a comic balloon detection program and I need to find an ellipse that have a specific percentage of white inside the contour (percentage is to be decided later), thus…
0
votes
1 answer

Which feature vector does opencv's K-Nearest-Neighbor algorithm use to predict in recognition.

I am trying to develop a basic OCR for bangla character using openCV. I have used opencv's built-in CvKNearest object. everything works alright except the accuracy of recognition. I haven't used any pre-calculated feature. I have plan to use some…
Sourav
  • 21
  • 8
0
votes
0 answers

Python Scikit-learn CountVectorizer throwing ValueError: empty vocabulary

I'm trying to extract features from a text document. Here is my code: import sklearn from sklearn.datasets import load_files from sklearn.feature_extraction.text import CountVectorizer files =…
0
votes
1 answer

My observations are less than the feature vector of each. Any solution to overcome this?

I'm using GMM to fit my data to 256 Gaussians. I'm using Matlab's fitgmdist to achieve this. gmm{i} = fitgmdist(model_feats, gaussians, 'Options',statset('MaxIter',1000), ... 'CovType','diagonal', 'SharedCov',false, 'Regularize',0.01,…
0
votes
1 answer

NLP: How to correctly normalise a feature for gender classification?

NOTE Before I begin, this F-measure is not related to precision and recall, and its title and definition is taken from this paper. I have a feature known as the F-measure, which is used to measure formality in a given text. It is mostly used in…
mesllo
  • 545
  • 7
  • 29
0
votes
2 answers

K-means clustering

I want to use K-means clustering for my features which are of size 286 x 276 , So I can do clustering before using SVM. These features are of 16 different gestures. I am using MATLAB function IDX=kmeans(Feat_train,16). In IDX variable I am getting…
Frq Khan
  • 175
  • 2
  • 17
0
votes
0 answers

Metric for local texture similarity

I'm working on the following classification task: Take two patches of an image and compare for illumination invariant texture similarity. There are a couple of features which could be used to compare two image patches for texture similarity.…
tisch
  • 1,098
  • 3
  • 13
  • 30
0
votes
1 answer

PHP SVM - Feature engineering

We have data for people going to school. Things like previous education level. Interests. Gender. Military status. How do we convert these to integer/float values so we can throw them into an SVM model? Am I going about this the right way? Side…
0
votes
1 answer

PostgreSQL: Python cannot save array of strings

I have an array of strings. Each row of the string contains 1440 "numbers". Using the code below the program runs fine but when I check the feature column via pgAdmin it is empty. The column "image" which contains the path of the image and is…
user1798707
  • 351
  • 1
  • 3
  • 12
1 2 3
99
100