Questions tagged [classification]

In machine learning and statistics, classification is the problem of identifying which of a set of categories a new observation belongs to, on the basis of a training set of data containing observations whose category membership (label) is known.

In machine learning and statistics, classification refers to the problem of predicting category memberships based on a set of pre-labeled examples. It is thus a type of supervised learning.

Some of the most important classification algorithms are support vector machines [svm], logistic regression, naive Bayes, random forest [random-forest] and artificial neural networks [neural-network].

When we wish to associate inputs with continuous values in a supervised framework, the problem is instead known as regression. The unsupervised counterpart to classification is known as clustering (or cluster analysis), and involves grouping data into categories based on some measure of inherent similarity.
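
As an illustration of the supervised setting described above, here is a minimal sketch of a nearest-centroid classifier in plain Python. The function names (`fit_centroids`, `predict`) and the toy data are hypothetical and chosen only for this example; it is one of the simplest possible classifiers, not one of the algorithms listed above.

```python
from math import dist  # Euclidean distance, available in Python 3.8+


def fit_centroids(points, labels):
    """Average the labeled training points of each class into one centroid."""
    sums, counts = {}, {}
    for point, label in zip(points, labels):
        acc = sums.setdefault(label, [0.0] * len(point))
        for i, value in enumerate(point):
            acc[i] += value
        counts[label] = counts.get(label, 0) + 1
    return {
        label: tuple(v / counts[label] for v in acc)
        for label, acc in sums.items()
    }


def predict(centroids, point):
    """Return the label whose centroid lies closest to the new observation."""
    return min(centroids, key=lambda label: dist(centroids[label], point))


# Hypothetical pre-labeled examples (the training set).
train_points = [(1.0, 1.0), (1.0, 2.0), (8.0, 8.0), (9.0, 8.0)]
train_labels = ["small", "small", "large", "large"]

centroids = fit_centroids(train_points, train_labels)
print(predict(centroids, (2.0, 1.5)))
```

Replacing the class labels with continuous targets would turn this into a regression problem, and dropping the labels altogether would leave a clustering problem.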

7859 questions

3 answers

GBM R function: get variable importance separately for each class

I am using the gbm function in R (gbm package) to fit stochastic gradient boosting models for multiclass classification. I am simply trying to obtain the importance of each predictor separately for each class, like in this picture from the Hastie…

r machine-learning classification data-mining gbm

asked Apr 14 '15 at 20:49

Antoine

1 answer

Finding K-nearest neighbors and its implementation

I am working on classifying simple data using KNN with Euclidean distance. I have seen an example of what I would like to do, done with the MATLAB knnsearch function, as shown below: load fisheriris x =…

matlab machine-learning classification knn

asked Dec 15 '14 at 00:58

Young_DataAnalyst
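
For readers unfamiliar with the technique this question asks about, the following is a minimal sketch of k-nearest-neighbor classification with Euclidean distance in plain Python. It is not the MATLAB `knnsearch` code from the question; the function name `knn_predict` and the toy data are assumptions made for illustration.

```python
from collections import Counter
from math import dist  # Euclidean distance, available in Python 3.8+


def knn_predict(train_points, train_labels, query, k=3):
    """Classify `query` by majority vote among its k nearest training points."""
    # Sort all training examples by their Euclidean distance to the query.
    neighbors = sorted(
        zip(train_points, train_labels),
        key=lambda pair: dist(pair[0], query),
    )[:k]
    # Count the labels of the k closest examples and return the most common.
    votes = Counter(label for _, label in neighbors)
    return votes.most_common(1)[0][0]


# Hypothetical two-class training data.
train_points = [(1.0, 1.0), (1.5, 1.2), (0.8, 0.9),
                (5.0, 5.2), (5.5, 4.8), (4.9, 5.1)]
train_labels = ["A", "A", "A", "B", "B", "B"]

print(knn_predict(train_points, train_labels, (1.1, 1.0)))
print(knn_predict(train_points, train_labels, (5.1, 5.0)))
```

Sorting the whole training set is fine for small data; larger data sets usually call for a spatial index such as a k-d tree.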

5 answers

How to approach machine learning problems with high dimensional input space?

How should I approach a situation when I try to apply some ML algorithm (classification, to be more specific, SVM in particular) over some high dimensional input, and the results I get are not quite satisfactory? 1, 2 or 3 dimensional data can be…

machine-learning classification svm

asked Feb 12 '10 at 23:42

sold

9 answers

Determine whether the two classes are linearly separable (algorithmically in 2D)

There are two classes, let's call them X and O. A number of elements belonging to these classes are spread out in the xy-plane. Here is an example where the two classes are not linearly separable. It is not possible to draw a straight line that…

algorithm math machine-learning classification

asked Mar 19 '12 at 22:58

Håvard Geithus

3 answers

What's the best open-source Java Bayesian spam filter library?

In other answers at Stack Overflow it's been suggested that Weka is good, but there are others (Classifier4j, jBNC, Naiban). Does anyone have actual experience with these?

java machine-learning spam-prevention classification bayesian

asked Jan 26 '09 at 17:47

Jason Cohen

1 answer

Neural Network Ordinal Classification for Age

I have created a simple neural network (Python, Theano) to estimate a person's age based on their spending history from a selection of different stores. Unfortunately, it is not particularly accurate. The accuracy might be hurt by the fact that the…

machine-learning neural-network classification regression theano

asked Jul 14 '16 at 13:19

A. Dev

1 answer

How to implement pixel-wise classification for scene labeling in TensorFlow?

I am working on a deep learning model using Google's TensorFlow. The model should be used to segment and label scenes. I am using the SiftFlow dataset which has 33 semantic classes and images with 256x256 pixels. As a result, at my final layer…

computer-vision classification tensorflow scene labeling

asked Feb 10 '16 at 13:49

Gooshan

3 answers

Monitor training/validation process in Caffe

I'm training the Caffe Reference Model for classifying images. My work requires me to monitor the training process by drawing a graph of the model's accuracy after every 1000 iterations on the entire training set and validation set, which have 100K and 50K…

c++ classification deep-learning caffe conv-neural-network

asked Aug 13 '15 at 01:43

DucCuong

5 answers

What is the difference between classification and prediction?

What is the difference between classification and prediction in machine learning?

machine-learning classification prediction definition

asked Apr 15 '15 at 15:57

James

2 answers

Sentiment analysis with NLTK python for sentences using sample data or webservice?

I am embarking upon an NLP project for sentiment analysis. I have successfully installed NLTK for Python (seems like a great piece of software for this). However, I am having trouble understanding how it can be used to accomplish my task. Here is my…

nlp nltk weka classification

asked May 14 '10 at 07:04

Ke.

2 answers

Dealing with the class imbalance in binary classification

Here's a brief description of my problem: I am working on a supervised learning task to train a binary classifier. I have a dataset with a large class imbalance: 8 negative instances for every positive one. I use the f-measure, i.e. the…

python r machine-learning classification

asked Oct 06 '14 at 17:14

blueSurfer
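
To illustrate why a measure such as the F-measure is often preferred over plain accuracy on imbalanced data, here is a minimal sketch in plain Python. The 8:1 ratio comes from the question; the function name `precision_recall_f1` and the "always negative" baseline are assumptions added for this example.

```python
def precision_recall_f1(y_true, y_pred, positive=1):
    """Compute precision, recall and F1 for the positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    # Guard against empty denominators when nothing is predicted positive.
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if precision + recall else 0.0)
    return precision, recall, f1


# One positive instance for every eight negatives, as in the question.
y_true = [1] + [0] * 8
# A trivial baseline that always predicts the majority (negative) class.
always_negative = [0] * len(y_true)

accuracy = sum(t == p for t, p in zip(y_true, always_negative)) / len(y_true)
precision, recall, f1 = precision_recall_f1(y_true, always_negative)

print(f"accuracy={accuracy:.3f} f1={f1:.3f}")
```

The baseline is right on eight of nine instances, yet its F1 for the positive class is zero because it never finds a positive instance.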

3 answers

Naive Bayes: Imbalanced Test Dataset

I am using the scikit-learn Multinomial Naive Bayes classifier for binary text classification (the classifier tells me whether the document belongs to category X or not). I use a balanced dataset to train my model and a balanced test set to test it and…

python machine-learning classification scikit-learn text-classification

asked Jun 23 '14 at 13:25

Erol

2 answers

Learning Weka on the Command Line

I am fairly new to Weka and even newer to Weka on the command line. I find the documentation is poor and I am struggling to figure out how to do a few things. For example, I want to take two .arff files, one for training, one for testing and get an…

machine-learning classification weka

asked Mar 15 '13 at 20:21

Reily Bourne

2 answers

How to read the classifier confusion matrix in WEKA

Sorry, I am new to WEKA and just learning. In my decision tree (J48) classifier output, there is a confusion matrix:

      a   b   <----- classified as
    130   8   a = functional
     15 150   b = non-functional

How do I read this matrix? What's the…

classification weka decision-tree

asked Mar 05 '13 at 01:21

JakeSays
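
As a rough guide to reading a matrix like the one in this question, here is a minimal sketch in plain Python. It assumes the usual WEKA layout, in which each row is the actual class and each column is the class the instance was classified as; the function name `summarize` is hypothetical.

```python
# Rows are actual classes, columns are predicted ("classified as") classes.
labels = ["functional", "non-functional"]
matrix = [
    [130, 8],    # actual functional: 130 correct, 8 classified as non-functional
    [15, 150],   # actual non-functional: 15 classified as functional, 150 correct
]


def summarize(matrix, labels):
    """Derive accuracy plus per-class precision and recall from the matrix."""
    total = sum(sum(row) for row in matrix)
    correct = sum(matrix[i][i] for i in range(len(labels)))
    report = {"accuracy": correct / total}
    for i, label in enumerate(labels):
        actual = sum(matrix[i])                   # row total
        predicted = sum(row[i] for row in matrix)  # column total
        report[label] = {
            "recall": matrix[i][i] / actual,
            "precision": matrix[i][i] / predicted,
        }
    return report


report = summarize(matrix, labels)
print(report)
```

The diagonal holds the correctly classified instances (130 and 150); every off-diagonal cell is an error, read as "row class mistaken for column class".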

1 answer

Extract tf-idf vectors with lucene

I have indexed a set of documents using Lucene. I have also stored the DocumentTermVector for each document's content. I wrote a program and got the term frequency vector for each document, but how can I get the tf-idf vector of each document? Here is my code…

java lucene classification

asked Feb 08 '12 at 07:14

orezvani
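
The step from term-frequency vectors to tf-idf vectors can be sketched in plain Python as follows. This is not Lucene code and does not reproduce Lucene's own scoring; it assumes the textbook weighting tf × log(N / df), and the function name `tfidf_vectors` and the sample counts are hypothetical.

```python
from collections import Counter
from math import log


def tfidf_vectors(term_counts):
    """Turn per-document term-frequency dicts into tf-idf dicts.

    Uses the textbook weighting tf * log(N / df), where N is the number of
    documents and df is the number of documents containing the term.
    """
    n_docs = len(term_counts)
    doc_freq = Counter()
    for counts in term_counts:
        doc_freq.update(counts.keys())  # each document counts once per term
    return [
        {term: tf * log(n_docs / doc_freq[term]) for term, tf in counts.items()}
        for counts in term_counts
    ]


# Hypothetical term-frequency vectors for two documents.
docs = [
    {"spam": 2, "offer": 1},
    {"offer": 1, "meeting": 3},
]

vectors = tfidf_vectors(docs)
print(vectors)
```

A term that occurs in every document, such as "offer" here, gets weight zero under this scheme, which is why smoothed variants of the idf term are common in practice.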
