Questions tagged [multilabel-classification]

Multi-label classification refers to the problem in Machine Learning of assigning multiple target labels to each sample, where the labels represent a property of the sample point and need not be mutually exclusive.

844 questions
0
votes
1 answer

How to feed multi label data as HDF5 input in a multi task setup?

I have a dataset with each image having around 101 labels. I know, I have to use HDF5 data layer to feed my data into the network. But the problem is that I have a multi task setup. My network has shared parameters for the first 5 layers and then…
0
votes
1 answer

Difference between binary relevance and one hot encoding?

Binary relevance is a well known technique to deal with multi-label classification problems, in which we train a binary classifier for each possible value of a feature: http://link.springer.com/article/10.1007%2Fs10994-011-5256-5 On the other side,…
0
votes
1 answer

How to treat response OneVsRestClassifier in Scikit-learn for multilabel

I'm newbie in Scikit-learn and classification. My task is a multi-label classification problem. AS I understand predict returns array with n tuples which is the same as amount of features in sample. What does it mean? How I can get strict order and…
0
votes
1 answer

how to roll a multilabel dataframe into single column in R

I have the following dataframe originalData ID Y1 Y2 Y3 X1 X2 X3 1 111 TRUE TRUE FALSE 12 junior 45.55 2 112 FALSE FALSE TRUE 15 junior 458.54 3 113 TRUE TRUE FALSE 16 senior 48.79 I would like to get the following result…
S12000
  • 3,345
  • 12
  • 35
  • 51
0
votes
1 answer

Weka Classification Project Using StringToWordVector and SMO

I am working on a project in which I have about 18 classes with about 4,000 total instances. I have 7 attributes, 1 being string data, the rest nominal. I am currently using StringToWordVector on the string attribute with Platt's SMO classifier,…
0
votes
1 answer

OneVsRestClassifier(svm.SVC()).predict() gives continous values

I am trying to use y_scores=OneVsRestClassifier(svm.SVC()).predict() on datasets like iris and titanic .The trouble is that I am getting y_scores as continous values.like for iris dataset I am getting : [[ -3.70047231 -0.74209097 2.29720159] […
0
votes
1 answer

Rapidminer Classification

I am trying to solve a simple classification problem where the label has 12 different levels and need to classify each example into one of these 12. However, I want my output to look like refer the image: https://i.stack.imgur.com/49USG.png Here;…
0
votes
1 answer

How to exclude data with 0 variance in matlab implementation of Linear discriminant analysis

I am using Matlab to classify data using LDA. mdl = fitcdiscr(dbimgs1,indx,'DiscrimType','linear'); C=predict(mdl,testimgs1); I get the following error: Predictor x741 has zero variance. Either exclude this predictor or set 'discrimType' to…
user6220686
0
votes
1 answer

Azure machine learning even sampling

I'm trying to do some basic multi-label classification in Azure ML. I have some basic data in the following format: value_x value_y label x1 y1 label1 x2 y2 label1 x3 y3 label2 ..... My problem is that in my data…
0
votes
1 answer

Python sklearn OneVsRestClassifier : Score function gives ValueError

i am working on a multilabel classification problem as import pandas as pd import pickle from sklearn.feature_extraction.text import TfidfVectorizer from sklearn.linear_model import SGDClassifier from sklearn.multiclass import…
0
votes
1 answer

OnVsRestClassifier gives 0 accuracy

i am trying to solve a multilabel classification problem as from sklearn.preprocessing import MultiLabelBinarizer traindf = pickle.load("traindata.pkl","rb")) X = traindf['Col1'] …
0
votes
0 answers

Vowpal Wabbit same results always

I am using VW to try to predict multi classes. The strangest part is that it doesn't matter which parameters I use, the result is always the same. Should that happen, maybe because of my data? Details: Around 90k lines of data. A line of the data: 1…
0
votes
1 answer

How to preprocess this floating point data to use with scikit - Machine Learning

I have dataset with 4000 features and 35 samples. All the features are floating point numbers between 1 and 3. eg: 2.68244527684596. I'm struggling to get any classifier working on this data. I have used knn, svm (with linear,rbf,poly). Then I have…
0
votes
1 answer

Run multiple Meka (Weka) classifiers - load errors to file / table object

For those of you unfamiliar with Meka - it is an extension of Weka for multi-label classifiers. Meka and Weka are VERY similar, however, and so Weka users may be able to answer this question, too. Basically, I want my results from my runs of various…
lrthistlethwaite
  • 494
  • 2
  • 6
  • 23
0
votes
3 answers

sklearn-KNearestNeighbors with Multilabels

I have a dataset with features and their labels. it looks like this: X1, X2, X3, X4, X5 .. Xn L1, L2, L3 Y1, Y2, Y3, Y4, Y5 .. Yn L5, L2 .. I want to train a KNeighborsClassifier on this dataset. It seems like sklearn does not take multilabels. I…
pg2455
  • 5,039
  • 14
  • 51
  • 78