Questions tagged [scikit-multilearn]

scikit-multilearn is a Python module capable of performing multi-label learning tasks. It is built on-top of various scientific Python packages (numpy, scipy) and follows a similar API to that of scikit-learn.

28 questions
5
votes
1 answer

Multilabel classification ML-kNN vs KNN

This might be a stupid question but I was just wondering what the difference between ML-KNN implemented in scikit.ml and scikit-learn's KNeighborsClassifier is. According to sklearn's docs KNeighborsClassifier has support for multilabel…
4
votes
5 answers

How to install scikit-multilearn using Conda

To install scikit-multilearn, I have tried the following commands with no luck: conda install scikit-multilearn or, conda install -c condo-forge scikit-multilearn The official website of scikit-multilearn propose using pip: pip install…
Sahar
  • 741
  • 1
  • 8
  • 14
4
votes
1 answer

Why are Multilabel performance results the same as independant one despite strong label correlation?

I have a dataset with 2 labels which I know there's strong correlation among these 2 labels. However, when I use scikit multilearn binary relevance which doesn't consider correlation I get very similar results as Labelpowerset classifier which…
3
votes
2 answers

TypeError: A sparse matrix was passed, but dense data is required (multilabel K nearest neighbours)

I'm having trouble with fitting an instance of an MLkNN model (from scikit-multilearn) after doing GridSearchCV (from scikit-learn). I am getting an error. Here is the appropriate code: #From MachineLearningMastery:…
StatsSorceress
  • 3,019
  • 7
  • 41
  • 82
2
votes
1 answer

'Series' object has no attribute 'getformat'

I'm trying to run MLKnn classifier over my pandas dataframe and when I try to fit the classifier I get this error message: Series object has no attribute 'getformat' Here's the code: from skmultilearn.adapt import MLkNN from sklearn.model_selection…
2
votes
0 answers

python API "scikit-multilearn" is returning "column index (1) out of range" when calling "predict_proba" method

I am using a multi-label classifier that already runs in another context. The data classifier contains data from nodes visits from a graph, and it is available…
2
votes
1 answer

How to implement MultiLabelBinarizer on this dataframe?

I have a dataframe like this: mid value label ID 192 3 176.6 [9, 6, 8, 0, 8, 8, 7, 9, 2, 19... 192 4 73.6 [9, 6, 8, 0, 8, 8, 7, 9, 2, 19... 192 5 15.8 [9, 6, 8, 0, 8, 8, 7, 9, 2, 19... 194 3 9603.2 [0, 0, 0, 0, 0, 9,…
2
votes
1 answer

error installing skmultilearn.problem_transform package

The error I get is : Could not find a version that satisfies the requirement skmultilearn.problem_transform (from versions: ) No matching distribution found for skmultilearn.problem_transform
user8840625
  • 21
  • 1
  • 4
1
vote
1 answer

How to transform multi-label to multi-class in Python?

Let's say I have the following samples with their respective multi-label Where X1,X2,X3,X4,X5,X6 are samples and Y1,Y2,Y3,Y4 are labels X1 : {Y2, Y3} x2 : {Y1} X3 : {Y2} X4 : {Y2, Y3} X5 : {Y1, Y2, Y3, Y4} X6 : {Y2} How do I transform to X1 : y1 x2…
1
vote
1 answer

Setting the parameter for multilearn in pyBKT

I want to run pyBKT on the assistment data with a multilearn model and get the warning: 'multilearn default column not specified'. This is my code: model = Model(seed = SEED, parallel = True) %time model.fit(data=df_train, skills=SKILLS,…
Katinka
  • 11
  • 1
1
vote
1 answer

Convert probability binary values of multi labels to target labels

I am trying to classify text to multi labels and it is working good but as i want to consider the predicted labels below .5 threshold, it changed predict() to predict_proba() to get the all the probalities of labels and select the values based on…
1
vote
0 answers

Saving and loading scikit-multilearn model

I cannot find a way to reliably save and load a BinaryRelevance model from the scikit-multilearn library. I am using the Keras wrapper to be able to use a custom classifier. I have tried to pickle the model, but I get the error AttributeError: Can't…
1
vote
1 answer

What is the best value for the parameter class_weight in LinearSVC?

I have a multi label data (some classes have 2 and some 10 labels)and my model is overfitting for balanced and None values.What are the best values to set for the class_weight parameter. from sklearn.svm import LinearSVC svm =…
1
vote
1 answer

What is the difference between "OneVsRestClassifier" (Scikit-learn) and "Binary Relevance" (Scikit-multilearn)?

In scikit-learn, there is a strategy called sklearn.multiclass.OneVsRestClassifier, which can be used for both multiclass and multilabel problems. According to its documentation: "In the multilabel learning literature, OvR is also known as the…
1
vote
1 answer

How to pass epoch and batch size when using label powerset in keras

I have a multi-label problem and with some research, I was able to use Label powerset in conjunction with ML algorithms. Now I want to use the Label powerset with neural network and as per the official website I can use Label powerset. But I am not…
1
2