Difference between binary relevance and one hot encoding?

Question

Binary relevance is a well known technique to deal with multi-label classification problems, in which we train a binary classifier for each possible value of a feature:

http://link.springer.com/article/10.1007%2Fs10994-011-5256-5

On the other side, one hot encoders (OHE) are commonly used in natural language processing to encode a categorical feature taking multiple values as a binary vector:

http://cs224d.stanford.edu/lecture_notes/LectureNotes1.pdf

Can we consider that these two concepts are the same one? Or are there technical differences?

They look fairly different to me. Why do you think they are closely related? — Has QUIT--Anony-Mousse, Aug 08 '16 at 11:07
If you use binary relevance to encode a dataset having a single label per class, it looks like you are applying one-hot encoding on each instance, the vector would be the concatenation of the binary values for all the labels. In multi-target problems, the concepts are different of course. — mountrix, Aug 09 '16 at 14:16

Sayali Sonawane · Accepted Answer · 2016-08-08T12:26:40.990

3

Both methods are different.

1. One-Hot encoding

In one-hot encoding, vector is considered.

Above diagram represents binary classification problem.

2. Binary Relevance

In binary relevance, we do not consider vector. Following diagram represents class label generation using binary relevance method which is using scalar value.

edited Aug 08 '16 at 12:26

answered Aug 08 '16 at 12:16

Sayali Sonawane

12,289
5
46
47

Thank you, this is a detailed explanation, including the powerset representation. – mountrix Aug 09 '16 at 14:17
1

@mountrix for more information you can watch this video (Udacity, Google ->Deep Learning ): https://www.youtube.com/watch?v=2Uyr93f3C2M – Sayali Sonawane Aug 09 '16 at 15:31

Difference between binary relevance and one hot encoding?

1 Answers1