Is there a metric to evaluate similarity between two objects, based on their attributes?

Question

Suppose I have an object X with a set of 10 features: [0, 0, 0, 0, 0, 0, 0, 0, 0, 0].

Then, I have two more objects:

A : [2, 2, 2, 2, 2, 2, 2, 2, 2, 2]
B : [0, 0, 0, 0, 0, 0, 0, 0, 0, 20]

I need to know which of A and B is "closer" to X.

The idea I have in mind behind "similarity" is:

It is better for all features to be nearly the same than for many to be very close while a few are very different.

According to this "definition", A seems closer to X than B.

However, the arithmetic mean does not seem to be the right tool to implement this idea because it is 2 for both objects.

Is there a particular metric for this kind of problem, please?

score 1 · Accepted Answer · answered Oct 17 '15 at 17:47

What about the Euclidean distance?

In your case, the Euclidean distance between A and X is the square root of 40 (approximately 6.32), and the distance between B and X is 20, so A is indeed more similar to X by that metric.
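As a quick check of the arithmetic above, here is a minimal Python sketch that computes both distances for the vectors from the question. The function name `euclidean_distance` is only illustrative; it is not taken from any library.

```python
import math

def euclidean_distance(u, v):
    """Return the Euclidean (L2) distance between two equal-length sequences."""
    if len(u) != len(v):
        raise ValueError("vectors must have the same length")
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))

# Vectors from the question.
X = [0] * 10
A = [2] * 10
B = [0] * 9 + [20]

dist_a = euclidean_distance(X, A)  # sqrt(10 * 2**2) = sqrt(40)
dist_b = euclidean_distance(X, B)  # sqrt(20**2) = 20

print(f"d(X, A) = {dist_a:.2f}")
print(f"d(X, B) = {dist_b:.2f}")
```

Because each difference is squared before summing, one large deviation (as in B) counts for more than many small ones (as in A), which matches the idea of similarity described in the question.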

jrsala


Well, this seems to be what I was looking for, thank you! – Delgan Oct 17 '15 at 17:58

score 1 · Answer 2 · edited Apr 13 '17 at 12:50

You could also consider using cosine similarity. Cosine similarity compares the directions of two vectors as seen from the origin and ignores their magnitudes, while Euclidean distance measures how far apart the two points themselves are.

Here is a great article on when to pick one over the other.
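To make the contrast with Euclidean distance concrete, here is a small Python sketch of cosine similarity. The function name and the extra vector `A_scaled` are assumptions made for illustration. Note that cosine similarity is undefined when either vector has zero length, which is the case for X in the question; this sketch simply returns `None` in that situation, which is one possible convention rather than a standard one.

```python
import math

def cosine_similarity(u, v):
    """Return the cosine of the angle between u and v, or None if undefined."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        # The angle to a zero vector is undefined (convention chosen for this sketch).
        return None
    return dot / (norm_u * norm_v)

# Vectors from the question, plus a hypothetical scaled copy of A.
X = [0] * 10
A = [2] * 10
B = [0] * 9 + [20]
A_scaled = [20] * 10

sim_x_a = cosine_similarity(X, A)              # undefined: X is the zero vector
sim_a_scaled = cosine_similarity(A, A_scaled)  # same direction, different length
sim_a_b = cosine_similarity(A, B)              # different directions

print(sim_x_a, sim_a_scaled, sim_a_b)
```

Here A and `A_scaled` point in the same direction, so their cosine similarity is 1 (up to rounding) even though they are far apart in Euclidean terms. That is why cosine similarity fits cases where only the proportions between features matter.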

Another common measure is Jaccard similarity. Here is an article comparing cosine to Jaccard similarity.
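Jaccard similarity works on sets rather than numeric vectors: it is the size of the intersection divided by the size of the union. The sketch below uses made-up attribute sets purely for illustration, and treating two empty sets as fully similar is an assumption of this sketch, since conventions differ on that case.

```python
def jaccard_similarity(s, t):
    """Return |s & t| / |s | t| for two sets."""
    s, t = set(s), set(t)
    union = s | t
    if not union:
        # Both sets are empty; returning 1.0 is a convention chosen for this sketch.
        return 1.0
    return len(s & t) / len(union)

# Hypothetical attribute sets for two objects.
obj1 = {"red", "round", "small"}
obj2 = {"red", "round", "large"}

similarity = jaccard_similarity(obj1, obj2)  # 2 shared attributes out of 4 distinct
print(similarity)
```

This suits objects described by the presence or absence of attributes. For numeric features like those in the question, a distance such as the Euclidean one is usually the more natural fit.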

score 0 · Answer 3 · edited Apr 13 '17 at 12:44

When the features are on very different scales or vary by very different amounts, the Euclidean distance has to be normalized.

This can be done using the Mahalanobis distance, which rescales the features by their variances (and, in its general form, their covariances).
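As a rough illustration of this kind of normalization, the Python sketch below implements the variance-standardized Euclidean distance. This equals the Mahalanobis distance only in the special case where the features are uncorrelated (diagonal covariance matrix); the general form needs the inverse of the full covariance matrix. The sample data, the function name, and the points `u`, `p`, and `q` are all hypothetical.

```python
import math
import statistics

def standardized_euclidean(u, v, variances):
    """Euclidean distance with each squared difference divided by that feature's variance."""
    return math.sqrt(sum((a - b) ** 2 / var for a, b, var in zip(u, v, variances)))

# Hypothetical sample: feature 0 varies little, feature 1 varies a lot.
samples = [
    [1.0, 100.0],
    [2.0, 300.0],
    [3.0, 200.0],
    [2.0, 400.0],
]
# Per-feature sample variance, estimated column by column.
variances = [statistics.variance(column) for column in zip(*samples)]

u = [2.0, 250.0]
p = [4.0, 250.0]  # differs from u by 2 in the low-variance feature
q = [2.0, 260.0]  # differs from u by 10 in the high-variance feature

d_p = standardized_euclidean(u, p, variances)
d_q = standardized_euclidean(u, q, variances)

print(f"d(u, p) = {d_p:.3f}")
print(f"d(u, q) = {d_q:.3f}")
```

With the plain Euclidean distance, p (distance 2) would be closer to u than q (distance 10). After dividing by the variances the order flips, because a difference of 10 is small for a feature that typically varies by hundreds.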

Also, see this question.


answered Oct 17 '15 at 18:39

Delgan
