0

I have data of hundreds of matrices of order 7X6. I want to cluster them.

I tried using simple clustering algorithm where i converted it to a point in 42 dimensional space and chose Euclidian distance metric. but results were not satisfying since it doesn't work good in higher dimensions.

  • 1
    Welcome to SO it helps if you provide code. Have a read of [How to create a Minimal, Reproducible Example](https://stackoverflow.com/help/minimal-reproducible-example). – DrBwts Aug 18 '23 at 09:07
  • Please provide enough code so others can better understand or reproduce the problem. – Community Aug 18 '23 at 09:53

1 Answers1

0

First, I'd like to mention that it would be great if we could get more information regarding the matrices. Like, are they only containing numbers, or also strings?

Something you could look into is the calculation of similarities done in the field of Recommender Systems. These similarity measures are often used to recommend media content to a user depending on similar media (often based on metadata stored in a dataframe/matrix). One popular metric for checking how similar matrices are is the Pearson Correlation metric. In this link you get further information about how to calculate it.

For clustering, you could consider using the hierarchical agglomerative clustering method with the Pearson Correlation as the criterion (see link for some description in the SO answer).