Say I have items i1, ..., iN
I would like to cluster them in such a way that:
- If I ran the cluster many many times the probability that items iJ and iK would end up in the same cluster is high.
- The number of clusters and cluster memberships are relatively stable regardless of cluster seeds
Are there well known algorithms to achieve this?
Clarification:
say I want 3 clusters and say:
- in reality-1 I start with i1, i33, i89 as seeds for cluster c1 c2 c3
- in reality-2 I start with i44, i55, i77 as seeds for cluster c1 c2 c3
I want the resulting clusters in both realities to be largely similar