0

I have used K-mean Algorithm with euclidean distance to cluster my dataset, then i tried cosine distance, but the algorithm does not converge with cosine metrics (it is not stopping - iteration reach to 1000 )

any suggestion please

1 Answers1

0

Cosine should be okay, because it is equivalent to Euclidean on L2 normalized data. Do unless you have a programming error, it should converge.

So alternatively, you could L2 normalize your data, then use regular k-means instead og spherical k-means.

Note thst Mini-Batch by design doesn't converge, in case you are using that. If you sample batches, it cannot.

Has QUIT--Anony-Mousse
  • 76,138
  • 12
  • 138
  • 194