I want to do hierarchical clustering with the fastcluster module. When i the default (euclidian) distance metric, it works fine:
import fastcluster
import scipy.cluster.hierarchy
distance = spatial.distance.pdist(data)
linkage = fastcluster.linkage(distance,method="complete")
But the problem is when I want to use the "cosine similarity" as distance metric:
distance = spatial.distance.pdist(data,'cosine')
linkage = fastcluster.linkage(distance,method="complete")
The output is:
Traceback (most recent call last):
File "C:\djcode\mysite\mysite\scipytest.py", line 52, in <module>
linkage = fastcluster.linkage(distance,method="complete")
File "C:\Python33\lib\site-packages\fastcluster.py", line 245, in linkage
linkage_wrap(N, X, Z, mthidx[method])
FloatingPointError: NaN dissimilarity value.