-1

Please guide me on how to measure the similarity of text data for clustering. For numeric data we can use Euclidean distance or some other distance measure. The first data set consists of search keywords collected from a website, and the second is a collection of snippets returned by some searches. The similarity measure should also capture similarity in meaning.

1 Answer

0

Read about tf–idf and cosine similarity.
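To make that suggestion concrete, here is a minimal sketch in plain Python. The whitespace tokenizer, the smoothed idf formula, the function names, and the three sample strings are all assumptions made for illustration, not anything from the question. Note that tf–idf with cosine similarity measures overlap in wording, so it only partly addresses the "similar in meaning" requirement.

```python
import math
from collections import Counter


def tokenize(text):
    # Naive lowercase whitespace tokenizer (an assumption; real data
    # would likely need punctuation handling and stop-word removal).
    return text.lower().split()


def tfidf_vectors(docs):
    """Return one sparse tf-idf vector (dict of term -> weight) per document."""
    tokenized = [tokenize(d) for d in docs]
    n_docs = len(tokenized)
    # Document frequency: in how many documents each term appears.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    vectors = []
    for tokens in tokenized:
        counts = Counter(tokens)
        total = len(tokens)
        # Relative term frequency times a smoothed idf,
        # log((1 + N) / (1 + df)) + 1, one of several common variants.
        vec = {
            term: (count / total)
            * (math.log((1 + n_docs) / (1 + df[term])) + 1.0)
            for term, count in counts.items()
        }
        vectors.append(vec)
    return vectors


def cosine_similarity(a, b):
    """Cosine of the angle between two sparse vectors stored as dicts."""
    dot = sum(weight * b.get(term, 0.0) for term, weight in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


# Hypothetical sample keywords, invented for this example.
docs = [
    "cheap flights to paris",
    "low cost flights paris",
    "python clustering tutorial",
]
vecs = tfidf_vectors(docs)
sim_01 = cosine_similarity(vecs[0], vecs[1])  # shares "flights" and "paris"
sim_02 = cosine_similarity(vecs[0], vecs[2])  # shares no terms
```

For clustering, `1 - cosine_similarity(a, b)` can serve as the distance in place of Euclidean distance.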

EricJ