To check similarity between text data

Question

Please guide me on how to measure the similarity of text data for clustering. For numeric data we can use Euclidean distance or some other distance measure. The first data set consists of search keywords collected from a website, and the second is a collection of snippets returned by some searches. The measure should also reflect similarity in meaning.

Read **any book on text mining**. Otherwise the answer would be as long as such a book. - Has QUIT--Anony-Mousse, Jul 01 '15 at 17:47

score 0 · Answer 1 · answered Jul 03 '15 at 10:02

Read about tf–idf and cosine similarity.
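As a rough illustration of what that suggestion amounts to, here is a minimal sketch in plain Python. The whitespace tokenizer, the smoothed idf formula, the helper names, and the sample strings are all assumptions made for this example, not something taken from the question's data; a real pipeline would likely use a library and better preprocessing.

```python
import math
from collections import Counter


def tokenize(text):
    # Naive tokenizer (assumption): lowercase and split on whitespace.
    return text.lower().split()


def tfidf_vectors(docs):
    """Return one sparse tf-idf vector (dict: term -> weight) per document."""
    tokenized = [tokenize(d) for d in docs]
    n_docs = len(tokenized)

    # Document frequency: in how many documents each term appears.
    df = Counter()
    for tokens in tokenized:
        df.update(set(tokens))

    # Smoothed idf (one common variant, chosen here as an assumption).
    idf = {t: math.log((1 + n_docs) / (1 + c)) + 1.0 for t, c in df.items()}

    vectors = []
    for tokens in tokenized:
        tf = Counter(tokens)  # raw term counts
        vectors.append({t: count * idf[t] for t, count in tf.items()})
    return vectors


def cosine_similarity(a, b):
    """Cosine of the angle between two sparse vectors stored as dicts."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


# Hypothetical sample data: two related search strings and one unrelated.
docs = [
    "cheap flights to paris",
    "cheap paris flights and hotels",
    "python list sorting tutorial",
]
vecs = tfidf_vectors(docs)
sim_related = cosine_similarity(vecs[0], vecs[1])
sim_unrelated = cosine_similarity(vecs[0], vecs[2])
print("related:", sim_related)
print("unrelated:", sim_unrelated)
```

For clustering, `1 - cosine_similarity(a, b)` can serve as the distance. Note that this only measures word overlap: two texts with the same meaning but no shared words will score 0, so it covers the "similar in meaning" requirement only partially.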

EricJ
