TF-IDF
andCosine Similarity
is a commonly used combination for text clustering. Each document is represented by vectors of TF-IDF weights.
This is what my text book says.
With Cosine Similarity you can then compute the similarities between those documents.
But why are exactly those techniques used together?
What is the advantage?
Could for example Jaccard Similarity also be used?
I know, how it works, but I want to know, why exactly these techniques.