0

I want to implement a content based recommendation system that provides a list of recommended books based on user input.

I`ll be using TF-IDF to determine how important a word is to a given book and will create a Book Characteristic Vector for every Book.

I need to create a similarity matrix to determine to determine possible pair of books. I came across Euclidean Distance for doing that. Any other methods better than Euclidean?

neuromouse
  • 921
  • 1
  • 12
  • 32
John Constantine
  • 1,038
  • 4
  • 15
  • 43

1 Answers1

1

These are some good distance measures you might try:

  • (generalized) Jaccard distance
  • Manhattan distance
  • Hellinger distance
  • cosine similarity
neuromouse
  • 921
  • 1
  • 12
  • 32