I'm using the ParagraphVector tool in DeepLearning4j framework. What I'm doing is training a model on a set of text documents and then calculating the similarity between those documents.
Now, as the reference page (http://deeplearning4j.org/word2vec) says, the metric used by the tool to calculate similarity is cosine similarity, which should be included between 0 and 1. However, for some pair of documents, I get negative scores.
Can anybody tell why is that?
Thank you in advance.