I want to compare users based on responses to 10 questions. My original idea was to resolve each question to an integer [1, 5], but this idea won't work all the time. For example:
vec1 = [1,1,1,1,1,1,1,1,1,1]
vec2 = [5,5,5,5,5,5,5,5,5,5]
get_cos_sim(vec1, vec2) = 1
So even though the users responded completely dissimilarly, the vectors are the same.
I would like to get similar users based on similarity of their responses to each question. So for a given question, if person A's response resolved to 1 and person B's response resolved to 2, the similarity between the responses in those questions would be higher than person A's and person C's response, who answered 4.