Fastest way to compute cosine similarity in a GPU

Question

So I have a huge tfidf matrix with more than a million records, I would like to find the cosine similarity of this matrix with itself. I am using colab to run the code, but I am not sure how to best make use of the gpu provided by colab.

sequentially run code -

tfidf_matrix = tf.fit_transform(df['categories'])

cosine_similarities = linear_kernel(matrix, matrix)

Is there way we can parallelise the code using jit or any other way?

Poe Dator · Answer 1 · 2023-01-01T23:01:40.300

0

try simple torch code like in this example from sentence transformers library: https://github.com/UKPLab/sentence-transformers/blob/master/sentence_transformers/util.py#L31 or just import the function.
consider cuml library which uses CUDA acceleration https://docs.rapids.ai/api/cuml/nightly/api.html

edited Jan 01 '23 at 23:01

answered Jan 01 '23 at 22:34

Poe Dator

4,535
2
14
35

Fastest way to compute cosine similarity in a GPU

1 Answers1