How to write a stored procedure named cosine_similarity that takes in two input parameters doc1_ID (type:int) and doc2_ID (type:int), and one output parameter sim_val (type:double) to calculate the cosine-similarity value for any 2 records (corresponding to 2 documents i.e. DocIDs)?
delimiter $$
CREATE PROCEDURE cosine_similarity (IN doc1_ID int, IN doc2_ID int, OUT sim_val double)
begin'
declare var1_1 int;
declare var2_1 VARCHAR(100);
declare var1_2 int;
declare var2_2 VARCHAR(100);
Select term, frequency into var1_1, var2_1 from DOCTERMFREQ where DOCID=doc1_ID;
Select term, frequency into var1_2, var2_2 from DOCTERMFREQ where DOCID=doc2_ID;
set sim_val= (SUM(var2_1 * var2_2)/ SQRT(SUM(var2_1 * var2_1))/ SQRT(SUM(var2_2 * var2_2)
end $$
delimiter ;