How to make embedding models sensitive to numbers?

Asked Aug 21 '23 at 08:02

Active Aug 21 '23 at 08:46

Viewed 16 times

-2

I have a set of data, but it is presented in the form of logs such as v0.1.1, v0.2.3, and when I try it with a pretrained text2vec model I find it hard to pinpoint the exact version number or update date, seeing as it seems to be insensitive to the numbers, what is the best way to make the model more cognizant of the numbers? I thought of using fine-tune, but am still slowly progressing on the dataset construction.

Exchange of solutions. Papers and experiments are best.

edited Aug 21 '23 at 08:46

asked Aug 21 '23 at 08:02

Omnis

Please provide enough code so others can better understand or reproduce the problem. – Community Aug 21 '23 at 16:56
Welcome to Stackoverflow! Asking for recommendations might not be appropriate on the Stackoverflow (https://stackoverflow.com/help/how-to-ask) but it might be possible to ask the question on https://softwarerecs.stackexchange.com. Also, logging it on https://stackoverflow.com/collectives/nlp/beta/discussions/76949597 – alvas Aug 25 '23 at 16:30

How to make embedding models sensitive to numbers?

0 Answers0