I have a set of data, but it is presented in the form of logs such as v0.1.1, v0.2.3, and when I try it with a pretrained text2vec model I find it hard to pinpoint the exact version number or update date, seeing as it seems to be insensitive to the numbers, what is the best way to make the model more cognizant of the numbers? I thought of using fine-tune, but am still slowly progressing on the dataset construction.
Exchange of solutions. Papers and experiments are best.