Supervised fine-tuning in a pre-trained language model

Asked Feb 23 '23 at 17:26

Active Aug 07 '23 at 21:02

Viewed 137 times

Supervised fine-tuning adds an extra output layer to the pre-trained model.

Does this extra layer alter the probabilities of words that are not related to the fine-tuning data?
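
To make the question concrete, here is a toy sketch in plain Python. It is not a real language model: the four-word vocabulary, the logits, and the additive shift standing in for the extra layer are all made up for illustration. It assumes the extra layer only changes the logits of words related to the fine-tuning data, then compares the softmax probabilities before and after.

```python
import math


def softmax(logits):
    """Convert a list of logits into probabilities that sum to 1."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


# Hypothetical vocabulary and logits from a "pre-trained" model.
vocab = ["cat", "dog", "tax", "audit"]
pretrained_logits = [2.0, 1.5, 0.5, 0.0]

# Hypothetical effect of the extra layer: it only raises the logits of
# words related to the fine-tuning data ("tax" and "audit").
extra_layer_shift = [0.0, 0.0, 2.0, 2.0]
finetuned_logits = [l + s for l, s in zip(pretrained_logits, extra_layer_shift)]

before = softmax(pretrained_logits)
after = softmax(finetuned_logits)

# Print the probability of each word before and after the shift.
for word, p0, p1 in zip(vocab, before, after):
    print(f"{word:>5}: before={p0:.3f} after={p1:.3f}")
```

In this toy case the probabilities of "cat" and "dog" go down even though their logits are untouched, because softmax renormalizes over the whole vocabulary. What I would like to know is whether the same thing happens in an actual fine-tuned model.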

edited Aug 07 '23 at 21:02

Nick ODell

asked Feb 23 '23 at 17:26

Chen APD

Can you please elaborate on your question? – Ashwin Geet D'Sa Feb 24 '23 at 08:27
Please provide enough code so others can better understand or reproduce the problem. – Community Feb 24 '23 at 18:33

0 Answers