
I am new to NLP.
I am working on generating feedback for students' answers and am wondering what the best evaluation metric would be for this case. My dataset consists of tuples. I am planning to use Flan-T5 with prompting (where you add several examples as a prefix) and to fine-tune the model, so BLEU doesn't seem like the right evaluation metric, does it?
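To make the prompting setup concrete, here is a minimal sketch of how I picture building the prefix. The tuple layout `(question, student_answer, feedback)`, the field labels, and the function name `build_few_shot_prompt` are placeholders I made up for illustration, so treat them as assumptions rather than a fixed format.

```python
# Hypothetical tuple layout: (question, student_answer, feedback).
# The labels "Question", "Student answer", "Feedback" are illustrative only.
def build_few_shot_prompt(examples, question, student_answer):
    """Prefix the new input with solved examples and leave the feedback open."""
    blocks = []
    for ex_question, ex_answer, ex_feedback in examples:
        blocks.append(
            f"Question: {ex_question}\n"
            f"Student answer: {ex_answer}\n"
            f"Feedback: {ex_feedback}"
        )
    # The final block has no feedback, so the model is expected to complete it.
    blocks.append(
        f"Question: {question}\n"
        f"Student answer: {student_answer}\n"
        f"Feedback:"
    )
    return "\n\n".join(blocks)


if __name__ == "__main__":
    # Made-up example data, only to show the shape of the prompt.
    demo_examples = [("What is 2 + 2?", "5", "Check your addition.")]
    print(build_few_shot_prompt(demo_examples, "What is 3 + 3?", "6"))
```

The resulting string would be passed to the model as input, and the generated feedback is what I need to score against the reference feedback.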
HelloWorld
