Asked
Active
Viewed 18 times
0
I am newbie in NLP
I am working on generating feedback for students answer and wondering what would be best evaluation metric for this case? my dataset consists of tuples, I am planning to use flan-t5 with prompting (where you add several examples as prefix) and fine-tune the model, so, BLUE doesn't seem the right evaluation metric?

HelloWorld
- 77
- 3
- 9