
I have been coding my own models for some time, but then I discovered Hugging Face and started using it. I wanted to know whether I should use a pretrained model as-is or train that same Hugging Face model on my own dataset. I am trying to build a question answering model.

I have a dataset of 10k-20k questions.

Naman

1 Answer


The state-of-the-art approach is to take a pre-trained model that was pre-trained on tasks that are relevant to your problem and fine-tune the model on your dataset.

So, assuming your dataset is in English, you should take a model that was pre-trained on English text. You can then fine-tune it.

This will most likely work better than training from scratch, but you can experiment on your own. You can also load a model without the pre-trained weights in Hugging Face Transformers.
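As a sketch of the two options (assuming the Transformers library and a BERT-style model; `"bert-base-uncased"` is just one example of an English pre-trained checkpoint, not something specified in the question):

```python
from transformers import BertConfig, BertForQuestionAnswering

# Option 1 (recommended here): start from pre-trained English weights and fine-tune.
# This downloads the checkpoint, so it is commented out in this offline sketch:
# model = BertForQuestionAnswering.from_pretrained("bert-base-uncased")

# Option 2: the same architecture without the pre-trained weights,
# i.e. training from scratch with randomly initialized parameters.
config = BertConfig()  # default BERT-base hyperparameters
model = BertForQuestionAnswering(config)
```

Both paths give a model with the same architecture and the same extractive-QA head (start/end span logits); the only difference is whether the encoder weights start from the pre-trained checkpoint or from random initialization.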

Berkay Berabi
  • So I should use the original weights and fine-tune them with my dataset. And yes, the dataset is in English. – Naman Aug 12 '22 at 13:14
  • Yes, exactly. Use the pretrained weights and fine-tune them. The idea is that your model starts with weights that already contain some information about the language. This is, of course, better than starting from randomly initialized weights (i.e., from scratch). – Berkay Berabi Aug 12 '22 at 14:06
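The fine-tuning step discussed above can be sketched as a single training step on an extractive-QA model. This is a minimal, self-contained illustration: it uses a tiny randomly initialized config and fake token ids so it runs offline; a real run would load `from_pretrained(...)`, use a real tokenizer, and loop over the 10k-20k question dataset.

```python
import torch
from transformers import BertConfig, BertForQuestionAnswering

# Tiny random-weight model just to show the shape of one training step;
# real fine-tuning would start from pre-trained weights instead.
config = BertConfig(vocab_size=100, hidden_size=32, num_hidden_layers=1,
                    num_attention_heads=2, intermediate_size=64)
model = BertForQuestionAnswering(config)
optimizer = torch.optim.AdamW(model.parameters(), lr=3e-5)

# One fake batch: token ids for "question [SEP] context", plus the
# gold answer span (start/end token index) for each example.
input_ids = torch.randint(0, 100, (2, 16))   # batch of 2, sequence length 16
start_positions = torch.tensor([3, 5])
end_positions = torch.tensor([4, 7])

outputs = model(input_ids=input_ids,
                start_positions=start_positions,
                end_positions=end_positions)
outputs.loss.backward()   # cross-entropy over start and end logits
optimizer.step()
```

Whether the weights come from a checkpoint or random initialization, this step is identical; the pre-trained starting point is simply a much better place to begin descending from.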