Highest Voted 'gpt-2' Questions

2

votes

0 answers

Getting MemoryError fine-tuning GPT2(355M) model with small datasets (3MB) through aitextgen

I'm using aitextgen to fine-tune the 355M GPT-2 model using the train function. The datasets are small txt files consisting of lines like these (these are encoded texts for keyword-based text generation, hence the…

asked Jan 03 '22 at 06:34

Cephylist

21
2

2

votes

0 answers

GPT2 on apple M1 Pro chip

while trying to install GPT2 according to the instructions on the official github repo, I ended up with an Illigal hardware instruction error when I tried to use it. that means I shouldn't even think of trying GPT2 on an M1 pro chip (though the…

python tensorflow apple-m1 gpt-2

asked Nov 03 '21 at 20:33

user16510763

2

votes

1 answer

How to get onnx format from pretrained GPT2 models?

I'm trying to transform KoGPT2 model, which is pretrained by GPT2, to onnx format in order to change the model to tensorflow format. I used convert_graph_to_onnx in transformers but it didn't work because of some reasons. I don't know what this…

python tensorflow pytorch onnx gpt-2

asked Jun 13 '21 at 14:01

Sooyong

21
1

2

votes

1 answer

How to increase batch size in GPT2 training for translation task?

I am developing a code to use the pre-trained GPT2 model for a machine translation task. The length of my data's word-to-id is 91, and I developed the following code for my model: import torch from torch.utils.data import DataLoader from…

nlp pytorch gpt-2

asked May 08 '21 at 06:12

K.N

871
2
10
30

2

votes

1 answer

Mismatched tensor size error when generating text with beam_search (huggingface library)

I'm using the huggingface library to generate text using the pre-trained distilgpt2 model. In particular, I am making use of the beam_search function, as I would like to include a LogitsProcessorList (which you can't use with the generate…

python nlp pytorch huggingface-transformers gpt-2

asked Apr 22 '21 at 23:09

oregano

816
9
25

2

votes

2 answers

AttributeError: 'GPT2TokenizerFast' object has no attribute 'max_len'

I am just using the huggingface transformer library and get the following message when running run_lm_finetuning.py: AttributeError: 'GPT2TokenizerFast' object has no attribute 'max_len'. Anyone else with this problem or an idea how to fix it?…

tokenize huggingface-transformers transformer-model huggingface-tokenizers gpt-2

asked Apr 14 '21 at 10:20

m.b

45
1
4

2

votes

1 answer

Flask app serving GPT2 on Google Cloud Run not persisting downloaded files?

I have a Flask app running on Google Cloud Run, which needs to download a large model (GPT-2 from huggingface). This takes a while to download, so I am trying to set up so that it only downloads on deployment and then just serves this up for…

flask google-cloud-platform pytorch google-cloud-run gpt-2

asked Mar 30 '21 at 15:33

L Xandor

1,659
4
24
48

2

votes

2 answers

Modifying the Learning Rate in the middle of the Model Training in Deep Learning

Below is the code to configure TrainingArguments consumed from the HuggingFace transformers library to finetune the GPT2 language model. training_args = TrainingArguments( output_dir="./gpt2-language-model", #The output directory …

deep-learning pytorch huggingface-transformers language-model gpt-2

asked Feb 01 '21 at 05:42

Woody

930
9
23

2

votes

1 answer

How to use GPT-2 for topic modelling?

I want to generate topics and subtopics from a corpus. It would be great if someone could share the python code.

nlp topic-modeling bert-language-model gpt-2

asked Dec 01 '20 at 19:48

Jheel Patel

41
5

2

votes

1 answer

How to Get Rid of GPT-2 Warning Message?

Every time I run GPT-2, I am receiving this message. Is there a way I can get this to go away? Some weights of GPT2LMHeadModel were not initialized from the model checkpoint at gpt2 and are newly initialized: ['h.0.attn.masked_bias',…

python huggingface-transformers gpt-2

asked Aug 28 '20 at 00:57

Johnny

125
9

2

votes

4 answers

How can I find the probability of a sentence using GPT-2?

I'm trying to write a program that, given a list of sentences, returns the most probable one. I want to use GPT-2, but I am quite new to using it (as in I don't really know how to do it). I'm planning on finding the probability of a word given the…

python nlp probability gpt-2

asked Aug 23 '20 at 03:07

Elan SK

117
2
11

2

votes

3 answers

How to alter gpt-2 code to work with Tensorflow 2.0?

I am trying to use gpt-2 for text generation. I get compatibility errors, even after running the Tensorflow 2.0 code upgrade script. Steps I've followed: Clone repo From here on out, follow the directions in DEVELOPERS.md Run upgrade script on…

python docker tensorflow tensorflow2.0 gpt-2

asked Aug 11 '20 at 01:09

Nick

41
1
7

2

votes

1 answer

Cannot convert GPT-2 model using Tensorflow.JS

I'm trying to load a GPT-2 model on a Node.JS project. I believe this could be done using tfjs library. So I tried to convert the GPT-2 model to tfjs model. Following recommendations on this answer, I exported the GPT-2 model as SavedModel. !python3…

python tensorflow tensorflow.js gpt-2

asked Jul 08 '20 at 16:41

Mohamed Taher Alrefaie

15,698
9
48
66

2

votes

1 answer

Is gpt-2 unusable with python?

I was following this tutorial and ran across an issue while using train.py. the issue says Exception has occurred: ModuleNotFoundError No module named 'tensorflow.contrib' File "F:\PythonFiles\Post Generator\gpt-2\src\model.py", line 3, in…

python tensorflow gpt-2

asked Jun 13 '20 at 16:07

Arjun Basandrai

21
5

2

votes

0 answers

Adding tokens to GPT-2 BPE tokenizer

I want to add new words to my BPE tokenizer. I know the symbol Ġ means the end of a new token and the majority of tokens in vocabs of pre-trained tokenizers start with Ġ. Assume I want to add the word Salah to my tokenizer. I tried to add both Salah…

python nlp tokenize huggingface-transformers gpt-2

asked Jun 05 '20 at 15:56

Akim

139
6

Questions tagged [gpt-2]

References

Related Tags

Getting MemoryError fine-tuning GPT2(355M) model with small datasets (3MB) through aitextgen

GPT2 on apple M1 Pro chip

How to get onnx format from pretrained GPT2 models?

How to increase batch size in GPT2 training for translation task?

Mismatched tensor size error when generating text with beam_search (huggingface library)

AttributeError: 'GPT2TokenizerFast' object has no attribute 'max_len'

Flask app serving GPT2 on Google Cloud Run not persisting downloaded files?

Modifying the Learning Rate in the middle of the Model Training in Deep Learning

How to use GPT-2 for topic modelling?

How to Get Rid of GPT-2 Warning Message?

How can I find the probability of a sentence using GPT-2?

How to alter gpt-2 code to work with Tensorflow 2.0?

Cannot convert GPT-2 model using Tensorflow.JS

Is gpt-2 unusable with python?

Adding tokens to GPT-2 BPE tokenizer