Highest Voted 'gpt-2' Questions

0

votes

1 answer

Why is my streamlit app not correctly summarizing my mp3-transcription?

I'm working on a Streamlit app that processes MP3 files. The main steps include: Uploading an MP3 file. Splitting the audio into smaller chunks using pydub. Transcribing these chunks using OpenAI. Summarizing the transcriptions using transformers…

asked Aug 16 '23 at 23:55

Patrick Schmidt

1

0

votes

1 answer

How do I fix GPT2 Tokenizer error in Langchain map_reduce (LLama2)?

I'm using the AWS SageMaker JumpStart model for Llama 2 13B: meta-textgeneration-llama-2-13b-f. On running a LangChain summarize chain with chain_type="map_reduce", I get the error below. I do not have access to https://huggingface.co from my environment.…

amazon-sagemaker langchain gpt-2 llama

asked Aug 15 '23 at 20:52

apprunner2186

217
1
6

0

votes

0 answers

ValueError: Expected input batch_size (1052) to match target batch_size (508) when fine tuning GPT 2 model

Hello there, I'm attempting to train a GPT-2 model to summarize passages without compromising their emotional impact. Consider summarizing a chapter from a book, where we want the reader to experience the same emotions as in the chapter itself. I…

nlp tokenize huggingface-tokenizers dataloader gpt-2

asked Aug 10 '23 at 15:36

Damika

622
2
8
17

0

votes

0 answers

'utf-8' codec can't decode byte 0xc3 error when using TextVectorization from tensorflow.keras.layers

I am trying to execute the steps given in a blog post (https://stackabuse.com/gpt-style-text-generation-in-python-with-tensorflowkeras/) but I am getting the error in the block below: vectorize_layer.adapt(text_list) vocab =…

python-3.x tensorflow keras gpt-2

asked Jul 28 '23 at 12:18

Vaibhav

102
1
9

0

votes

0 answers

GPT-2 PyTorch Custom Training

python machine-learning pytorch gpt-2 llm

asked Jul 27 '23 at 22:15

Yuki Arimo

38
6

0

votes

0 answers

tf.compat.v1.estimator.Estimator(): NameError: name 'model_fn' is not defined. Getting errors in add_argument as well. Paths mentioned are not recognized

I am trying to create a pet LLM using GPT-2 following instructions here: https://thomascherickal.medium.com/how-to-create-your-own-llm-model-2598615a039a The code gives a syntax error while calling tf.compat.v1.estimator.Estimator() with model_fn as…

python tensorflow keras gpt-2 llm

asked Jul 23 '23 at 03:21

Mohit Agrawal

1

0

votes

0 answers

Generating Sentences with TRL while Maintaining Sentiment - Issue with "AutoModelForCausalLMWithValueHead"

I am currently working on generating sentences with TRL (Transformer Reinforcement Learning) while preserving the same sentiment as the sample sentences. However, I've come across an issue with the TRL code that uses…

reinforcement-learning gpt-2 text-generation

asked Jul 19 '23 at 18:29

user11849691

41
4

0

votes

1 answer

Hugging Face Inference API returning short generated text with GPT-2 model

I'm using the Hugging Face API with a GPT-2 model to generate text based on a prompt. However, I'm encountering an issue where the generated text is consistently too short, even though I'm specifying a maximum number of new tokens and using other…

javascript artificial-intelligence huggingface gpt-2

asked Jul 17 '23 at 12:07

ma.ca

1
1

0

votes

1 answer

How to generate text using GPT2 model with Huggingface transformers?

I wanted to use GPT2Tokenizer and AutoModelForCausalLM for generating (rewriting) sample text. I have tried transformers==4.10.0, transformers==4.30.2 and --upgrade git+https://github.com/huggingface/transformers.git; however, I get the error of…

python huggingface-transformers huggingface gpt-2 llm

asked Jul 11 '23 at 15:10

user11849691

41
4

0

votes

0 answers

Non-meaningful response from finetuned GPT-2 model

I am experimenting with the abilities of GPT-2 for question answering, aiming at making a good task-based chatbot. I trained my model on the air_dialogue dataset from Hugging Face: https://huggingface.co/datasets/air_dialogue. I used the code from…

python pytorch gpt-2 fine-tune

asked Jun 26 '23 at 20:37

Chukwujike

11
4

0

votes

0 answers

FineTune GPT2 on Insurance Domain data

I am new to LLMs and I am trying to fine-tune GPT2 from Hugging Face on insurance domain data. I am not getting results from the trained data; instead, I am getting different results. My training data is a Word document (The content is not like Question…

dns huggingface-transformers gpt-2 fine-tune

asked Jun 26 '23 at 13:57

vjsat2k18

1

0

votes

0 answers

How to train gpt2 model to learn from the training text I have given?

I'm trying to train and fine-tune my gpt2 model with my own sample training document. I'm using code similar to this: https://www.kaggle.com/code/changyeop/how-to-fine-tune-gpt-2-for-beginners . But the text generated is not related to any text…

huggingface-transformers training-data huggingface gpt-2 fine-tune

asked Jun 21 '23 at 16:28

Bhavani Priya

39
4

0

votes

1 answer

Unsupervised fine-tuning on custom documents after the supervised fine tuning on general question-answers dataset. Will it be useful for GPT-2 model?

I know the formal way of training a GPT2 model on custom documents is to first do semi-supervised fine tuning on the text of the documents followed by supervised fine-tuning on question answers from the same documents. But the sole purpose of…

pre-trained-model gpt-2 large-language-model semisupervised-learning generative-pretrained-transformer

asked Jun 16 '23 at 10:51

Ratna Sambhav

1

0

votes

0 answers

tf.compat.v1.estimator.Estimator(): NameError: name 'model_fn' is not defined

I am trying to create a pet LLM using GPT-2 following instructions here: https://thomascherickal.medium.com/how-to-create-your-own-llm-model-2598615a039a The code gives a syntax error while calling tf.compat.v1.estimator.Estimator() with model_fn as…

python tensorflow gpt-2 llm

asked Jun 15 '23 at 06:31

sm535

587
7
20

0

votes

0 answers

TypeError: argmax(): argument 'input' (position 1) must be Tensor, not numpy.ndarray

I am training a GPT-2 model using my curated dataset and getting the following error. Whenever I try to debug an issue, a new error comes up. Can anyone help me fix my script so that it can run? The training process starts but later gets many…

nlp gpt-2

asked Jun 12 '23 at 15:01

Sushant Pandey

1
1
