Questions tagged [huggingface]

The huggingface tag can be used for all libraries made by Hugging Face. Please ALWAYS use the more specific tags (huggingface-transformers, huggingface-tokenizers, huggingface-datasets) if your question concerns one of those libraries.

606 questions
3
votes
1 answer

Using a custom trained huggingface tokenizer

I’ve trained a custom tokenizer on a custom dataset using the code from the documentation. Is there a way to add this tokenizer to the Hub and use it like the other tokenizers by calling AutoTokenizer.from_pretrained()…
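
A minimal sketch for the question above, assuming the tokenizer was trained with the tokenizers library and saved as tokenizer.json (the repo id below is hypothetical):

from transformers import PreTrainedTokenizerFast, AutoTokenizer

# Wrap the raw tokenizers object so it behaves like a transformers tokenizer
fast_tokenizer = PreTrainedTokenizerFast(tokenizer_file="tokenizer.json")

# Save locally and (optionally) push to the Hub
fast_tokenizer.save_pretrained("my-custom-tokenizer")
fast_tokenizer.push_to_hub("my-username/my-custom-tokenizer")  # hypothetical repo id

# Afterwards it loads like any other tokenizer
tok = AutoTokenizer.from_pretrained("my-username/my-custom-tokenizer")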
3
votes
0 answers

Can I use LoRA and Prompt Tuning at the same time for text summarization with GPT?

LoRA inserts and learns low-rank decomposition matrices that approximate updates to the transformer's weight matrices. Prompt Tuning, on the other hand, typically learns a soft prompt that is encoded within the model, rather…
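
A sketch of each PEFT method configured separately; whether both can be applied to one model at the same time is exactly the open question, and the base model and hyperparameters below are assumptions:

from transformers import AutoModelForCausalLM
from peft import LoraConfig, PromptTuningConfig, TaskType, get_peft_model

# LoRA: learn low-rank update matrices on top of the frozen weights
lora_model = get_peft_model(
    AutoModelForCausalLM.from_pretrained("gpt2"),
    LoraConfig(task_type=TaskType.CAUSAL_LM, r=8, lora_alpha=32, lora_dropout=0.1),
)

# Prompt tuning: learn soft prompt embeddings prepended to every input
pt_model = get_peft_model(
    AutoModelForCausalLM.from_pretrained("gpt2"),
    PromptTuningConfig(task_type=TaskType.CAUSAL_LM, num_virtual_tokens=20),
)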
3
votes
1 answer

How to use Huggingface Trainer with multiple GPUs?

Say I have the following model (from this script): from transformers import AutoTokenizer, GPT2LMHeadModel, AutoConfig config = AutoConfig.from_pretrained( "gpt2", vocab_size=len(tokenizer), n_ctx=context_length, …
Penguin
  • 1,923
  • 3
  • 21
  • 51
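
A sketch, assuming two GPUs: the Trainer picks up all visible GPUs on its own (DataParallel), and distributed training only requires launching the same script with torchrun.

# torchrun --nproc_per_node=2 train.py
from transformers import Trainer, TrainingArguments

args = TrainingArguments(
    output_dir="out",
    per_device_train_batch_size=8,  # per GPU; effective batch size = 8 * number of GPUs
)
# trainer = Trainer(model=model, args=args, train_dataset=train_ds)  # objects from the question's script
# trainer.train()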
3
votes
0 answers

Huggingface: ValueError: You have to specify either decoder_input_ids or decoder_inputs_embeds

I am fine-tuning the 'microsoft/trocr-base-printed' image-to-text model to recognize the captcha text in an image. I found this link while trying to avoid the error: ValueError: You have to specify either decoder_input_ids or decoder_inputs_embeds,…
wJoyW
  • 51
  • 1
  • 5
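
A sketch of the usual fix, assuming a PIL image img and its ground-truth string text: passing labels lets the VisionEncoderDecoderModel derive decoder_input_ids internally.

from transformers import TrOCRProcessor, VisionEncoderDecoderModel

processor = TrOCRProcessor.from_pretrained("microsoft/trocr-base-printed")
model = VisionEncoderDecoderModel.from_pretrained("microsoft/trocr-base-printed")

pixel_values = processor(images=img, return_tensors="pt").pixel_values
labels = processor.tokenizer(text, return_tensors="pt").input_ids

outputs = model(pixel_values=pixel_values, labels=labels)  # decoder inputs are built from labels
loss = outputs.loss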
3
votes
1 answer

How to use diffusers with custom ckpt file

I currently have the following code, which runs a prompt on a model that it downloads from Hugging Face. from diffusers import StableDiffusionPipeline, EulerDiscreteScheduler model_id = "stabilityai/stable-diffusion-2" # Use the Euler scheduler here…
Mohammad Razeghi
  • 1,574
  • 2
  • 14
  • 33
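
A sketch assuming a recent diffusers release, which can load an original .ckpt file directly (older versions relied on a conversion script instead); the checkpoint path is hypothetical:

from diffusers import StableDiffusionPipeline

pipe = StableDiffusionPipeline.from_single_file("path/to/custom_model.ckpt")
pipe = pipe.to("cuda")
image = pipe("an astronaut riding a horse").images[0]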
3
votes
0 answers

Huggingface: Fine-tuning (not enough values to unpack (expected 2, got 1))

I'm trying to fine-tune the erfan226/persian-t5-paraphraser paraphrase generation model for Persian sentences. I used the Persian subset of the tapaco dataset and reformatted it to match the glue (mrpc) dataset used in the fine-tuning documentation. I have…
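
A sketch of seq2seq-style preprocessing for a paraphrase-pair dataset, which sidesteps the classification-style (mrpc) formatting; the column names are assumptions:

from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("erfan226/persian-t5-paraphraser")

def preprocess(batch):
    # Source sentences become encoder inputs, target paraphrases become labels
    model_inputs = tokenizer(batch["sentence1"], max_length=128, truncation=True)
    labels = tokenizer(text_target=batch["sentence2"], max_length=128, truncation=True)
    model_inputs["labels"] = labels["input_ids"]
    return model_inputs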
3
votes
3 answers

How to use AWS Sagemaker with newer version of Huggingface Estimator?

When trying to use the Huggingface estimator on SageMaker ("Run training on Amazon SageMaker"), e.g. # create the Estimator huggingface_estimator = HuggingFace( entry_point='train.py', source_dir='./scripts', …
alvas
  • 115,346
  • 109
  • 446
  • 738
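
A sketch of pinning newer framework versions on the estimator; the exact version strings must match a Deep Learning Container image that AWS actually publishes, and the instance type and role are assumptions:

from sagemaker.huggingface import HuggingFace

huggingface_estimator = HuggingFace(
    entry_point="train.py",
    source_dir="./scripts",
    instance_type="ml.p3.2xlarge",   # assumed instance type
    instance_count=1,
    role=role,                       # SageMaker execution role from the notebook
    transformers_version="4.26",
    pytorch_version="1.13",
    py_version="py39",
)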
3
votes
1 answer

Text generation AI models generating repeated/duplicate text/sentences. What am I doing incorrectly? Hugging face models - Meta GALACTICA

I have worked all day with the available text generation models; you can find a list of them here: https://huggingface.co/models?pipeline_tag=text-generation&sort=downloads I want to generate longer text outputs; however, with multiple different models,…
Furkan Gözükara
  • 22,964
  • 77
  • 205
  • 342
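
A sketch of generate() settings that commonly reduce repetition, assuming model and tokenizer are already loaded:

inputs = tokenizer("The Transformer architecture", return_tensors="pt")
output_ids = model.generate(
    **inputs,
    max_new_tokens=200,
    do_sample=True,           # sample instead of greedy decoding
    top_p=0.9,
    temperature=0.8,
    no_repeat_ngram_size=3,   # never repeat the same 3-gram
    repetition_penalty=1.2,
)
print(tokenizer.decode(output_ids[0], skip_special_tokens=True))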
3
votes
0 answers

Huggingface datasets storing and loading image data

I have a huggingface dataset with an image column: ds["image"][0]. When I save it to disk and load it later, I get the image column as…
Vincent Claes
  • 3,960
  • 3
  • 44
  • 62
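
A sketch, assuming ds is the dataset from the question: after reloading, casting the column back to the Image feature usually restores PIL decoding.

from datasets import load_from_disk, Image

ds.save_to_disk("my_dataset")
reloaded = load_from_disk("my_dataset")

# If the column comes back as raw bytes/paths instead of PIL images:
reloaded = reloaded.cast_column("image", Image())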
3
votes
1 answer

How to split input text into equal size of tokens, not character length, and then concatenate the summarization results for Hugging Face transformers

I am using the methodology below to summarize texts longer than 1024 tokens. The current method splits the text in half. I took this from another user's post and modified it slightly. What I want to do is, instead of splitting in half,…
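
A sketch of chunking by token count rather than character length and then concatenating the per-chunk summaries; the model name and chunk size are assumptions:

from transformers import AutoTokenizer, pipeline

model_name = "facebook/bart-large-cnn"
tokenizer = AutoTokenizer.from_pretrained(model_name)
summarizer = pipeline("summarization", model=model_name)

def summarize_long(text, chunk_tokens=900):
    # Split the token ids into fixed-size chunks, decode each chunk back to text,
    # summarize each piece, and join the results
    ids = tokenizer(text, truncation=False)["input_ids"]
    chunks = [ids[i:i + chunk_tokens] for i in range(0, len(ids), chunk_tokens)]
    pieces = [tokenizer.decode(c, skip_special_tokens=True) for c in chunks]
    summaries = [summarizer(p, max_length=150, min_length=30)[0]["summary_text"] for p in pieces]
    return " ".join(summaries)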
3
votes
4 answers

How to save a SetFit trainer locally after training

I am working on an HPC with no internet access on worker nodes, and the only option to save a SetFit trainer after training is to push it to the HuggingFace hub. How do I go about saving it locally to disk? https://github.com/huggingface/setfit
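
A sketch, assuming trainer holds a trained SetFit trainer: the underlying SetFitModel can be written to and reloaded from a local folder.

from setfit import SetFitModel

trainer.model.save_pretrained("setfit-model")        # local directory instead of the Hub
model = SetFitModel.from_pretrained("setfit-model")  # reload later from the same path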
3
votes
1 answer

How to fine-tune gpt-j using Huggingface Trainer

I'm attempting to fine-tune gpt-j using the huggingface trainer and failing miserably. I followed the example that references bert, but of course, the gpt-j model isn't exactly like the bert model. The error indicates that the model isn't producing…
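
A sketch of the causal-LM setup, where the data collator copies input_ids into labels; the tokenized dataset train_ds is assumed:

from transformers import (AutoModelForCausalLM, AutoTokenizer,
                          DataCollatorForLanguageModeling, Trainer, TrainingArguments)

tokenizer = AutoTokenizer.from_pretrained("EleutherAI/gpt-j-6B")
tokenizer.pad_token = tokenizer.eos_token            # GPT-J ships without a pad token
model = AutoModelForCausalLM.from_pretrained("EleutherAI/gpt-j-6B")

collator = DataCollatorForLanguageModeling(tokenizer, mlm=False)  # labels = input_ids
trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir="out", per_device_train_batch_size=1),
    train_dataset=train_ds,
    data_collator=collator,
)
# trainer.train()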
3
votes
0 answers

Batch size during training vs batch size during evaluation

I am confused about the difference between batch size during training versus batch size during evaluation. I am trying to measure how batch size influences the inference time (speed of prediction) of different NLP models after they have been trained…
MartinDK
  • 33
  • 3
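
A short sketch: the two batch sizes are independent TrainingArguments, so evaluation can use a different (often larger) batch size than training.

from transformers import TrainingArguments

args = TrainingArguments(
    output_dir="out",
    per_device_train_batch_size=16,  # used by trainer.train()
    per_device_eval_batch_size=64,   # used by trainer.evaluate() / trainer.predict()
)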
3
votes
2 answers

ValueError: Tokenizer class MarianTokenizer does not exist or is not currently imported

I get this error when trying to run a MarianMT-based NMT model. Traceback (most recent call last): File "/home/om/Desktop/Project/nmt-marionmt-api/inference.py", line 45, in print(batch_inference(model_path="en-ar-model/Mark2",…
Om Rastogi
  • 478
  • 1
  • 5
  • 12
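
A sketch of the usual workaround, assuming the error comes from the missing sentencepiece dependency: install it and load the concrete Marian classes.

# pip install sentencepiece sacremoses
from transformers import MarianMTModel, MarianTokenizer

model_path = "en-ar-model/Mark2"   # local path from the traceback
tokenizer = MarianTokenizer.from_pretrained(model_path)
model = MarianMTModel.from_pretrained(model_path)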
3
votes
1 answer

Is there any way I can use the downloaded pre-trained models for TIMM?

For some reason, I have to use the TIMM package offline. But I found that if I use create_model(), for example: self.img_encoder = timm.create_model("swin_base_patch4_window7_224", pretrained=True) I get http.client.RemoteDisconnected: Remote end…
Yy X
  • 31
  • 3
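
A sketch for offline use, assuming the checkpoint was downloaded beforehand and matches the architecture (the path is hypothetical):

import timm

# Build the architecture without downloading, then load weights from a local file
model = timm.create_model(
    "swin_base_patch4_window7_224",
    pretrained=False,
    checkpoint_path="/path/to/swin_base_patch4_window7_224.pth",
)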