Highest Voted 'huggingface' Questions

2

votes

0 answers

SentencePiece tokenizer encodes to unknown token

I am using HuggigFace implementation of SentencePiece tokenizer, i.e., SentencePieceBPETokenizer and SentencePieceUnigramTokenizer classes. I train these tokenizers on dataset which has no unicode characters and then try to encode the string that…

asked Aug 02 '23 at 08:58

Shital Shah

63,284
17
238
185

2

votes

0 answers

call huggingface tokenizer in c#

I would like to call huggingface tokenizer in C# and wonder what might be the best way to achieve this. Specifically, I'd like to use mt5 tokenizer for CJK languages in C#. I have seen certain nuget packages developed such as BertTokenizer in C#,…

python c# huggingface-transformers huggingface

asked Jun 25 '23 at 18:22

exteral

991
2
12
33

2

votes

1 answer

Finetuning Open LLMs

I am a newbie trying to learn fine tuning. Started with falcon 7B instruct LLM as my base LLM and want to fine tune this with open assistant instruct dataset. I have 2080 Ti with 11G VRAM. So I am using 4 bit quantization and Lora. These are the…

huggingface falcon llm

asked Jun 23 '23 at 07:15

codemugal

81
1
1
4

2

votes

2 answers

AttributeError: 'AcceleratorState' object has no attribute 'distributed_type'

import transformers from datasets import load_dataset import tensorflow as tf tokenizer = transformers.AutoTokenizer.from_pretrained('roberta-base') df = load_dataset('csv', data_files={'train':'FinalDatasetTrain.csv',…

huggingface accelerate

asked Jun 20 '23 at 16:07

Amr Samer

21
1

2

votes

0 answers

How to finetune an LLM model on your own codebase?

I have 10 code repositories in Javascript (VueJS) (Each repository corresponds to 1 Theme) I want to train an LLM model on these 10 code repositories so that I can generate new themes using prompts. The LLM model take the context of 10 code…

code-generation huggingface large-language-model

asked Jun 14 '23 at 07:54

Aadesh

403
3
13

2

votes

1 answer

Look for good ways to prepare customized dataset for training controlnet with huggingface diffusers

I want to personally train the controlnet, but I find it inconvenient to prepare the datasets. As I follow the huggingface tutorial available at this link: https://huggingface.co/blog/train-your-controlnet, I believe I should organize the dataset in…

python pytorch huggingface stable-diffusion fine-tune

asked Jun 04 '23 at 13:15

Yun

21
2

2

votes

1 answer

How to use HuggingFace Inference endpoints for both tokenization and inference?

I am trying to set up separate endpoints for tokenization and inference using HuggingFace models. Ideally I would like to use HuggingFace inference endpoints. Is there a straightforward way to spin up endpoints for encoding, decoding, and inference…

huggingface-transformers huggingface-tokenizers huggingface

asked Jun 02 '23 at 15:09

Steven Krawczyk

21
1

2

votes

1 answer

Fine-tuning a pre-trained LLM for question-answering

Objective My goal is to fine-tune a pre-trained LLM on a dataset about Manchester United's (MU's) 2021/22 season (they had a poor season). I want to be able to prompt the fine-tuned model with questions such as "How can MU improve?", or "What are…

huggingface-transformers huggingface language-model fine-tune text-generation

asked May 31 '23 at 11:55

Tom Bomer

83
7

2

votes

1 answer

Stable Diffusion Webui ConnectTimeoutError while starting

I'm trying to set up stable diffusion on a server so that users can access it via RDP and generate what they need. The server in place I hosted on-premise and doesn't have any internet connection. I installed all needed libs via whl files. I was…

python huggingface stable-diffusion

asked May 11 '23 at 16:42

Sacul0815

21
2

2

votes

1 answer

Huggingface - Pipeline with a fine-tuned pre-trained model errors

I have a pre-trained model from facebook/bart-large-mnli I used the Trainer in order to train it on my own dataset. model = BartForSequenceClassification.from_pretrained("facebook/bart-large-mnli", num_labels=14, ignore_mismatched_sizes=True) And…

python pipeline huggingface-transformers text-classification huggingface

asked May 04 '23 at 07:32

Dolev Mitz

103
14

2

votes

0 answers

How to use huggingface + gradio API in javascript

I'm going over the fastai ML course and got stuck in the second lesson with a minor problem - a classifier app that works fine on huggingface spaces doesn't get called correctly by a script suggested by the course team. Here's the code: Having…

javascript huggingface gradio

asked May 01 '23 at 18:14

Non-chemist_dude

53
2

2

votes

1 answer

What is the function of the `text_target` parameter in Huggingface's `AutoTokenizer`?

I'm following the guide here: https://huggingface.co/docs/transformers/v4.28.1/tasks/summarization There is one line in the guide like this: labels = tokenizer(text_target=examples["summary"], max_length=128, truncation=True) I don't understand the…

python huggingface-transformers huggingface

asked Apr 28 '23 at 14:27

Betty

512
7
19

2

votes

1 answer

Export MarianMT model to ONNX

I would like to use the Helsinki-NLP/opus-mt-de-en model from HuggingFace to translate text. This works fine with the HuggingFace Inference API or a Transformers pipeline, e.g.: from transformers import AutoTokenizer, pipeline from…

huggingface-transformers onnx huggingface onnxruntime

asked Apr 24 '23 at 06:37

RGe

1,181
1
10
19

2

votes

0 answers

Huggingface - time out or connection error

I want to use LLaMA and OPT via the Hugging Face API. However, I always get the same two error messages. Either ConnectionError: ('Connection aborted.', RemoteDisconnected('Remote end closed connection without response')) Or {'error': 'Model…

python huggingface

asked Apr 15 '23 at 06:09

Frigoooo

51
4

2

votes

1 answer

validation loss shows 'no log' during fine-tuning model

I'm finetuning QA models from hugging face pretrained models using huggingface Trainer, during the training process, the validation loss doesn't show. My compute_metrices function returns accuracy and f1 score, which doesn't show in the log as…

validation logging pre-trained-model huggingface fine-tune

asked Apr 14 '23 at 07:46

Leran Zhang

21
1

Questions tagged [huggingface]