Highest Voted 'llama' Questions

0

votes

0 answers

Feasibility of using Falcon/Falcoder/Llama2 LLM while trying to use it on AWS EC2 Inferentia 2.8xlarge and G4dn.8xLarge Instances

Is it possible to do inference on the aforementioned machines as we are facing so many issues in Inf2 with Falcon model? Context: We are facing issues while using Falcon/Falcoder on the Inf2.8xl machine. We were able to run the same experiment on…

amazon-ec2 llm llama

asked Aug 03 '23 at 05:49

Amlan

1
1

0

votes

0 answers

How should I resolve this error in LLaMA：TypeError: init() got an unexpected keyword argument 'quantizer' ？

When I was running the LLaMA code, I encountered this error：TypeError: init() got an unexpected keyword argument 'quantizer', and I don't know how to resolve it. I have checked the version compatibility. Please help me come up with possible…

llama

asked Aug 02 '23 at 03:31

Tâm Linh Giang

1

0

votes

0 answers

how to specify temperature and max_new_tokens in the curl request to Llama 2 in Huggingface Inference Endpoint?

I'm new to AI, so apologies if wrong terminology used here. I'm extracting some information from a body of text, and have setup Llama 2 in Huggingface via their Inference Endpoint so I can call it via curl. The curl works for short inputs and…

artificial-intelligence huggingface llama

asked Jul 31 '23 at 20:36

Magnus

10,736
5
44
57

0

votes

1 answer

Finetune LlaMA 7B model using Pytorch Lightning Framework

Need Expert help to solve this issue. LLaMA 7B model for sentiment classification with instructional Finetuning. import torch import torch.nn as nn from torch.utils.data import Dataset, DataLoader from transformers import LlamaTokenizer,…

nlp pytorch-lightning llama

asked Jun 03 '23 at 04:54

wahid

1
1

-1

votes

0 answers

how to make a required output when finetuning a llama2-7b-chat model on GSM8K?

I am finetuning llama2-7b-chat on GSM8K, a dataset concudes about 8k expamples of graduate school math problems. e.g. "question": "Natalia sold clips to 48 of her friends in April, and then she sold half as many clips in May. How many clips did…

machine-learning nlp nlp-question-answering llama

asked Sep 01 '23 at 08:07

Xinlong lee

9
1

-1

votes

1 answer

I want to deploy LLM model on Sagemaker and it is giving me this error. I've tried with different models as well but still facing same error

I'm deploying TheBloke/Llama-2-7b-Chat-GPTQ " model on sagemaker. I'm running this code in sagemaker notebook instance. I've used "ml.g4dn.xlarge" instance for deployement. I've used the same code that have been shown on the deployment on Amazon…

amazon-web-services deployment cloud amazon-sagemaker llama

asked Aug 24 '23 at 10:45

Faiq Aslam

1

-2

votes

0 answers

torch.cuda.OutOfMemoryError: CUDA out of memory

When I run the fine-tuned llama model with lora to generate results using 1 GPU, this error happened torch.cuda.OutOfMemoryError: CUDA out of memory. My code: test_data = Dataset.from_list(torch.load(test_dataset_dir)) tokenizer =…

python pytorch huggingface-transformers llama

asked Aug 30 '23 at 14:29

a7777777

1
1

-2

votes

0 answers

Will Inconsistent Alternation of Responses Affect Fine-Tuning LLAMA2 with Chat History

I am working on fine-tuning LLAMA2 with a dataset containing chat history. While preparing the data, I've noticed that the dialogue doesn't always follow a pattern of alternating responses between speakers. In some cases, one person responds several…

artificial-intelligence llm llama llamacpp

asked Aug 20 '23 at 13:54

Ivo Oostwegel

374
2
20

-3

votes

0 answers

Can QLORAs on StableBeluga-7B learn a personality?

I'm building a repository of QLORA adapters that change the model's personality. The end vision is a hub of ready-to-go personality adapters. I'm hitting a snag when training the QLORAs for Paul Graham's personality on top of a 4-bit quantized…

machine-learning nlp artificial-intelligence large-language-model llama

asked Aug 24 '23 at 01:18

user7547462

1

-4

votes

0 answers

How to Delete GPT Models, Managing Storage Usage for Installed GPT Models and Packages

I have installed several Generative Pretrained Transformer (GPT) models on my local system for fine-tuning purposes, both within Python in Visual Studio Code and via the Command Prompt window during code execution. The installed models include…

python-3.x huggingface gpt-3 gpt-2 llama

asked Aug 05 '23 at 09:59

KARTHIK K

1
1

Questions tagged [llama]