Questions tagged [gpu]
57 questions
0
votes
0 answers
Interpretation of output of nvidia-smi and lspci | grep -i nvidia
I am very new to GPU servers. I submitted a Slurm job and then checked "nvidia-smi", which gave the following output:
This picture
Then I ran "lspci | grep -i nvidia" and got this output:
01:00.0 VGA compatible controller: NVIDIA Corporation…
0
votes
0 answers
AMD driver doesn't load after rebooting VM with GPU passthrough on virt-manager
I'm having an issue with GPU passthrough, using virt-manager on Debian 11. It's a very specific question, but I hope that someone can help me with this. I have an RX 5500XT GPU, and I passed it through following the tutorial available on the Arch wiki…
0
votes
0 answers
How to size compute/GPU/storage/network for generative AI or an LLM?
I would like to provision compute (servers), GPUs (say 2 A100 80GB or H100), storage and network (maybe 100GbE) to run the OpenLLaMA 7B (https://huggingface.co/openlm-research/open_llama_7b) model.
How do I go about sizing this? AWS/GCP cluster sizing…

techele
0
votes
0 answers
How do I fix amdgpu and amdgpu-dkms packages not installing?
I'm trying to install the amdgpu package and it throws a bunch of errors:
Reading package lists... Done
Building dependency tree... Done
Reading state information... Done
The following packages were automatically installed and are no longer…

Mach
0
votes
1 answer
LM-Sensors - Intel Arc A750 + Dell R720 + Debian 11 (Kernel 6.2.2)
I have a Dell R720 at home running virtualized services under KVM. I have recently added an Intel Arc A750, which I've passed through to a Debian 11 VM (with a Q35 machine model), where I have Jellyfin running. The sole purpose of the card is…

KiralyCraft
0
votes
0 answers
How to monitor a Windows GPU in Zabbix
I am using Zabbix 6.0 and I want to monitor GPU usage, temperature, and other GPU-related metrics on my Windows host. How can I do that? The host runs Windows 10 and has the agent installed.

biplab
0
votes
1 answer
How do you disable hardware-accelerated GPU scheduling via the command line in Windows 10/11?
I need to disable, programmatically, hardware-accelerated GPU scheduling in Windows if it's enabled.
Searching, I was pointed at the HKLM\SYSTEM\CurrentControlSet\Control\GraphicsDrivers - HwSchMode regkey. This is 2 when HW GPU scheduling is…

Jason Floyd
0
votes
1 answer
Dell PowerEdge R720XD - Fans Ramp Up When Video Card is Installed
I have a Dell PowerEdge R720XD with 2x Xeon E5-2640 v2 (16 cores total), 16 GB of RAM, and 2x 3 TB hard drives. I have Windows 10 running on the machine.
When I connect a video card to the system, the fans are significantly louder than if the system…
0
votes
0 answers
How do I troubleshoot intermittent node/kubelet reboots on GKE?
I am running workloads on a spot GPU node pool and intermittently getting 'NodeNotReady' followed by a reboot/restart of the node (and loss of the workload pod). However, the node does not go away; it reboots, and the kubelet becomes ready again…

Rupert Lloyd
0
votes
0 answers
There is no dir "/sys/devices/pci0000:00/0000:00:02.0/mdev_supported_types" with 5.19.0-23-generic
I have an issue with 5.19.0-23-generic #24-Ubuntu SMP: there is no /sys/devices/pci0000:00/0000:00:02.0/mdev_supported_types directory.
uname -a
Linux hp1 5.19.0-23-generic #24-Ubuntu SMP PREEMPT_DYNAMIC Fri Oct 14 15:39:57 UTC 2022 x86_64 x86_64 x86_64…
0
votes
0 answers
T4 GPU slows down automatically after some time with power usage limited
I had been using a T4 GPU on a virtual machine with 8 virtual CPUs and 30 GB of RAM for more than a month without any issues.
I have been running some deep learning training jobs on it.
But since yesterday, I have been trying some new models on a larger…
0
votes
0 answers
Setting up an AI server for multiple users
At the university where I work, we want to buy a workstation with one or two powerful GPUs. We want students to be able to use the workstation remotely. This is the main requirement. It would be nice if multiple users could use the workstation…

Tensar
0
votes
1 answer
Can each GPU be used on Kubernetes as dedicated to a specific Pod?
I have the following environment:
Pods: Pod0, Pod1 (launched as a k8s Job)
GPUs: GPU0, GPU1
GPU0 is dedicated to Pod0, and GPU1 is dedicated to Pod1.
There can be multiple Pod0s and Pod1s at the same time. If there are two Pod0s, only a single Pod0…

Daigo
0
votes
0 answers
CloudStack and GPU support
Is there any documentation for GPU and vGPU support in CloudStack version 4.14 and later? For example, how can I find out whether CloudStack supports the Nvidia Quadro RTX series?

for1401
0
votes
1 answer
GCP: Cannot create any VM with GPU -> No capacity
I subscribed to GCP and received the $300 credits. Then I upgraded my account to "paid account". Next, I increased the limit for multiple VM types with GPU, in multiple regions, and received approval emails.
I tried creating a VM with GPU in…