Questions tagged [gpu]

57 questions
33
votes
7 answers

Does a server need a GPU?

Do I need a GPU on a text and console only server? No GPU as in no iGPU and dGPU. Im going to be using SSH, so I dont need a display out. Im using Linux, but the OS shouldn't affect the results
tymur999
  • 495
  • 1
  • 6
  • 8
4
votes
1 answer

Kubernetes: How can I get which pod schedule GPU?

I have three Nvidia GPUs in my cluster, and so many pods are running in my cluster. How can I find which one of these pods schedule GPU and how many GPUs they schedule? I use this link to enable the GPU schedule in my cluster for Nvidia…
Nader
  • 153
  • 8
4
votes
1 answer

What is the best metric for auto-scaling GPU instances for machine learning inference in the cloud?

We have an API in AWS with a GPU instance that does inference. We have an auto-scaler setup with the minimum and maximum number of instances, but aren’t sure which metric (GPU/CPU usage, RAM usage, average latency, etc) or combination of metrics…
elwray14
  • 41
  • 2
4
votes
1 answer

Use passthrough GPU in KVM/QEMMU and display in host OS in a window

I'm new to KVM/QEMMU. I have used virtual box to run Windows 10 in a virtual machine on my Arch host system (a laptop with both integrated and discrete GPUs). Being dissatisfied with the video responsiveness of the guest machine I decided to assign…
brett
  • 141
  • 1
  • 3
3
votes
0 answers

The GPU usage provided by nvidia-smi command is very different from GPU metrics from guest OS

I'm working on a project that can monitor virtual machines' vgpu usage. The hypervisor is vCenter, we have nvidia A16 cards installed on vCenter hosts, and assigned a16 vGPU to a couple of windows VMs on this host, theses vGPUs are allocated to the…
3
votes
1 answer

GPU Acceleration on a Windows Server without virtualization over RDP

I'm trying to find out if it's possible to run a Windows Server with one GPU which is shared between all RDP clients so that people could create a session on the server start some program with a UI which needs GPU acceleration disconnect afterwards…
2
votes
0 answers

Quota 'GPUS_ALL_REGIONS' exceeded. Limit: 0.0 globally

I am trying to create the VM instance with NVIDIA K80 GPUs in Asia-East1 so, I requested to increase the quota and team have adjusted the quota. However, when I am trying to create the VM instance by selecting the NVIDIA K80 GPUs with 4 CPUs and…
2
votes
1 answer

What socket(s) does the aux power for a GPU come from in a PowerEdge T550?

Server: Dell PowerEdge T550 Tower Server PSU: Single, Hot Plug, Non-Redundant Power Supply (1+0), 1100W, Mixed Mode Titanium GPU: NVIDIA A40 Photos: https://www.reddit.com/user/bigboyserver/comments/10jj1fj/dell_poweredge_t550_nvidia_a40/ Problem…
2
votes
1 answer

Pod is stuck in PodInitializing status when an initContainer is OOMKilled

I have the following on-prem Kubernetes environment: OS: Red Hat Enterprise Linux release 8.6 (Ootpa) Kubernetes: 1.23.7 (single-node, build with kubeadm) NVIDIA driver: 515.65.01 nvidia-container-toolkit: 1.10.0-1.x86_64 (rpm) containerd:…
Daigo
  • 343
  • 7
  • 20
2
votes
2 answers

GCP does not have enough resources available to fulfill the request for about a month

I've been trying to start my existing GCP VM that has an NVIDIA T4 GPU attached to it, for almost a month at this time. It has been working fine before but now I am constantly getting the error message: The zone '***' does not have enough resources…
masus04
  • 121
  • 2
  • 6
2
votes
0 answers

Slurm srun cannot allocate ressources for GPUs - Invalid generic resource specification

I am able to launch a job on a GPU server the traditional way (using CPU and MEM as consumables): ~ srun -c 1 --mem 1M -w serverGpu1 hostname serverGpu1 but trying to use the GPUs will give an error: ~ srun -c 1 --mem 1M --gres=gpu:1 hostname srun:…
user324810
  • 121
  • 1
  • 3
2
votes
1 answer

Considerations using consumer class (high-end) GPU in server?

Motivation: First of all, even if I have some knowledge of computer science, software development and server Linux administration, I never looked into a server hardware and I am a total "newbie" to it. Sorry if this question is trivial to most of…
Adrian Maire
  • 145
  • 1
  • 10
2
votes
0 answers

Make Headless Server Use Hardware Accelaration

I have a headless CentOS 8 Server with an AMD GPU. I want to use hardware acceleration but when I run OpenGL programs with xvfb-run (e.g glxinfo), the system reports I am using software rendering. How can I use hardware acceleration on a headless…
user548851
2
votes
3 answers

Why are GPUs accessible from docker containers running on Linux hosts, but not on Windows or MacOS hosts?

Recent versions of docker (or any version of nvidia-docker) allow direct(?) access to the host GPU from within docker containers, with full access to CUDA APIs. This is very convenient when deploying complex machine learning inference…
Will
  • 229
  • 3
  • 7
1
vote
1 answer

always available gpu servers

I need several GPU servers for rendering. Region doesn`t matter. Preemptible - I think it is ok for me. It is important that at any time I should be able to run a couple of instances with the GPU. As I understand it, Google can take away quotas for…
1
2 3 4