Questions tagged [gpu]

Acronym for "Graphics Processing Unit". For programming traditional graphical applications, see the tag entry for "graphics programming". For general-purpose programming using GPUs, see the tag entry for "gpgpu". For specific GPU programming technologies, see the popular tag entries for "opencl", "cuda" and "thrust".

Acronym for "Graphics Processing Unit". For programming traditional graphical applications, see the tag entry for . For general-purpose programming using GPUs, see the tag entry for . For specific GPU programming technologies, see the popular tag entries for , and .

More information on GPU at http://en.wikipedia.org/wiki/Graphics_processing_unit

8854 questions
32
votes
8 answers

CUDA apps time out & fail after several seconds - how to work around this?

I've noticed that CUDA applications tend to have a rough maximum run-time of 5-15 seconds before they will fail and exit out. I realize it's ideal to not have CUDA application run that long but assuming that it is the correct choice to use CUDA and…
rck
  • 2,020
  • 2
  • 23
  • 23
31
votes
3 answers

Why is the memory in GPU still in use after clearing the object?

Starting with zero usage: >>> import gc >>> import GPUtil >>> import torch >>> GPUtil.showUtilization() | ID | GPU | MEM | ------------------ | 0 | 0% | 0% | | 1 | 0% | 0% | | 2 | 0% | 0% | | 3 | 0% | 0% | Then I create a big enough…
alvas
  • 115,346
  • 109
  • 446
  • 738
31
votes
3 answers

GPU programming on Android devices

I have no knowledge of GPU programming and I'd like an overview of this. I must develop a project of Image Processing, working on smartphones' s GPU (on Android devices), but I don't know where to start. 1)Programming Smartphone's GPU and…
Salva
  • 467
  • 1
  • 5
  • 10
30
votes
4 answers

Tensorflow GPU Could not load dynamic library 'cusolver64_10.dll'; dlerror: cusolver64_10.dll not found

When i run import tensorflow as tf tf.test.is_gpu_available( cuda_only=False, min_cuda_compute_capability=None ) I get the following error
Haseeb
  • 2,039
  • 3
  • 11
  • 26
30
votes
11 answers

Why is Tensorflow not recognizing my GPU after conda install?

I am new to deep learning and I have been trying to install tensorflow-gpu version in my pc in vain for the last 2 days. I avoided installing CUDA and cuDNN drivers since several forums online don't recommend it due to numerous compatibility issues.…
Sarosij Bose
  • 403
  • 1
  • 5
  • 6
30
votes
2 answers

What is CUDA like? What is it for? What are the benefits? And how to start?

I am interested in developing under some new technology and I was thinking in trying out CUDA. Now... their documentation is too technical and doesn't provide the answers I'm looking for. Also, I'd like to hear those answers from people that've had…
PedroC88
  • 3,708
  • 7
  • 43
  • 77
30
votes
7 answers

How do I customize nvidia-smi 's output to show PID username?

The output of nvidia-smi shows the list of PIDs which are running on the GPU: Thu May 10 09:05:07 2018 +-----------------------------------------------------------------------------+ | NVIDIA-SMI 384.111 Driver Version: 384.111…
Dang Manh Truong
  • 795
  • 2
  • 10
  • 35
30
votes
5 answers

How to add report_tensor_allocations_upon_oom to RunOptions in Keras

I'm trying to train a neural net on a GPU using Keras and am getting a "Resource exhausted: OOM when allocating tensor" error. The specific tensor it's trying to allocate isn't very big, so I assume some previous tensor consumed almost all the…
dspeyer
  • 2,904
  • 1
  • 18
  • 24
30
votes
6 answers

Is there a way of determining how much GPU memory is in use by TensorFlow?

Tensorflow tends to preallocate the entire available memory on it's GPUs. For debugging, is there a way of telling how much of that memory is actually in use?
Maarten
  • 4,549
  • 4
  • 31
  • 36
30
votes
5 answers

Which OpenGL functions are not GPU-accelerated?

I was shocked when I read this (from the OpenGL wiki): glTranslate, glRotate, glScale Are these hardware accelerated? No, there are no known GPUs that execute this. The driver computes the matrix on the CPU and uploads it to the GPU. All the…
Xavier Ho
  • 17,011
  • 9
  • 48
  • 52
29
votes
4 answers

How do I list all currently available GPUs with pytorch?

I know I can access the current GPU using torch.cuda.current_device(), but how can I get a list of all the currently available GPUs?
vvvvv
  • 25,404
  • 19
  • 49
  • 81
29
votes
7 answers

Could not load dynamic library 'libcublas.so.10'; dlerror: libcublas.so.10: cannot open shared object file: No such file or directory;

When I try to run a python script , which uses tensorflow, it shows following error ... 2020-10-04 16:01:44.994797: I tensorflow/stream_executor/platform/default/dso_loader.cc:48] Successfully opened dynamic library libcudart.so.10.1 2020-10-04…
Ravi Dewangan
  • 293
  • 1
  • 3
  • 5
29
votes
2 answers

Synchronization between command buffers in Vulkan

There are several ways to handle synchronization in Vulkan. This is how I understand it: Fences are GPU to CPU syncs. Semaphores are GPU to GPU syncs, they are used to sync queue submissions (on the same or different queues). Events are more…
hidayat
  • 9,493
  • 13
  • 51
  • 66
29
votes
4 answers

Java GPU programming

Is it possible to do GPU programming in Java ? I mean without using native libraries. And how much of a performance improvement can one expect when we switch over to gpu's ? Edit: I am not looking at game programming, I want to do hard core number…
Anand Sunderraman
  • 7,900
  • 31
  • 90
  • 150
28
votes
7 answers

How to obtain OpenCL SDK?

I was perusing http://www.khronos.org/ web site and only found headers for OpenCL (not OpenGL which I don't care about). How can I obtain OpenCL SDK?
Roman Kagan
  • 10,440
  • 26
  • 86
  • 126