Questions tagged [nvidia]

For programming questions specifically related to Nvidia hardware. N.B. Questions about system configuration are usually off-topic here!

Nvidia is an American global technology company based in Santa Clara, California, best known for its graphics processors (GPUs).

More about Nvidia at http://en.wikipedia.org/wiki/Nvidia
Nvidia website at http://www.nvidia.com/content/global/global.php

3668 questions

2 answers

Can I use C++11 in the .cu-files (CUDA5.5) in Windows7x64 (MSVC) and Linux64 (GCC4.8.2)?

When I compile the following code, which uses C++11 features, on Windows 7 x64 (MSVS 2012 + Nsight 2.0 + CUDA 5.5), I get no errors, and everything compiles and works well: #include int main() { …

asked Jan 30 '14 at 13:34

Alex

4 answers

How to run CUDA without a GPU using a software implementation?

My laptop doesn't have an Nvidia graphics card, and I want to work with CUDA. The website says that CUDA can be used in emulation mode on non-CUDA hardware too. But when I tried installing the CUDA drivers downloaded from their website, it gave an error…

cuda nvidia

asked Nov 18 '09 at 05:00

emkrish

5 answers

CUDA Runtime API error 38: no CUDA-capable device is detected

The situation: I have a 2-GPU server (Ubuntu 12.04) where I swapped a Tesla C1060 for a GTX 670. Then I installed CUDA 5.0 over 4.2. Afterwards I compiled all examples except simpleMPI without error. But when I run ./devicequery I get…

cuda nvidia

asked Oct 24 '12 at 17:12

Framester

2 answers

Number of Compute Units corresponding to the number of work groups

I need some clarification. I'm developing with OpenCL on my laptop, which has a small Nvidia GPU (310M). When I query the device for CL_DEVICE_MAX_COMPUTE_UNITS, the result is 2. I read that the number of work groups for running a kernel should correspond to the…

opencl nvidia simd

asked Feb 17 '12 at 10:17

rdoubleui

2 answers

How can I use 100% of VRAM on a secondary GPU from a single process on windows 10?

This is on a Windows 10 computer with no monitor attached to the Nvidia card. I've included output from nvidia-smi showing that more than 5.04G was available. Here is the TensorFlow code asking it to allocate just slightly more than I had seen previously: (I want…

tensorflow cuda windows-10 nvidia

asked Dec 17 '17 at 12:40

Steve Steiner

2 answers

nvcc fatal : Value 'sm_20' is not defined for option 'gpu-architecture'

I've looked at many pages and could not follow them, either because they were unclear or because my knowledge is just not sufficient. I am trying to run: luarocks install…

cuda nvidia torch nvcc luarocks

asked Dec 05 '17 at 21:34

John Lexus

2 answers

Which CUDA Toolkit version for older NVIDIA Driver

I have been given an older NVIDIA graphics card (GeForce 8400 GS) to begin exploring GPU computing. I have tried to complete the installation but have run into a problem. Here are my steps (on Ubuntu 14.04): sudo apt-get…

cuda nvidia

asked Mar 24 '15 at 19:46

cdeterman

3 answers

Compile cuda code for CPU

I'm studying CUDA 5.5 but I don't have an Nvidia GPU. Old versions of nvcc had a --multicore flag to compile CUDA code for the CPU. What is the equivalent option in the new version of nvcc? I'm working on Linux.

cuda nvidia nvcc

asked Feb 21 '14 at 22:45

F.N.B

3 answers

PTX - what is a CTA?

I'm studying PTX and I don't understand how a CTA (cooperative thread array) is different from a CUDA block. Are they the same thing? It seems to me that for now (I'm just at the beginning of the PTX document) they're the same.

cuda nvidia gpu ptx

asked Jul 15 '13 at 08:13

Marco A.

6 answers

Forcing hardware accelerated rendering

I have an OpenGL library written in C++ that is used from a C# application through C++/CLI adapters. My problem is that when the application runs on laptops with Nvidia Optimus technology, it does not use hardware acceleration and…

c# opengl nvidia optimus

asked Jun 24 '13 at 07:49

JohanR

2 answers

Keep getting CL_INVALID_KERNEL_ARGS on nvidia gpu

I'm using OpenCL on an Nvidia GPU and I keep getting CL_INVALID_KERNEL_ARGS when I try to execute a kernel. I've stripped it down to a very simple program: __kernel void foo(int a, __write_only image2d_t bar) { int2 coords = {0,…

opencl nvidia

asked Nov 08 '12 at 21:13

Trevor

1 answer

What's the difference between PTX and CUBIN w.r.t. the NVCC compiler?

I have CUDA 4.0 installed, and a device with Compute Capability 2.0 (a GTX 460 card). What is the difference between the 'cubin' and the 'ptx' file? I think the cubin is native code for the GPU, so it is microarchitecture-specific, and the ptx…

cuda nvidia nvcc ptx

asked Oct 08 '11 at 10:35

user973764

6 answers

Could not load library cudnn_ops_infer64_8.dll. Error code 126 Please make sure cudnn_ops_infer64_8.dll is in your library path

Could not load library cudnn_ops_infer64_8.dll. Error code 126 Please make sure cudnn_ops_infer64_8.dll is in your library path. I've tried searching online but it's been hours and I haven't found anything. I would really appreciate anyone sharing…

python tensorflow nvidia

asked Feb 24 '21 at 17:02

Hissaan Ali

4 answers

128 bit integer on cuda?

I just managed to install the CUDA SDK under Ubuntu Linux 10.04. My graphics card is an NVIDIA GeForce GT 425M, and I'd like to use it for some heavy computational problems. What I wonder is: is there any way to use an unsigned 128-bit integer variable? When…

integer cuda nvidia 128-bit

asked May 28 '11 at 14:10

Matteo Monti

1 answer

How to get card specs programmatically in CUDA

I'm just starting out with CUDA. Is there a way of getting the card specs programmatically?

cuda nvidia specs

asked Apr 16 '11 at 19:31

cookM
