Questions tagged [nvidia]

For programming questions specifically related to Nvidia hardware. N.B. Questions about system configuration are usually off-topic here!

Nvidia is an American global technology company based in Santa Clara, California, best known for its graphics processors (GPUs).

More about Nvidia at http://en.wikipedia.org/wiki/Nvidia
Nvidia website at http://www.nvidia.com/content/global/global.php

3668 questions
16
votes
2 answers

Can I use C++11 in the .cu-files (CUDA5.5) in Windows7x64 (MSVC) and Linux64 (GCC4.8.2)?

When I compile the following code containing the design C++11, in Windows7x64 (MSVS2012 + Nsight 2.0 + CUDA5.5), then I do not get errors, and everything compiles and works well: #include int main() { …
Alex
  • 12,578
  • 15
  • 99
  • 195
16
votes
4 answers

How to run CUDA without a GPU using a software implementation?

My laptop doesn't have a nVidia graphic cards, and I want to work on CUDA. The website says that CUDA can be used in emulation mode on non-cuda hardware too. But when I tried installing CUDA drivers downloaded from their website, it gives an error…
emkrish
  • 191
  • 1
  • 2
  • 4
16
votes
5 answers

CUDA Runtime API error 38: no CUDA-capable device is detected

The Situation I have a 2 gpu server (Ubuntu 12.04) where I switched a Tesla C1060 with a GTX 670. Than I installed CUDA 5.0 over the 4.2. Afterwards I compiled all examples execpt for simpleMPI without error. But when I run ./devicequery I get…
Framester
  • 33,341
  • 51
  • 130
  • 192
15
votes
2 answers

Number of Compute Units corresponding to the number of work groups

I need some clarification. I'm developing OpenCL on my laptop running a small nvidia GPU (310M). When I query the device for CL_DEVICE_MAX_COMPUTE_UNITS, the result is 2. I read the number of work groups for running a kernel should correspond to the…
rdoubleui
  • 3,554
  • 4
  • 30
  • 51
15
votes
2 answers

How can I use 100% of VRAM on a secondary GPU from a single process on windows 10?

This is on windows 10 computer with no monitor attached to the Nvidia card. I've included output from nvida-smi showing > 5.04G was available. Here is the tensorflow code asking it to allocate just slightly more than I had seen previously: (I want…
Steve Steiner
  • 5,299
  • 4
  • 32
  • 43
15
votes
2 answers

nvcc fatal : Value 'sm_20' is not defined for option 'gpu-architecture'

I've looked at many pages and either could not follow what they were saying because they were unclear and/or my knowledge is just not sufficient enough. I am trying to run: luarocks install…
John Lexus
  • 3,576
  • 3
  • 15
  • 33
15
votes
2 answers

Which CUDA Toolkit version for older NVIDIA Driver

I have been provided an older NVIDIA graphics card (GeForce 8400 GS) to begin exploring some GPU computing. I have tried to complete the installation successfully but have stumbled upon a problem. Here are my steps (on Ubuntu 14.04) sudo apt-get…
cdeterman
  • 19,630
  • 7
  • 76
  • 100
15
votes
3 answers

Compile cuda code for CPU

I'm study cuda 5.5 but i don't have any Nvidia GPU. In old version of nvcc have a flag --multicore to compile cuda code for CPU. In the new version of nvcc, what's is the option?? I'm working on Linux.
F.N.B
  • 1,539
  • 6
  • 23
  • 39
15
votes
3 answers

PTX - what is a CTA?

I'm studying PTX and I don't understand how a CTA (compute thread array) is different from a CUDA block. Are they the same thing? It seems to me that for now (I'm just at the beginning of the PTX document) they're just the same
Marco A.
  • 43,032
  • 26
  • 132
  • 246
15
votes
6 answers

Forcing hardware accelerated rendering

I have an OpenGL library written in c++ that is used from a C# application using C++/CLI adapters. My problem is that if the application is used on laptops with Nvidia Optimus technology the application will not use the hardware acceleration and…
JohanR
  • 151
  • 1
  • 4
15
votes
2 answers

Keep getting CL_INVALID_KERNEL_ARGS on nvidia gpu

I'm using OpenCL on an nvidia GPU and I keep getting CL_INVALID_KERNEL_ARGS when I try to execute a kernel. I've stepped it down to a very simple program: __kernel void foo(int a, __write_only image2d_t bar) { int 2 coords = {0,…
Trevor
  • 1,369
  • 2
  • 13
  • 28
14
votes
1 answer

What's the difference between PTX and CUBIN w.r.t. the NVCC compiler?

I have CUDA 4.0 installed, and a device with Compute Capability 2.0 (a GTX 460 card). What is the difference between the 'cubin' and the 'ptx' file? I think the cubin is a native code for the gpu so this is micro-architecture specific, and the ptx…
user973764
  • 141
  • 1
  • 1
  • 3
14
votes
6 answers

Could not load library cudnn_ops_infer64_8.dll. Error code 126 Please make sure cudnn_ops_infer64_8.dll is in your library path

Could not load library cudnn_ops_infer64_8.dll. Error code 126 Please make sure cudnn_ops_infer64_8.dll is in your library path. I've tried searching online but it's been hours and I haven't found anything. I would really appreciate anyone sharing…
Hissaan Ali
  • 2,229
  • 4
  • 25
  • 51
14
votes
4 answers

128 bit integer on cuda?

I just managed to install my cuda SDK under Linux Ubuntu 10.04. My graphic card is an NVIDIA geForce GT 425M, and I'd like to use it for some heavy computational problem. What I wonder is: is there any way to use some unsigned 128 bit int var? When…
Matteo Monti
  • 8,362
  • 19
  • 68
  • 114
14
votes
1 answer

How to get card specs programmatically in CUDA

I'm just starting out with CUDA. Is there a way of getting the card specs programmatically?
cookM
  • 953
  • 3
  • 8
  • 11