Questions tagged [cudnn]

The NVIDIA CUDA Deep Neural Network library (cuDNN) is a GPU-accelerated library of primitives for deep neural networks.

The NVIDIA CUDA Deep Neural Network library (cuDNN) is a GPU-accelerated library of primitives for deep neural networks. cuDNN provides highly tuned GPU implementations of standard routines such as forward and backward convolution, pooling, normalization, and activation layers, exposed by a C++ API.

Please use this tag only if you are writing or building code which uses the cuDNN APIs. If you are having problems with installing or running frameworks or libraries which use cuDNN internally, please tag those questions with the library or framework in question, and refrain from tagging them with this tag.

385 questions
0
votes
1 answer

Does cuDNN have a device api?

I am wondering whether cuDNN has a device side api for dynamic parallelism (I want to call cuDNN kernels inside other kernels). I have found that cuBLAS has such an api: cuBlas but I could not find any information on whether cuDNN provides a similar…
M.Soyturk
  • 340
  • 3
  • 14
0
votes
0 answers

RTX 3080 and CUDA 10.2

I am not familar with CUDA so there might be some mistakse in my coments. I am currently using RTX 3080 and CUDA 11.4. For my project, CUDA 10.2 is recommended, So I used nvidia-docker and installed CUDA 10.2 but faced with this error. ValueError:…
Dongri
  • 1
  • 2
0
votes
1 answer

Equivalent AdaptiveAvgPool2d API in cuDNN

Is there an equivalent API in cuDNN as the AdaptiveAvgPool2d in Pytorch?
Daniel
  • 325
  • 3
  • 10
0
votes
1 answer

Identifier "cusparseXXX" is undefined on cuda11

I'm building a package tested for CUDA 9,10 from source, trying to compile it for CUDA11. I've already changed gencode=arch=compute_70 (was set on 30), and added target_link_libraries(tsnecuda ${CUDA_cusparse_LIBRARY}) Unfortunately, I still…
DsCpp
  • 2,259
  • 3
  • 18
  • 46
0
votes
1 answer

Cudnn issue while using OpenNMT-tf 2.10 with tensorflow 2.2 inAnaconda virtual env

I am trying to train OpenNMT-tf transformer model on GPU GeForce RTX 2060 8GB Memory. You can see steps Here. I have created Anaconda virtual environment and installed tensorflow-gpu using following commend. conda install…
0
votes
0 answers

How do I verify if CUDNN installed correctly?

I have tried this command cat /usr/include/cudnn.h | grep CUDNN_MAJOR -A 2 but it does not show anything. I am also trying to verify by the way given at Nvidia website but cp -r /usr/src/cudnn_samples_v7/ $HOME it says these are the same file. I am…
0
votes
1 answer

CUDA version crash

I installed CUDA 11.2 and realised that's not the version I wanted... Then I uninstall it via Control Panel "Uninstall a Program" and uninstall all CUDA 11.2 stuff(but I didn't uninstall those Nvidia Nsight stuff) After that I install CUDA 10.2, but…
mocha234
  • 127
  • 2
  • 11
0
votes
1 answer

CMAKE cannot recognize Cudnn whilte it's installed

I installed CUDA 10.2, and installed matching CUDNN, but CMAKE cannot recognize it while I try to install OpenCV with CUDA. I already copied Cudnn files from bin, include, and lib folders to the corresponding CUDA folders. I tried several versions…
Mohamed Hedeya
  • 153
  • 5
  • 22
0
votes
1 answer

RuntimeError: cudnn RNN backward can only be called in training mode

I have seen this problem the first time, I never encountered such an error in previous Python projects. Here is my training code: def train(net, opt, criterion,ucf_train, batchsize,i): opt.zero_grad() total_loss = 0 net=net.eval() …
ashwin
  • 25
  • 1
  • 8
0
votes
1 answer

Why does `conda list cudnn` have no output after `conda install pytorch torchvision cudatoolkit=10.2 -c pytorch` installation

Please feel free to vote "Reopen" at the bottom of this question. The reason is that I have marked this as a duplicate although the answers there are not clear enough for this question. As soon as the question is reopened, I can add the the…
questionto42
  • 7,175
  • 4
  • 57
  • 90
0
votes
1 answer

tensorflow 2 on g4dn.xlarge GPU crashes after 8 epochs

I'm trying to train cGAN on g4dn.xlarge GPU ec2 machine and it crashes every time after 8 epochs exactly with the following message: Traceback (most recent call last): File "pix2pix_tf2.py", line 841, in main() File…
user3424107
  • 117
  • 1
  • 5
0
votes
1 answer

Is it possible to manually set the SMs used for one CUDA stream?

By default, the kernel will use all available SMs of the device (if enough blocks). However, now I have 2 stream with one computational-intense and one memory-intense, and I want to limit the maximal SMs used for 2 stream respectively (after setting…
0
votes
2 answers

Issue while doing: cudnn-install / 2.4. Verifying The cuDNN Install On Linux / cudnn_samples_v7 testing

While verifying cuDNN install on Linux as the docs implied: Compile the mnistCUDNN sample. $make clean && make I encountered error as below: XXX@XXX ~/cudnn_samples_v7-master/mnistCUDNN $ make clean && make rm -rf *o rm -rf…
charliie
  • 21
  • 5
0
votes
2 answers

building c++ opencv 4 with cuda ubuntu 16/04

I'm trying to build opencv' with cuda 10.2. When the following command : cmake -DCMAKE_BUILD_TYPE=RELEASE \ -DOPENCV_EXTRA_MODULES_PATH=../../opencv_contrib/modules \ -DWITH_TBB=ON -DWITH_CUDA=ON \ -DBUILD_opencv_cudacodec=OFF…
0
votes
1 answer

is there any solution regarding dl4j with cuda support for this problem?

I am trying to execute MultiGpuLenetMnistExample.java and i have received following error " ... 12:41:24.129 [main] INFO Test - Load data.... 12:41:24.716 [main] INFO Test - Build model.... 12:41:25.500 [main] INFO…
explorer
  • 15
  • 4