I remotely access High-performance computing nodes. I am not sure about NVIDIA Collective Communications Library (NCCL) is installed in my directory or not. Is there any way to check whether the NCCL is installed or not?
Asked
Active
Viewed 1.8k times
9
-
did you try `nvcc --version`? – Charlie Parker Jul 22 '21 at 17:36
-
2you can do `python -c "import torch;print(torch.cuda.nccl.version())"` with pytorch. I wish I new the terminal command without pytorch. – Charlie Parker Jul 22 '21 at 17:41
2 Answers
17
You can try
locate nccl| grep "libnccl.so" | tail -n1 | sed -r 's/^.*\.so\.//'
or if you use PyTorch:
python -c "import torch;print(torch.cuda.nccl.version())"
Check it this link Command Cheatsheet: Checking Versions of Installed Software / Libraries / Tools for Deep Learning on Ubuntu
For containers, where no locate
is available sometimes, one might replace it with ldconfig -v
:
ldconfig -v | grep "libnccl.so" | tail -n1 | sed -r 's/^.*\.so\.//'

Sadra
- 2,480
- 2
- 20
- 32

Spéro ADONON
- 179
- 3
-
2When I enter `locate nccl| grep "libnccl.so" | tail -n1 | sed -r 's/^.*\.so\.//'1, it show nothing. – Ahmad Apr 07 '21 at 11:45
-4
You can usually do this in the command line:
nvcc --version
you might have to run:
sudo apt install nvidia-cuda-toolkit
too.
As the other answerer mentioned, you can do:
torch.cuda.nccl.version()
in pytorch. Copy paste this into your terminal:
python -c "import torch;print(torch.cuda.nccl.version())"
I am sure there is something like that in tensorflow.

Charlie Parker
- 5,884
- 57
- 198
- 323
-
NVCC is a general CUDA C++ compiler. It doesn't report NCCL (communications library) version. The first part of the answer is wrong. – Dima Mironov Sep 07 '21 at 08:33