A GPU kernel and API call profiling tool for the CUDA environment, with both a command-line interface and a GUI. It replaces parts of the older nvprof and NVIDIA Visual Profiler tools and complements Nsight Systems.
Questions tagged [nsight-compute]
34 questions
0 votes, 1 answer
How to find gld_throughput and gst_throughput using nv-nsight-cu-cli
I am not able to get this working, and the documentation is a bit tricky to read.
I tried the command below, and the outputs show as n/a.
root@teja:~/Projs/CUDA/05-Profiling# nv-nsight-cu-cli --device 0 --metrics gst_throughput,gld_throughput ./run 0
==PROF== Connected to…

Satya Teja
0 votes, 1 answer
Nsight Compute can't profile Waveglow (PyTorch application)
I tried to profile https://github.com/NVIDIA/waveglow by this command:
nv-nsight-cu-cli --export ./nsight_output ~/.virtualenvs/waveglow/bin/python3 inference.py -f <(ls mel_spectrograms/*.pt) -w waveglow_256channels.pt -o . --is_fp16 -s…

minty99
0 votes, 2 answers
nv-nsight-cu-cli caused TensorFlow to fail
I've downloaded the newest Nsight Compute profiling tool and I want to use it to benchmark TensorFlow applications. The code I'm using is here. It runs perfectly fine when I execute it, and when I benchmark it with nvprof ./mnist.py it had no problem…

edhu
-1 votes, 1 answer
Using ncu to profile page faults in unified memory
Is there any option to profile a CUDA application that uses unified virtual memory with Nsight Compute (ncu)? For example, I want to know the time spent handling page faults and migration.

Daniel