I'm using nvprof to profile CUDA code. Can someone help me work out how to calculate the compute to global memory access ratio from the information obtained by running
nvprof --analysis-metrics