I am trying to debug a program in cuda-gdb. I am able to successfully set breakpoints in code that runs on the host (CPU), but whenever I try to set a breakpoint in code that runs on the GPU, the debugger skips over the breakpoints and gives me the following error:
"warning: Cuda API error detected: cudaLaunchKernel returned (0x7)"
It then continues to successfully execute the rest of the code. How can I make these work?