Is there a trace or debug function that I can place in a CUDA kernel? I'm writing a program on Windows 7, VS2010 and I just discovered that to use NSIGHT Monitor I have to have 2 GPUs. I'm on a laptop unfortunately so this isn't really an option. I figured that I would fall back to tried and true debug/trace functions. Something akin to OutputDebugString. I don't see if one exists though.
Any help is appreciated. Thanks,
mj