Does cuMemcpy "care" about the current context?

Question

Suppose I have a GPU and driver version supporting unified addressing; two GPUs, G0 and G1; a buffer allocated in G1 device memory; and that the current context C0 is a context for G0.

Under these circumstances, is it legitimate to cuMemcpy() from my buffer to host memory, despite it having been allocated in a different context for a different device?

So far, I've been working under the assumption that the answer is "yes". But I've recently experienced some behavior which seems to contradict this assumption.

score 1 · Answer 1 · answered Oct 08 '22 at 18:59

Calling cuMemcpy from another context is legal, regardless of which device the context was created on. Depending on which case you are in, I recommend the following:

If this is a multi-threaded application, double-check your program and make sure you are not releasing your device memory before the copy is completed
If you are using the cuMallocAsync/cuFreeAsync API to allocate and/or release memory, please make sure that operations are correctly stream-ordered
Run compute-sanitizer on your program

If you keep experiencing issues after these steps, you can file a bug with NVIDIA here.

Surely this only applies on unified memory platforms? – talonmies Oct 09 '22 at 03:42 — talonmies, Oct 09 '22 at 03:42
UVA is expected/needed – Robert Crovella Oct 10 '22 at 02:56 — Robert Crovella, Oct 10 '22 at 02:56

Does cuMemcpy "care" about the current context?

1 Answers1