In CUDA docs, specifically in CUDA Runtime API in section Device Management about cudaSetDevice
, it is written like this
Any host memory allocated from this host thread using cudaMallocHost() or cudaHostAlloc() or cudaHostRegister() will have its lifetime associated with device
So my question is here:
If I allocate a host memory using cudaHostAlloc
with current device is dev 0, and then I transfer that host memory to device memory in dev 1, is there any limitation or problem?