When I use multi gpu to train on MXNet(CUDA8.0+cudnn7), I firstly initialize parameters on different context, then I perform scatter_nd on different contexts, the first time scatter nd would work perfectly, but when compute for the second gpu card, I got
F1217 23:53:01.012707 2619 stream_gpu-inl.h:62] Check failed: e == cudaSuccess CUDA: an illegal memory access was encountered