How the above function perform operation in cuda . Do we need to write CudaMalloc() or MemCopy() along with this or just a call with internally do all this.
I wrote
nppiResizeSqrPixel_32f_C4R(&in[0],sizeofImage,StepSize,&out[0],StepSizeOutput,DestRoi,Xfactor,YFactor,NULL,NULL,16);
Here 'in' is vector having input image and 'out' is an empty vector . But after executing the above function the output vector still '0' . Can you please guide me how the function resizes.