Suppose we have an ordinary OpenCV image stored in a Mat:
cv::Mat usual_image = cv::imread(...);
This image is stored in memory as an HWC/NHWC array.
Is it possible to copy this image into CUDA memory as CHW/NCHW (that is, with a separate array for each channel) without the very expensive cv::split?
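To make the two layouts concrete, here is a minimal host-side sketch of the index mapping I mean. It works on a plain byte buffer and uses neither OpenCV nor CUDA; the function name hwc_to_chw is made up for illustration, and I am assuming 8-bit data with no row padding:

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical helper (not an OpenCV or CUDA API): repack an interleaved
// HWC buffer into a planar CHW buffer on the host.
// Assumes tightly packed 8-bit data, i.e. no padding between rows.
std::vector<std::uint8_t> hwc_to_chw(const std::vector<std::uint8_t>& hwc,
                                     std::size_t height,
                                     std::size_t width,
                                     std::size_t channels) {
    assert(hwc.size() == height * width * channels);
    std::vector<std::uint8_t> chw(hwc.size());
    for (std::size_t h = 0; h < height; ++h) {
        for (std::size_t w = 0; w < width; ++w) {
            for (std::size_t c = 0; c < channels; ++c) {
                // HWC: channels of one pixel are adjacent in memory.
                const std::size_t src = (h * width + w) * channels + c;
                // CHW: each channel is its own contiguous H x W plane.
                const std::size_t dst = (c * height + h) * width + w;
                chw[dst] = hwc[src];
            }
        }
    }
    return chw;
}
```

For a 1x2 image with 3 channels, the interleaved input {1,2,3, 4,5,6} becomes the planar output {1,4, 2,5, 3,6}. What I am after is the same rearrangement, but done as part of the copy to the device rather than as a separate pass on the host.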
For illustration, here is how HWC and CHW differ: