I have been testing the CuPy library and done a simple matrix multiplication using einsum:
C = cp.einsum('pqrs,rs->pq', A, B)
Dimensions of A and B are, (41, 41, 41, 41) (41, 41), receptively. I also checked their sizes, which are 22606088 bytes, 13448 bytes.
While running the code, I am getting the following error message:
OutOfMemoryError: out of memory to allocate 38000834048 bytes (total 38023468032 bytes)
It indicates that I am running out of memory. Is there any option to sent data partially to the device and perform operations in terms of batches?