My question is quite simple: if a machine's CPU or OS fails, is its memory still usable via Remote Direct Memory Access (RDMA) from a remote machine?

HuangJie

1 Answer

This really depends on the kind of failure the system is experiencing. If the OS fails, it is possible that a host channel adapter (HCA) implementing RDMA can continue performing DMA on behalf of remote requests against memory that was already registered. It won't be possible, though, to register new memory without cooperation from the OS and its driver.

If the CPU fails, it might be impossible for the HCA to access memory, as all memory accesses go through the PCIe root complex and the memory controller on the CPU.
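
To make the two cases concrete, here is a toy model in Python. It is only a sketch of the reasoning above, not real RDMA code: `ToyHost`, `HostDown`, `register_memory`, and `remote_read` are hypothetical names invented for illustration, and the model assumes the HCA keeps serving already-registered regions after an OS failure, which on real hardware depends on the kind of failure.

```python
class HostDown(Exception):
    """Raised when a request needs a component that has failed."""


class ToyHost:
    """Toy model of a host with an RDMA-capable HCA (not a real API)."""

    def __init__(self, memory_size):
        self.memory = bytearray(memory_size)
        self.os_alive = True
        self.cpu_alive = True
        self.regions = {}       # rkey -> (offset, length), held by the HCA
        self._next_rkey = 1

    def register_memory(self, offset, length):
        # Registration programs the HCA's translation tables, which
        # needs the OS and its driver to be running.
        if not (self.os_alive and self.cpu_alive):
            raise HostDown("cannot register memory without a running OS")
        if offset < 0 or length <= 0 or offset + length > len(self.memory):
            raise ValueError("region out of bounds")
        rkey = self._next_rkey
        self._next_rkey += 1
        self.regions[rkey] = (offset, length)
        return rkey

    def remote_read(self, rkey, offset, length):
        # Served by the HCA alone, but the DMA still crosses the PCIe
        # root complex and memory controller, modeled as part of the CPU.
        if not self.cpu_alive:
            raise HostDown("memory path through the CPU is unavailable")
        base, size = self.regions[rkey]
        if offset < 0 or length < 0 or offset + length > size:
            raise ValueError("access outside registered region")
        start = base + offset
        return bytes(self.memory[start:start + length])


host = ToyHost(64)
host.memory[0:5] = b"hello"
rkey = host.register_memory(0, 16)

host.os_alive = False                  # simulate an OS failure
print(host.remote_read(rkey, 0, 5))    # b'hello'
```

After `os_alive` is cleared, a further `register_memory` call raises `HostDown` while reads through the existing `rkey` still succeed. Clearing `cpu_alive` makes `remote_read` fail as well, mirroring the second case.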

haggai_e