CSV_REFS performs properly when the diskspd test is run on the disk’s Owner Node. Latency increases 35x for 64k blocks when the test is run on any other node in the 4-node cluster. I can switch the owner node around and run the test on the new owner and I will continue to get good performance. When I run the test from a non disk-owner, the results are poor. CSV_NTFS performs strong regardless of the node in which it runs. I’m considering giving up on CSV_REFS for CSV_NTFS because of this observation.
I’m running Windows Server 2019.
I have considered that RDMA may be the problem, but I can’t find any evidence that I’m having RDMA issues. The logs are clean, test-rdma.ps1 runs fine.
Does anyone have any thoughts as to why this would occur?