I run MAGMA testing_dgemm
code both on V100 and H100 GPU. With Nsight Systems, I found that on the V100 the code doesn't use tensor cores, but code on the H100 it does.
V100 result:
H100 result:
The tensor core has been used in Volta GPU according to NVIDIA web.
The NVIDIA Inside Volta blog seems not to mention the FP64 TC performances.