Questions tagged [cuda-wmma]

The CUDA warp matrix functions use the "tensor core" functional units of NVIDIA GPUs to perform matrix multiply-add (MMA) operations in CUDA kernels.
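
As a rough illustration of what these functions look like in use, here is a minimal sketch in which a single warp multiplies one 16x16 tile of `half` values and accumulates into `float`. The kernel name `wmma_tile`, the all-ones test data, and the use of managed memory are assumptions made for the example; it needs a GPU with tensor cores (compute capability 7.0 or later) and something like `nvcc -arch=sm_70`.

```cuda
#include <cstdio>
#include <cuda_fp16.h>
#include <mma.h>

using namespace nvcuda;

// Hypothetical kernel: one warp computes C = A * B for a single 16x16x16 tile.
// A is row-major, B is column-major, C is stored row-major.
__global__ void wmma_tile(const half* a, const half* b, float* c) {
    wmma::fragment<wmma::matrix_a, 16, 16, 16, half, wmma::row_major> a_frag;
    wmma::fragment<wmma::matrix_b, 16, 16, 16, half, wmma::col_major> b_frag;
    wmma::fragment<wmma::accumulator, 16, 16, 16, float> c_frag;

    wmma::fill_fragment(c_frag, 0.0f);               // start from C = 0
    wmma::load_matrix_sync(a_frag, a, 16);           // leading dimension 16
    wmma::load_matrix_sync(b_frag, b, 16);
    wmma::mma_sync(c_frag, a_frag, b_frag, c_frag);  // C = A * B + C
    wmma::store_matrix_sync(c, c_frag, 16, wmma::mem_row_major);
}

int main() {
    const int n = 16;
    half* a = nullptr;
    half* b = nullptr;
    float* c = nullptr;
    cudaMallocManaged(&a, n * n * sizeof(half));
    cudaMallocManaged(&b, n * n * sizeof(half));
    cudaMallocManaged(&c, n * n * sizeof(float));

    // Illustrative data: with all-ones inputs, each element of C is a sum
    // of 16 products of 1 * 1.
    for (int i = 0; i < n * n; ++i) {
        a[i] = __float2half(1.0f);
        b[i] = __float2half(1.0f);
        c[i] = 0.0f;
    }

    wmma_tile<<<1, 32>>>(a, b, c);                   // exactly one warp
    cudaError_t err = cudaDeviceSynchronize();
    if (err != cudaSuccess) {
        std::printf("CUDA error: %s\n", cudaGetErrorString(err));
        return 1;
    }
    std::printf("c[0] = %f\n", c[0]);

    cudaFree(a);
    cudaFree(b);
    cudaFree(c);
    return 0;
}
```

All 32 threads of the warp must reach each `wmma` call together, which is why the launch uses exactly one full warp.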

1 question
1 vote, 1 answer

Warp Matrix-Multiply functions - are single-precision multiplicands supported?

In the CUDA Programming Guide, v11.7, section B.24.6, "Element Types & Matrix Sizes", there's a table of supported type combinations, in which the multiplicands are either sub-single-precision floating-point types or `double`, never `float`. But -…
einpoklum