According to the Kepler architecture whitepaper, an SMX has 192 CUDA cores and 64 Double Precision Units (DPUs). A K20Xm has 14 SMXs, for a total of 2688 cores, which means that only the CUDA cores are counted. What exactly are the DPUs used for, then, and how is their usage related to the cores?
My thoughts:
a) The CUDA cores can't do double precision operations and only the DPUs can. Therefore, the CUDA cores are free for other stuff while the DPUs are busy.
b) The CUDA cores somehow need a double precision unit to do double precision operations, therefore only 128 of the 192 CUDA cores are available for other stuff.
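To make the numbers above explicit, here is a small sketch of the arithmetic. The variable names are my own, the per-SMX figures are the ones quoted from the whitepaper, and the last line only encodes the assumption in b) (one CUDA core tied up per busy DPU), which is exactly what I am unsure about.

```python
# Per-SMX figures as quoted above from the Kepler whitepaper.
CORES_PER_SMX = 192
DPUS_PER_SMX = 64
SMX_COUNT_K20XM = 14  # number of SMXs on a K20Xm

# Totals across the whole chip.
total_cores = SMX_COUNT_K20XM * CORES_PER_SMX
total_dpus = SMX_COUNT_K20XM * DPUS_PER_SMX

# Hypothesis b) only: ASSUMES each busy DPU occupies one CUDA core.
# This is an unconfirmed guess, not documented behaviour.
free_cores_per_smx_b = CORES_PER_SMX - DPUS_PER_SMX

print("total CUDA cores:", total_cores)
print("total DPUs:", total_dpus)
print("free cores per SMX under b):", free_cores_per_smx_b)
```

So the advertised 2688 matches the CUDA cores alone, and the DPUs would be an additional 896 units that do not show up in that count.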
Cheers Andi