I am presently learning CUDA and I keep coming across phrases like
"GPUs have dedicated memory which has 5–10X the bandwidth of CPU memory"
See here for reference on the second slide
Now what does bandwidth really mean here? Specifically, What does one mean by
- bandwidth of the CPU
- bandwidth of the GPU
- bandwidth of the PCI-E slot the GPU's are fitted onto the motherboard. My background in computer architecute is very poor, so if someone can give a very simple explanation of these terms, it will be really helpful.
My very very limited understanding of bandwidth is the highest possible number of gigabytes that can be trasnferred per second from the CPU to the GPU. But that does not explain why we need to define three types of bandwidth.