I have a RX 570, These are the information i received from clGetDeviceInfo
MaxComputeUnitPerGPU: 32
MaxWorkGroupSize: 256
MaxWorkItemSize: 256
MaxGlobalMemoryOfDevice: 4294967296
MaxPrivateMemoryBytesPerWorkGroup: 16384
MaxLocalMemoryBytesPerWorkGroup: 32768
If I have 256 Work Groups and 256 Work Items per Work Group It would mean that
64 Bytes Of Private(l1?) Memory per work Item(16384/256)
32768 Bytes Of Local(l2) Memory per work Group
And if I use 17 floats would it overflow to L2?
or
If I use 15 float, and 2 private float would it overflow to L2?
also is float the same as private float? Answer: Same by default, By @doqtor
or
If I use 16 float and use functions like pow, sqrt and clamp would registry(l1?) overflow occur?