Questions tagged [prefix-sum]

63 questions
-1
votes
2 answers

CUDA Paralell prefix sum error

I'm trying to implement a three phase parallel scan as described in chapter 8 of Programming Massively Parallel Processors 3rd edition (there are any line of code but only instructions). This algorithm allow to use only 1 block with the maximum…
sgiraz
  • 141
  • 2
  • 11
-1
votes
1 answer

maximum supported size for cub library

Does anyone know what is the maximum supported size for cub::scan ? I got core dump for input sizes over 500 million. I wanted to make sure I'm not doing anything wrong... Here is my code: #define CUB_STDERR #include #include…
JacobS
  • 11
  • 3
-2
votes
1 answer

Multiple pointer accesses cause failure of printf in CUDA

I'm currently trying to implement the Blelloch Algorithm in CUDA. I want to use printf for debugging but it has this weird behaviour that it won't work when I'm accessing different positions of the same array in a Kernel. cudaDeviceSynchronize() is…
1 2 3 4
5