Questions tagged [cub]

CUB provides state-of-the-art, reusable software components for every layer of the CUDA programming model.

CUB (CUDA UnBound) is a C++ template library of components for use on NVIDIA GPUs running CUDA.

CUB includes common data parallel operations such as prefix scan, reduction, histogram and sort. CUB's collective primitives are not bound to any particular width of parallelism or to any particular data type and can be used at device, block, warp or thread scope.
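As a rough illustration of the device-scope primitives mentioned above, here is a minimal sketch of an inclusive prefix sum using `cub::DeviceScan::InclusiveSum`. The wrapper name `inclusive_sum` and the `CHECK_CUDA` macro are hypothetical helpers introduced only for this example; the sketch assumes a CUDA-capable GPU and an `int` input that fits in device memory.

```cuda
#include <cub/cub.cuh>
#include <cuda_runtime.h>
#include <cstdio>
#include <cstdlib>
#include <vector>

// Hypothetical helper (not part of CUB): abort on any CUDA error.
#define CHECK_CUDA(call)                                                  \
  do {                                                                    \
    cudaError_t err_ = (call);                                            \
    if (err_ != cudaSuccess) {                                            \
      std::fprintf(stderr, "CUDA error: %s\n", cudaGetErrorString(err_)); \
      std::exit(EXIT_FAILURE);                                            \
    }                                                                     \
  } while (0)

// Hypothetical wrapper: copies the input to the GPU, runs a device-wide
// inclusive prefix sum with CUB, and copies the result back to the host.
std::vector<int> inclusive_sum(const std::vector<int>& h_in) {
  const int num_items = static_cast<int>(h_in.size());
  std::vector<int> h_out(h_in.size());
  if (num_items == 0) return h_out;

  const size_t bytes = num_items * sizeof(int);
  int* d_in = nullptr;
  int* d_out = nullptr;
  CHECK_CUDA(cudaMalloc(&d_in, bytes));
  CHECK_CUDA(cudaMalloc(&d_out, bytes));
  CHECK_CUDA(cudaMemcpy(d_in, h_in.data(), bytes, cudaMemcpyHostToDevice));

  // First call with a null workspace pointer only queries the required
  // temporary storage size; no scan is performed yet.
  void* d_temp = nullptr;
  size_t temp_bytes = 0;
  CHECK_CUDA(cub::DeviceScan::InclusiveSum(d_temp, temp_bytes,
                                           d_in, d_out, num_items));
  CHECK_CUDA(cudaMalloc(&d_temp, temp_bytes));

  // Second call performs the actual scan using the allocated workspace.
  CHECK_CUDA(cub::DeviceScan::InclusiveSum(d_temp, temp_bytes,
                                           d_in, d_out, num_items));

  // cudaMemcpy on the default stream waits for the scan to finish.
  CHECK_CUDA(cudaMemcpy(h_out.data(), d_out, bytes, cudaMemcpyDeviceToHost));

  CHECK_CUDA(cudaFree(d_temp));
  CHECK_CUDA(cudaFree(d_out));
  CHECK_CUDA(cudaFree(d_in));
  return h_out;
}
```

The two-call pattern (query the workspace size, then run) is how CUB's device-wide algorithms are typically invoked. Calling `inclusive_sum({1, 2, 3, 4})` should yield `{1, 3, 6, 10}`.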

It is used in the backend of other NVIDIA libraries, most prominently Thrust and RAPIDS.

CUB is developed by NVIDIA Research; its website and documentation are hosted at https://nvlabs.github.io/cub, and the most recent source code is available on GitHub. It has also been distributed with the CUDA Toolkit since at least CUDA 11.1.1 (the first version in which the CUB documentation is linked from the CUDA Toolkit documentation).

48 questions
-1 votes, 1 answer

Using both CUB and Thrust for parallel sum scan

I am trying to do a parallel sum scan on a test vector. I am using both the Thrust and CUB libraries for this purpose: struct CustomSum { template <typename T> CUB_RUNTIME_FUNCTION __forceinline__ T operator()(const T &a, const T &b) const { …
Gaara
-1 votes, 1 answer

maximum supported size for cub library

Does anyone know what the maximum supported size for cub::scan is? I got a core dump for input sizes over 500 million. I wanted to make sure I'm not doing anything wrong... Here is my code: #define CUB_STDERR #include #include…
JacobS
-2 votes, 1 answer

Compilation of CUB library with clang as the CUDA compiler

I set clang++-12 as the CUDA compiler in CMake as follows: cmake .. -DCMAKE_BUILD_TYPE=Debug \ -DCMAKE_CUDA_ARCHITECTURES="75" \ -DCMAKE_CUDA_COMPILER=clang++-12 \ -DCMAKE_CXX_COMPILER=clang++-12 -DCMAKE_C_COMPILER=clang-12 I try to compile a…
Serge Rogatch