An LLVM-IR variant used as a higher-level intermediate language in CUDA.
NVVM is a variant of llvm-ir, used in the cuda GPU execution ecosystem as a higher-level intermediate-language; it gets further compiled into the lower-level intermediate language, ptx, and/or the GPU-microarchitecture-specific sass assembly.
A specification of NVVM is available as part of the CUDA documentation.