Questions tagged [magma]

MAGMA (Matrix Algebra on GPU and Multicore Architectures) is a collection of linear algebra libraries for heterogeneous processor architectures. If you have questions about the MAGMA computational algebra system, please use the [magma-ca] tag.

Designed by a team at the Innovative Computing Laboratory at the University of Tennessee, MAGMA allows applications to exploity the power of heterogeneous systems of multi-core CPU's and multi-GPU's to deliver the most accurate solution in the fastest time within specified energy constraints.

42 questions
0
votes
1 answer

Special way to handle matrices with large dynamic range for cuSolverSp routines?

I have matrices like the one at this link: https://www.dropbox.com/s/tte3rlfsrprgtt8/ExampleMatrix.txt?dl=0 This example is a 9x9 sparse matrix where the values of the elements differ in magnitude significantly. For example the magnitude of the…
0
votes
1 answer

Extract upper triangular matrix from the result of "magma_dgeqrf2_gpu" (which is a general matrix) without transferring to host

I'm building a kernel which among other things uses the Magma function magma_dgeqrf2_gpu to perform a QR factorization. This outputs the upper triangular matrix R into a general matrix d_A on the GPU device. Without transferring d_A back to host…
Jim
  • 317
  • 1
  • 13
0
votes
0 answers

CUDA lapack librairies (CULA & MAGMA) as device functions

Could calls to lapack routines in CUDA (CULA or MAGMA) be instantiated within a CUDA kernel and not from the Host? (__device functions and not __global functions) If it is not possible, how to therefore implement (_device) linear algebra routines…
Didon
  • 383
  • 2
  • 4
  • 13
0
votes
1 answer

using magma_dysevd in mex file matlab

I try to write use magma library in matlab, so basically I write a mexfunction which incorporate c code using magma function and then compile this mexfunction into mexa64 file, thus I could use in matlab. The mexfunction or source c code is…
yuxin
  • 1
  • 1
0
votes
1 answer

CMake thinks Intel Composer XE 2013 is broken

I'm trying to compile Magma-1.4.1 for Microsoft Visual Studio Professional 2013 on Windows 7-64 bit using CMake 2.8.12.2. I also installed the Intel Fortran Compiler. I open CMake -> I specify the path for the source code and for the binaries -> I…
-1
votes
1 answer

Why does the magma_dgemm function not use tensor cores on the V100 GPU?

I run MAGMA testing_dgemm code both on V100 and H100 GPU. With Nsight Systems, I found that on the V100 the code doesn't use tensor cores, but code on the H100 it does. V100 result: H100 result: The tensor core has been used in Volta GPU according…
ingridli
  • 5
  • 2
-1
votes
1 answer

double free or corruption (out) on ipiv magma_getrs_gpu

I have the following code: magma_int_t *h_ipiv; magma_imalloc_cpu( &h_ipiv, k); for (i=0;i
eldev09
  • 39
  • 8
-1
votes
1 answer

Running Magma library

I'm trying to run one of the examples given with the source. for the MAGMA library (http://icl.cs.utk.edu/magma/) on my GPU. I installed MAGMA through the arch user repository and after setting the paths correctly I can built it fine: make all …
Stein
  • 3,179
  • 5
  • 27
  • 51
-1
votes
1 answer

cuda magma matrix-matrix addition kernel

I tried using similar format as magmablas_sgeadd_q kernel, however I am not getting proper outputs, moreover every time I run it, I get a different output. The code that I used is given below: #include #include #include…
aish
  • 13
  • 1
  • 1
  • 4
-2
votes
1 answer

Decreasing performance by using cuda kernel inside loop

I was trying to improve the performance of a slow code. That code used cblas and i was trying to upgrade the performance by using magma and cuda. First i just passed cblas calls to magma. But it needs CPU <-> GPU copies inside the loop and so it…
eldev09
  • 39
  • 8
-2
votes
1 answer

CUDA Library for Computing Kronecker Product

I have an application that requires me to calculate some large Kronecker products of 2D matrices and multiply the result by large 2D matrices. I would like to implement this on a GPU in CUDA and would prefer to use a tuned library implementation…
Michael Puglia
  • 145
  • 2
  • 9
-3
votes
1 answer

Magma 2.0 Error: CUDA driver version is insufficient for CUDA runtime version

While running the program to add two simple matrix I get following error CUDA runtime error: CUDA driver version is insufficient for CUDA runtime version (35) in magma_getdevice at interface_cuda/interface.cpp:547 Error: magmablasGetQueue…
Aafaque Abdullah
  • 361
  • 3
  • 13
1 2
3