Questions tagged [blas]

The Basic Linear Algebra Subprograms are a standard set of interfaces for low-level vector and matrix operations commonly used in scientific computing.

A reference implementation is available at NetLib; optimized implementations are also available for all high-performance computing architectures, for example:

The BLAS routines are divided into three levels:

  • Level 1: vector operations e.g. vector addition, dot product
  • Level 2: matrix-vector operations e.g. matrix-vector multiplication
  • Level 3: matrix-matrix operations e.g. matrix multiplication
906 questions
0
votes
1 answer

Sparse Blas in Fortran 95

I want to use the Sparse Blas in Fortran95 just for the creation of the matrices and I am using the point entry construction. After creation of the matrix using the command call duscr_begin(n,n,a,istat) here a is the handle to the matrix n by n.…
0
votes
1 answer

Serial version slower than parallel with MKL_DYNAMIC=TRUE

I have implemented Conjugate Gradient in FORTRAN by replacing the Linear Algebra subroutines in the wikipedia example by (Fortran) Intel MKL subroutines. (DGEMV, DAXPY and DNRM only. It turns out that a=b is faster than DCOPY and a=2*a is faster…
user1132648
0
votes
2 answers

What is the alternative method for Diag(k) in C++?

I'm converting a matlab code to C++ code and trying to find a function in Lapack, Blas or even Atlas that does the same job at Diag(k) function in matlab ? Any suggestions ?
Mohamed Taher Alrefaie
  • 15,698
  • 9
  • 48
  • 66
-1
votes
1 answer

Why does the magma_dgemm function not use tensor cores on the V100 GPU?

I run MAGMA testing_dgemm code both on V100 and H100 GPU. With Nsight Systems, I found that on the V100 the code doesn't use tensor cores, but code on the H100 it does. V100 result: H100 result: The tensor core has been used in Volta GPU according…
ingridli
  • 5
  • 2
-1
votes
1 answer

LAPACK different outputs when using solvers for banded matricies

I've been stuck at this for hours and hoping someone can figure out what I am missing. I am solving Ax=B firstly using DGESV which I am 99% sure is correct. Then I am puting A into a banded form and using DGBSV and getting a different output. Here…
-1
votes
1 answer

NVBLAS through CBLAS

I have a C++ code and I used CBLAS to compute dgemm and dtrsm. I am interested to use GPUs for performance. With the tests I have done I could use NVBLAS using nvblas.h, however it is not close to CBLAS. I think I can change my code to call dgemm…
Aznaveh
  • 558
  • 8
  • 27
-1
votes
1 answer

ImportError: DLL load failed while importing _fblas, when running Python from Visual Studio C# via CFFI

I'm trying to run an image processing Python script from Visual Studio C# using CFFI. I have compiled the Python script feature-extraction.py into the DLL feature-extraction.dll, which I wrap into C# code via…
Bogdan
  • 51
  • 4
-1
votes
2 answers

CUDA CSR Matrix-Matrix product transpose by itself

I have a very large, very sparse least-squares design matrix (A), which I would like to multiply by itself, as follows: N = A^T * A, where A & N are stored in CSR format. Obviously, A has more rows than columns. I normally form N directly…
Dar Cos
  • 87
  • 1
  • 1
  • 3
-1
votes
1 answer

CUBLAS Sgemm confusing results

For two matrices X and Q of size 4x3 and 2x3 which in memory look like x = [0 1 2 3 4 5 6 7 8 9 10 11] q = [3 4 5 6 7 8] I tried to use cublas multiplication cublasSgemm, but I couldn't manage to get expected results. Since they are stored in…
zinsek
  • 9
  • 2
-1
votes
2 answers

Compiling Blas on ubuntu

I'm trying to run the testblas.c example from this website: http://www.seehuhn.de/pages/linear#installation I installed liblapack3 and libblas-common, but I couldn't find anything for atlas in the repositories. Compiling it will result in: gcc…
Stein
  • 3,179
  • 5
  • 27
  • 51
-1
votes
2 answers

Lapack set internal FORTRAN parameter in C

IF you look at this link, you'll see that MAXITR is an internal parameter of the dbsqr FORTRAN LAPACK routine. I have found this link, to call FORTRAN functions in C, but how do I set MAXITR? It doesn't seem to be a parameter. Is there any way to do…
Syd Kerckhove
  • 783
  • 5
  • 19
-1
votes
1 answer

Can you use cublasDdot() to use blas operations in non-GPU memory?

So I have a code that performs matrix multiplicaiton, but the problem is it returns just zeroes when I use the library -lcublas and the compiler nvcc; however, the code runs great with just a few tweaks to function names when I use the compiler, g++…
Mechy
  • 259
  • 1
  • 4
  • 14
-1
votes
1 answer

Fortran 77 with BLAS - can't figure out how to compile

I'm trying to get BLAS working with in a FORTRAN 77 program, but so far I've been unsuccesful and I can't figure out how to get going with this. For reference I'm doing this under Ubuntu 12.10. This is the code of the program I'm trying to…
Tomas Aschan
  • 58,548
  • 56
  • 243
  • 402
-2
votes
1 answer

Efficient sum, vector-matrix-vector by Fortran

I want to compute the summation more efficient. There are nested loops, which I want to avoid. ! If i+j <= I,then A_{j} = \sum_{m,n,i} C_{m, i+j} G_{ m, n, i, j} C_{n, i} ! else if i+j >= I, then A_{j} = \sum_{m,n,i} C_{m, i+j-I} G_{ m, n, i, j}…
Mao Yang
  • 3
  • 2
-2
votes
1 answer

Could not install ChatterBot. Raises errors

i tried to install chatterbot but i got the following error I tried downgrading to chatterbot==1.0.5 still it raises errors Gives errors while Installing dependencies Please help me No Silly answers Thank you in…
1 2 3
60
61