Questions tagged [openmp]

OpenMP is a cross-platform multi-threading API which allows fine-grained task parallelization and synchronization using special compiler directives.

OpenMP offers easy access to multi-threading without requiring knowledge of system-dependent details. At the same time, it is reasonably efficient compared to hand-tuned implementations, while being far easier to write. Forums and complete information on OpenMP are at https://openmp.org/.

OpenMP is based on the multi-threading model and offers shared-memory parallelism, as well as heterogeneous programming for coprocessors, through compiler directives, library routines and environment variables. It is restricted to C/C++ and Fortran applications, but provides portability across different shared-memory architectures.

It is through directives, added by the programmer to the code, that the compiler introduces parallelism into the application. A compiler without OpenMP support simply ignores the directives and compiles a sequential program, so the same source works on both single-core and multi-core machines, which promotes portability.

The latest version is 5.2 (November 2021): official OpenMP specifications.

Definitive Book Guide

Helpful links

6462 questions
2
votes
1 answer

Is collapse clause with non-rectangular loops allowed by the OpenMP 5.1 Spec?

Consider the following OpenMP code: #pragma omp target teams distribute parallel for collapse(4) map(tofrom: a) private(i,j,k,l) for (i = 0; i < SIZE_N; i++) { for (j = 0; j < SIZE_M; j++) { for (k = i; k < SIZE_N; k++) { for (l = 0; l…
2
votes
1 answer

What loop size to multithread?

Imagine a simple loop: constexpr int N; // some big number #pragma omp parallel for for(int i=0; i
Bbllaaddee
  • 145
  • 1
  • 9
2
votes
0 answers

Running Fortran OpenMP codes with OpenMP Tools interface

I'm trying to run a simple Fortran OpenMP code with a library using the OpenMP Tools Interface (OMPT). I have this working with a C++ code using clang + llvm openmp runtime, just by doing OMP_TOOL_LIBRARIES=/home/path/to/libotter.so…
LonelyCat
  • 43
  • 7
2
votes
1 answer

Where to see what OMP schedule(auto) picks?

Is there a way to find out what scheduling scheme the OMP runtime chooses for schedule(auto)? I found that (and intuitively it makes sense) for my problem schedule(static) is the fastest, so I am wondering if that's what the runtime chooses when is…
Marcel Braasch
  • 1,083
  • 1
  • 10
  • 19
2
votes
1 answer

Can I deallocate a shared variable by a single thread using OpenMP?

I am using OpenMP in order to parallelize a code. Here is the most important part of the code according to the question that I will ask: !$OMP PARALLEL PRIVATE(num_thread) & !$OMP…
hakim
  • 139
  • 15
2
votes
0 answers

How to use two nodes for one OpenMp Fortran90 code in SLURM Cluster?

I am completely new to using SLURM on a cluster. I am now struggling with OpenMP Fortran 90. I try to calculate integrals using two nodes (node1 and node2) through SLURM. What I want is to return one value by combining the calculations of node 1 and node…
Goring
  • 21
  • 2
2
votes
1 answer

Speed up and scheduling with OpenMP

I'm using OpenMP for a kNN project. The two parallelized for loops are: #pragma omp parallel for for(int ii=0;ii
2
votes
1 answer

error: reduction variable is private in outer context (omp reduction)

I am confused about the data sharing scope of the variable acc in the following two cases. In case 1 I get the following compilation error: error: reduction variable ‘acc’ is private in outer context, whereas case 2 compiles without any…
Misslinska
  • 82
  • 7
2
votes
1 answer

Confused about OMP_NUM_THREADS and numactl NUMA-cores bindings

I'm confused about how multiple launches of same python command bind to cores on a NUMA Xeon machine. I read that OMP_NUM_THREADS env var sets the number of threads launched for a numactl process. So if I ran numactl --physcpubind=4-7 --membind=0…
Joe Black
  • 625
  • 6
  • 19
2
votes
1 answer

Parallel code with OpenMP takes more time to execute than serial code

I'm trying to make this code run in parallel. It's a chunk of code from a big project. I thought I'd start parallelizing slowly to see if there is a problem step by step (I don't know if that's a good tactic, so please let me know). double…
2
votes
1 answer

How to parallelise a code inside a while using OpenMP

I am trying to parallelise the heat_plate algorithm but I am stuck at this bit of code inside my while: while(1) { ..... ..... #pragma omp parallel shared(diff, u, w) private(i, j, my_diff) { my_diff = 0.0; #pragma omp for for (i = 1; i <…
D K
  • 23
  • 5
2
votes
1 answer

openmp Linker flags in MSVC

When I try to compile my project in MSVC2008 with the linker flag (Configuration properties>>Linker>>Command line>>Additional options) set to: "/STACK:10000000 /machine:x64 /openmp" it warns me that the /openmp flag is unknown. "LINK : warning…
Nima Nouri
  • 21
  • 1
  • 2
2
votes
1 answer

When should I overlook critical sections, and when is nowait needed? OpenMP

I am studying OpenMP and I have some questions that I believe will clear up my thoughts. I have a small example of a matrix multiplication A*B where A,B,C are global variables. I know how we can parallelize the for loops one at a time or both…
gregni
  • 417
  • 3
  • 12
2
votes
1 answer

Why would executing a function in parallel significantly slow down the program?

I am trying to parallelize a code using OpenMP, the serial time for my current input size is around 9 seconds, I have a code of the following form: int main() { /* do some stuff*/ myfunction(); } void myfunction() { for (int i=0; i
Sergio
  • 275
  • 1
  • 15
2
votes
1 answer

C++ call to LAPACKE run on a single thread while NumPy uses all threads

I wrote a C++ code whose bottleneck is the diagonalization of a possibly large symmetric matrix. The code uses OpenMP, CBLAS and LAPACKE C-interfaces. However, the call on dsyev runs on a single thread both on my local machine and on a HPC cluster…
Toool
  • 361
  • 3
  • 18