Difference between revisions of "How to Use OpenMP"
Revision as of 15:28, 4 April 2018
Basics
This will give you a general overview of how to compile and execute a program that has been parallelized with OpenMP. As opposed to MPI, you do not have to load any modules to use OpenMP.
How to Compile OpenMP Code
Additional compiler flags tell the compiler to enable OpenMP. Otherwise, the OpenMP pragmas in the code will be ignored by the compiler.
Depending on which compiler you have loaded, use one of the flags below to compile your code.
Compiler | Flag |
GNU | -fopenmp |
Intel | -qopenmp (older releases: -openmp) |
Oracle | -xopenmp |
For example, to compile OpenMP code written in C with an Intel compiler and create an application called "omp_code.exe", type:
$ icc -qopenmp omp_code.c -o omp_code.exe
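The effect of the flag can be checked with a small experiment. The sketch below assumes that the GNU compiler (gcc) is available and uses a hypothetical test file, omp_hello.c, which is not part of this guide. It builds the same source with and without -fopenmp and counts how many threads execute the parallel region; the thread count of 4 is an arbitrary choice.

```shell
# Work in a scratch directory so nothing in the current directory is touched.
workdir=$(mktemp -d)
cd "$workdir"

# omp_hello.c is a hypothetical test program: every thread that runs
# the parallel region adds 1 to "count".
cat > omp_hello.c <<'EOF'
#include <stdio.h>
int main(void) {
    int count = 0;
    #pragma omp parallel reduction(+:count)
    count += 1;
    printf("%d\n", count);
    return 0;
}
EOF

serial_count="skipped"
parallel_count="skipped"
if command -v gcc >/dev/null 2>&1; then
    # Without the flag the pragma is ignored and the code runs on one thread.
    gcc omp_hello.c -o omp_hello_serial.exe &&
        serial_count=$(./omp_hello_serial.exe)
    # With the flag the region is executed by a team of threads.
    gcc -fopenmp omp_hello.c -o omp_hello_parallel.exe &&
        parallel_count=$(OMP_NUM_THREADS=4 ./omp_hello_parallel.exe)
fi
echo "threads without -fopenmp: $serial_count"
echo "threads with -fopenmp:    $parallel_count"
```

If the two numbers are identical, the OpenMP flag most likely did not reach the compiler.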
How to Run an OpenMP Application
Setting OMP_NUM_THREADS
If you do not set OMP_NUM_THREADS, the default value of your cluster environment is used. In most cases, this default is 1, so your program is executed serially.
One way to specify the number of threads is to set the variable directly in front of the command that runs the executable. To start the parallel regions of the example program above with 12 threads, type:
$ OMP_NUM_THREADS=12 ./omp_code.exe
This sets the environment variable OMP_NUM_THREADS to 12 for this one execution of "omp_code.exe" only. Your shell environment is not changed, so subsequent runs use the default value again.
Another way to set the number of threads is to change the environment variable in your shell. This example sets it to 24 threads, overriding the default value:
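This scoping can be checked without an OpenMP program. In the small sketch below, sh -c 'echo ...' is only a stand-in for ./omp_code.exe, and the names per_command and afterwards are made up for illustration:

```shell
# Start from a shell in which the variable is not set.
unset OMP_NUM_THREADS

# The prefix assignment applies to this one command only.
per_command=$(OMP_NUM_THREADS=12 sh -c 'echo "$OMP_NUM_THREADS"')

# Back in the calling shell, the variable is still unset.
afterwards=${OMP_NUM_THREADS:-unset}

echo "inside the command: $per_command"
echo "in the shell afterwards: $afterwards"
```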
$ export OMP_NUM_THREADS=24
If you simply run your application with $ ./omp_code.exe next, this value will be used automatically.
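An exported value is inherited by every program started from the same shell, while a value placed directly in front of a command still takes precedence for that single run. A brief sketch, with sh -c 'echo ...' as a stand-in for ./omp_code.exe and with illustrative variable names:

```shell
export OMP_NUM_THREADS=24

# Every later command started from this shell inherits the exported value.
first_run=$(sh -c 'echo "$OMP_NUM_THREADS"')

# A prefix assignment overrides it for one command only.
one_off=$(OMP_NUM_THREADS=4 sh -c 'echo "$OMP_NUM_THREADS"')

# The exported value is untouched by the one-off override.
second_run=$(sh -c 'echo "$OMP_NUM_THREADS"')

echo "$first_run $one_off $second_run"
```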
Thread Pinning
Thread pinning binds each thread to a specific place on the machine and is controlled by setting certain OpenMP-related environment variables. It is an advanced way to control how your system distributes the threads across the available cores.
OMP_PLACES specifies the places on the machine to which threads can be assigned. On its own, however, this variable does not determine the pinning completely, because your system still does not know in what pattern to assign the threads to the given places. Therefore, you also need to set OMP_PROC_BIND.
OMP_PROC_BIND specifies the binding policy, that is, the criteria by which the threads are distributed over the places.
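As a minimal sketch, the two variables might be combined as follows. The values cores and close are only one possible choice among those defined by the OpenMP specification, and the thread count of 4 is arbitrary. OMP_DISPLAY_ENV is an additional standard variable (OpenMP 4.0 and later) that this page does not otherwise cover; it makes the runtime print the settings it actually uses when the program starts.

```shell
export OMP_NUM_THREADS=4      # arbitrary thread count for this example
export OMP_PLACES=cores       # one place per physical core
export OMP_PROC_BIND=close    # keep threads on places near the parent thread
export OMP_DISPLAY_ENV=true   # report the effective settings at startup

# Run the application from the compile example, if it has been built.
if [ -x ./omp_code.exe ]; then
    ./omp_code.exe
fi
```

With OMP_PROC_BIND=spread, the same threads would instead be distributed as evenly as possible over the available places.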