Openmp Application Programming Interface

OpenMP Application Programming Interface Version 4.5 November 2015 Copyright c 1997-2015 OpenMP Architecture Review Board. Permission to copy without fee all or part of this material is granted, provided the OpenMP Architecture Review Board copyright notice and the title of this document appear. Notice is given that copying is by permission of OpenMP Architecture Review Board. This page intentionally left blank in published version. Contents 1 Introduction1 1.1 Scope . .1 1.2 Glossary . .2 1.2.1 Threading Concepts . .2 1.2.2 OpenMP Language Terminology . .2 1.2.3 Loop Terminology . .8 1.2.4 Synchronization Terminology . .9 1.2.5 Tasking Terminology . .9 1.2.6 Data Terminology . 11 1.2.7 Implementation Terminology . 13 1.3 Execution Model . 14 1.4 Memory Model . 17 1.4.1 Structure of the OpenMP Memory Model . 17 1.4.2 Device Data Environments . 18 1.4.3 The Flush Operation . 19 1.4.4 OpenMP Memory Consistency . 20 1.5 OpenMP Compliance . 21 1.6 Normative References . 21 1.7 Organization of this Document . 23 2 Directives 25 2.1 Directive Format . 26 2.1.1 Fixed Source Form Directives . 28 2.1.2 Free Source Form Directives . 29 2.1.3 Stand-Alone Directives . 32 2.2 Conditional Compilation . 33 2.2.1 Fixed Source Form Conditional Compilation Sentinels . 34 i 2.2.2 Free Source Form Conditional Compilation Sentinel . 34 2.3 Internal Control Variables . 36 2.3.1 ICV Descriptions . 36 2.3.2 ICV Initialization . 37 2.3.3 Modifying and Retrieving ICV Values . 39 2.3.4 How ICVs are Scoped . 41 2.3.4.1 How the Per-Data Environment ICVs Work . 42 2.3.5 ICV Override Relationships . 43 2.4 Array Sections . 44 2.5 parallel Construct . 46 2.5.1 Determining the Number of Threads for a parallel Region . 50 2.5.2 Controlling OpenMP Thread Aﬃnity . 52 2.6 Canonical Loop Form . 53 2.7 Worksharing Constructs . 56 2.7.1 Loop Construct . 56 2.7.1.1 Determining the Schedule of a Worksharing Loop . 64 2.7.2 sections Construct . 65 2.7.3 single Construct . 67 2.7.4 workshare Construct . 69 2.8 SIMD Constructs . 72 2.8.1 simd Construct . 72 2.8.2 declare simd Construct . 76 2.8.3 Loop SIMD Construct . 81 2.9 Tasking Constructs . 83 2.9.1 task Construct . 83 2.9.2 taskloop Construct . 87 2.9.3 taskloop simd Construct . 91 2.9.4 taskyield Construct . 93 2.9.5 Task Scheduling . 94 2.10 Device Constructs . 95 2.10.1 target data Construct . 95 2.10.2 target enter data Construct . 97 2.10.3 target exit data Construct . 100 ii OpenMP API – Version 4.5 November 2015 2.10.4 target Construct . 103 2.10.5 target update Construct . 107 2.10.6 declare target Directive . 110 2.10.7 teams Construct . 114 2.10.8 distribute Construct . 117 2.10.9 distribute simd Construct . 119 2.10.10 Distribute Parallel Loop Construct . 121 2.10.11 Distribute Parallel Loop SIMD Construct . 122 2.11 Combined Constructs . 124 2.11.1 Parallel Loop Construct . 124 2.11.2 parallel sections Construct . 125 2.11.3 parallel workshare Construct . 127 2.11.4 Parallel Loop SIMD Construct . 128 2.11.5 target parallel Construct . 129 2.11.6 Target Parallel Loop Construct . 131 2.11.7 Target Parallel Loop SIMD Construct . 132 2.11.8 target simd Construct . 134 2.11.9 target teams Construct . 135 2.11.10 teams distribute Construct . 136 2.11.11 teams distribute simd Construct . 137 2.11.12 target teams distribute Construct . 139 2.11.13 target teams distribute simd Construct . 140 2.11.14 Teams Distribute Parallel Loop Construct . 141 2.11.15 Target Teams Distribute Parallel Loop Construct . 142 2.11.16 Teams Distribute Parallel Loop SIMD Construct . 144 2.11.17 Target Teams Distribute Parallel Loop SIMD Construct . 145 2.12 if Clause . 147 2.13 Master and Synchronization Constructs and Clauses . 148 2.13.1 master Construct . 148 2.13.2 critical Construct . 149 2.13.3 barrier Construct . 151 2.13.4 taskwait Construct . 153 2.13.5 taskgroup Construct . 153 Contents iii 2.13.6 atomic Construct . 155 2.13.7 flush Construct . 162 2.13.8 ordered Construct . 166 2.13.9 depend Clause . 169 2.14 Cancellation Constructs . 172 2.14.1 cancel Construct . 172 2.14.2 cancellation point Construct . 176 2.15 Data Environment . 178 2.15.1 Data-sharing Attribute Rules . 179 2.15.1.1 Data-sharing Attribute Rules for Variables Referenced in a Construct179 2.15.1.2 Data-sharing Attribute Rules for Variables Referenced in a Region but not in a Construct . 183 2.15.2 threadprivate Directive . 183 2.15.3 Data-Sharing Attribute Clauses . 188 2.15.3.1 default Clause . 189 2.15.3.2 shared Clause . 190 2.15.3.3 private Clause . 192 2.15.3.4 firstprivate Clause . 196 2.15.3.5 lastprivate Clause . 199 2.15.3.6 reduction Clause . 201 2.15.3.7 linear Clause . 207 2.15.4 Data Copying Clauses . 211 2.15.4.1 copyin Clause . 211 2.15.4.2 copyprivate Clause . 213 2.15.5 Data-mapping Attribute Rules and Clauses . 215 2.15.5.1 map Clause . ..

Openmp Application Programming Interface

Heterogeneous Task Scheduling for Accelerated Openmp

Threads Chapter 4

Lecture 10: Introduction to Openmp (Part 2)

The Problem with Threads

Parallel Programming

Openmp API 5.1 Specification

Openmp Made Easy with INTEL® ADVISOR

Introduction to Openmp Paul Edmon ITC Research Computing Associate

Modifiable Array Data Structures for Mesh Topology

Parallel Programming with Openmp

CUDA Binary Utilities

Lithe: Enabling Efficient Composition of Parallel Libraries