OpenMP : 16th International Workshop on OpenMP, IWOMP 2020, Austin, TX, USA, September 22-24, 2020, Proceedings / Kent Milfeld, Bronis R. de Supinski, Lars Koesterke, Jannis Klinkenberg (editors).
Material type:
TextSeries: Lecture notes in computer science ; 12295. | LNCS sublibrary. SL 2, Programming and software engineering.Publisher: Cham : Springer, 2020Description: 1 online resource (342 pages)Content type: - text
- computer
- online resource
- 9783030581442
- 3030581446
- IWOMP 2020
- OpenMP (Application program interface) -- Congresses
- OpenMP (Application program interface)
- Programming languages (Electronic computers) -- Congresses
- Application program interfaces (Computer software) -- Congresses
- Interfaces de programmation d'applications -- Congrès
- Application program interfaces (Computer software)
- Programming languages (Electronic computers)
- 005.1 23
- QA76.76.A63 I96 2020eb
| Item type | Current library | Collection | Call number | Status | Date due | Barcode | Item holds | |
|---|---|---|---|---|---|---|---|---|
eBook
|
e-Library | eBook LNCS | Available |
Print version record.
International conference proceedings.
"In 2020, IWOMP continued the series with technical papers and tutorials presented through a virtual conference, due to the COVID-19 pandemic."
Intro -- Preface -- Organization -- Contents -- Performance Methodologies -- FAROS: A Framework to Analyze OpenMP Compilation Through Benchmarking and Compiler Optimization Analysis -- 1 Introduction -- 2 FAROS: Design and Implementation -- 2.1 Harness and Configuration -- 2.2 Analyzing Compiler Optimization and Performance -- 3 Evaluation -- 4 Related Work -- 5 Conclusion and Future Work -- References -- Evaluating the Efficiency of OpenMP Tasking for Unbalanced Computation on Diverse CPU Architectures -- 1 Introduction -- 2 UTS: The Benchmark and Its Implementation -- 3 Test Problem
4 Experimental Setup -- 5 Results -- 5.1 Comparing Parallel Efficiency -- 5.2 Thread Scalability and Simultaneous Multithreading -- 5.3 Quantifying Load Balancing Operations -- 6 Related Work -- 7 Conclusions -- References -- Applications -- A Case Study of Porting HPGMG from CUDA to OpenMP Target Offload -- 1 Introduction -- 2 Related Work -- 3 The HPGMG Mini Application -- 3.1 HPGMG-CUDA -- 3.2 Porting HPGMG-CUDA to OpenMP Target Offload -- 3.3 Adding Explicit Data Management to HPGMG -- 4 Experimental Methodology -- 4.1 Hardware and Software Environment -- 4.2 Application Configuration
5 Performance Evaluation -- 5.1 Performance When Using Managed Memory -- 5.2 Performance When Using Explicit Data Management -- 6 Discussion -- 6.1 Assessment of OpenMP Abstractions -- 6.2 Assessment of Compiler Maturity and Performance -- 7 Conclusion -- References -- P-Aevol: An OpenMP Parallelizationpg of a Biological Evolution Simulator, Through Decomposition in Multiple Loops -- 1 Introduction -- 2 Aevol, An Irregular Stochastic Program -- 2.1 Aevol: A Simulation of Darwinian Evolution -- 2.2 Computational Workflow of Aevol -- 2.3 Dynamic Characterization of the Computation
3 Parallelization of the Evolutionary Loop -- 3.1 Straightforward Performance with OpenMP Loop Schedulers -- 3.2 Scheduling Iterations Based on Their Data -- 3.3 Predicting the Execution Time -- 3.4 LDNA, A Scheduling Algorithm for Aevol -- 3.5 OpenMP Implementation of LDNA -- 4 Experimental Results -- 4.1 Protocol of Experimentation -- 4.2 Results -- 4.3 Evaluation on Larger Populations -- 5 Related Work -- 6 Conclusion and Future Work -- References -- Evaluating Performance of OpenMP Tasks in a Seismic Stencil Application -- 1 Introduction -- 2 Related Work -- 3 Minimod Description
4 Code Structure and Taskification of Minimod -- 5 Evaluation -- 5.1 Experimental Setup -- 5.2 Results -- 6 Discussion -- 7 Conclusions -- References -- OpenMP Extensions -- Unified Sequential Optimization Directives in OpenMP -- 1 Introduction -- 2 Sequential Optimizations -- 2.1 Aliasing -- 2.2 Inlining -- 2.3 Optimization Control -- 2.4 Side Effects -- 2.5 Alignment -- 3 Directive Interaction -- 4 Case Study: Inlining in RAJA -- 4.1 RAJA -- 4.2 Evaluation -- 5 Conclusion -- References -- Supporting Data Shuffle Between Threads in OpenMP -- 1 Introduction -- 2 Motivation
2.1 CUDA shuffle Instruction for NVIDIA GPUs