Amazon cover image
Image from Amazon.com

OpenMP : 16th International Workshop on OpenMP, IWOMP 2020, Austin, TX, USA, September 22-24, 2020, Proceedings / Kent Milfeld, Bronis R. de Supinski, Lars Koesterke, Jannis Klinkenberg (editors).

By: Contributor(s): Material type: TextTextSeries: Lecture notes in computer science ; 12295. | LNCS sublibrary. SL 2, Programming and software engineering.Publisher: Cham : Springer, 2020Description: 1 online resource (342 pages)Content type:
  • text
Media type:
  • computer
Carrier type:
  • online resource
ISBN:
  • 9783030581442
  • 3030581446
Other title:
  • IWOMP 2020
Subject(s): Genre/Form: Additional physical formats: Print version:: OpenMP: Portable Multi-Level Parallelism on Modern Systems.DDC classification:
  • 005.1 23
LOC classification:
  • QA76.76.A63 I96 2020eb
Online resources:
Contents:
Intro -- Preface -- Organization -- Contents -- Performance Methodologies -- FAROS: A Framework to Analyze OpenMP Compilation Through Benchmarking and Compiler Optimization Analysis -- 1 Introduction -- 2 FAROS: Design and Implementation -- 2.1 Harness and Configuration -- 2.2 Analyzing Compiler Optimization and Performance -- 3 Evaluation -- 4 Related Work -- 5 Conclusion and Future Work -- References -- Evaluating the Efficiency of OpenMP Tasking for Unbalanced Computation on Diverse CPU Architectures -- 1 Introduction -- 2 UTS: The Benchmark and Its Implementation -- 3 Test Problem
4 Experimental Setup -- 5 Results -- 5.1 Comparing Parallel Efficiency -- 5.2 Thread Scalability and Simultaneous Multithreading -- 5.3 Quantifying Load Balancing Operations -- 6 Related Work -- 7 Conclusions -- References -- Applications -- A Case Study of Porting HPGMG from CUDA to OpenMP Target Offload -- 1 Introduction -- 2 Related Work -- 3 The HPGMG Mini Application -- 3.1 HPGMG-CUDA -- 3.2 Porting HPGMG-CUDA to OpenMP Target Offload -- 3.3 Adding Explicit Data Management to HPGMG -- 4 Experimental Methodology -- 4.1 Hardware and Software Environment -- 4.2 Application Configuration
5 Performance Evaluation -- 5.1 Performance When Using Managed Memory -- 5.2 Performance When Using Explicit Data Management -- 6 Discussion -- 6.1 Assessment of OpenMP Abstractions -- 6.2 Assessment of Compiler Maturity and Performance -- 7 Conclusion -- References -- P-Aevol: An OpenMP Parallelizationpg of a Biological Evolution Simulator, Through Decomposition in Multiple Loops -- 1 Introduction -- 2 Aevol, An Irregular Stochastic Program -- 2.1 Aevol: A Simulation of Darwinian Evolution -- 2.2 Computational Workflow of Aevol -- 2.3 Dynamic Characterization of the Computation
3 Parallelization of the Evolutionary Loop -- 3.1 Straightforward Performance with OpenMP Loop Schedulers -- 3.2 Scheduling Iterations Based on Their Data -- 3.3 Predicting the Execution Time -- 3.4 LDNA, A Scheduling Algorithm for Aevol -- 3.5 OpenMP Implementation of LDNA -- 4 Experimental Results -- 4.1 Protocol of Experimentation -- 4.2 Results -- 4.3 Evaluation on Larger Populations -- 5 Related Work -- 6 Conclusion and Future Work -- References -- Evaluating Performance of OpenMP Tasks in a Seismic Stencil Application -- 1 Introduction -- 2 Related Work -- 3 Minimod Description
4 Code Structure and Taskification of Minimod -- 5 Evaluation -- 5.1 Experimental Setup -- 5.2 Results -- 6 Discussion -- 7 Conclusions -- References -- OpenMP Extensions -- Unified Sequential Optimization Directives in OpenMP -- 1 Introduction -- 2 Sequential Optimizations -- 2.1 Aliasing -- 2.2 Inlining -- 2.3 Optimization Control -- 2.4 Side Effects -- 2.5 Alignment -- 3 Directive Interaction -- 4 Case Study: Inlining in RAJA -- 4.1 RAJA -- 4.2 Evaluation -- 5 Conclusion -- References -- Supporting Data Shuffle Between Threads in OpenMP -- 1 Introduction -- 2 Motivation
Holdings
Item type Current library Collection Call number Status Date due Barcode Item holds
eBook eBook e-Library eBook LNCS Available
Total holds: 0

Print version record.

International conference proceedings.

"In 2020, IWOMP continued the series with technical papers and tutorials presented through a virtual conference, due to the COVID-19 pandemic."

Intro -- Preface -- Organization -- Contents -- Performance Methodologies -- FAROS: A Framework to Analyze OpenMP Compilation Through Benchmarking and Compiler Optimization Analysis -- 1 Introduction -- 2 FAROS: Design and Implementation -- 2.1 Harness and Configuration -- 2.2 Analyzing Compiler Optimization and Performance -- 3 Evaluation -- 4 Related Work -- 5 Conclusion and Future Work -- References -- Evaluating the Efficiency of OpenMP Tasking for Unbalanced Computation on Diverse CPU Architectures -- 1 Introduction -- 2 UTS: The Benchmark and Its Implementation -- 3 Test Problem

4 Experimental Setup -- 5 Results -- 5.1 Comparing Parallel Efficiency -- 5.2 Thread Scalability and Simultaneous Multithreading -- 5.3 Quantifying Load Balancing Operations -- 6 Related Work -- 7 Conclusions -- References -- Applications -- A Case Study of Porting HPGMG from CUDA to OpenMP Target Offload -- 1 Introduction -- 2 Related Work -- 3 The HPGMG Mini Application -- 3.1 HPGMG-CUDA -- 3.2 Porting HPGMG-CUDA to OpenMP Target Offload -- 3.3 Adding Explicit Data Management to HPGMG -- 4 Experimental Methodology -- 4.1 Hardware and Software Environment -- 4.2 Application Configuration

5 Performance Evaluation -- 5.1 Performance When Using Managed Memory -- 5.2 Performance When Using Explicit Data Management -- 6 Discussion -- 6.1 Assessment of OpenMP Abstractions -- 6.2 Assessment of Compiler Maturity and Performance -- 7 Conclusion -- References -- P-Aevol: An OpenMP Parallelizationpg of a Biological Evolution Simulator, Through Decomposition in Multiple Loops -- 1 Introduction -- 2 Aevol, An Irregular Stochastic Program -- 2.1 Aevol: A Simulation of Darwinian Evolution -- 2.2 Computational Workflow of Aevol -- 2.3 Dynamic Characterization of the Computation

3 Parallelization of the Evolutionary Loop -- 3.1 Straightforward Performance with OpenMP Loop Schedulers -- 3.2 Scheduling Iterations Based on Their Data -- 3.3 Predicting the Execution Time -- 3.4 LDNA, A Scheduling Algorithm for Aevol -- 3.5 OpenMP Implementation of LDNA -- 4 Experimental Results -- 4.1 Protocol of Experimentation -- 4.2 Results -- 4.3 Evaluation on Larger Populations -- 5 Related Work -- 6 Conclusion and Future Work -- References -- Evaluating Performance of OpenMP Tasks in a Seismic Stencil Application -- 1 Introduction -- 2 Related Work -- 3 Minimod Description

4 Code Structure and Taskification of Minimod -- 5 Evaluation -- 5.1 Experimental Setup -- 5.2 Results -- 6 Discussion -- 7 Conclusions -- References -- OpenMP Extensions -- Unified Sequential Optimization Directives in OpenMP -- 1 Introduction -- 2 Sequential Optimizations -- 2.1 Aliasing -- 2.2 Inlining -- 2.3 Optimization Control -- 2.4 Side Effects -- 2.5 Alignment -- 3 Directive Interaction -- 4 Case Study: Inlining in RAJA -- 4.1 RAJA -- 4.2 Evaluation -- 5 Conclusion -- References -- Supporting Data Shuffle Between Threads in OpenMP -- 1 Introduction -- 2 Motivation

2.1 CUDA shuffle Instruction for NVIDIA GPUs

Powered by Koha