On the Acceleration of Wavefront Applications using Distributed Many-Core Architectures

S. J. Pennycook; S. D. Hammond; G. R. Mudalige; S. A. Wright; S. A. Jarvis

doi:10.1093/comjnl/bxr073

Back

On the Acceleration of Wavefront Applications using Distributed Many-Core Architectures

Journal article

Peer reviewed

On the Acceleration of Wavefront Applications using Distributed Many-Core Architectures

S. J. Pennycook, S. D. Hammond, G. R. Mudalige, S. A. Wright and S. A. Jarvis

Computer journal, Vol.55(2), pp.138-153

01/02/2012

DOI: https://doi.org/10.1093/comjnl/bxr073

Abstract

Computer Science

Computer Science, Hardware & Architecture

Computer Science, Information Systems

Computer Science, Software Engineering

Computer Science, Theory & Methods

Science & Technology

Technology

In this paper we investigate the use of distributed graphics processing unit (GPU)-based architectures to accelerate pipelined wavefront applications-a ubiquitous class of parallel algorithms used for the solution of a number of scientific and engineering applications. Specifically, we employ a recently developed port of the LU solver (from the NAS Parallel Benchmark suite) to investigate the performance of these algorithms on high-performance computing solutions from NVIDIA (Tesla C1060 and C2050) as well as on traditional clusters (AMD/InfiniBand and IBM BlueGene/P). Benchmark results are presented for problem classes A to C and a recently developed performance model is used to provide projections for problem classes D and E, the latter of which represents a billion-cell problem. Our results demonstrate that while the theoretical performance of GPU solutions will far exceed those of many traditional technologies, the sustained application performance is currently comparable for scientific wavefront applications. Finally, a breakdown of the GPU solution is conducted, exposing PCIe overheads and decomposition constraints. A new k-blocking strategy is proposed to improve the future performance of this class of algorithm on GPU-based architectures.

Metrics

1 Record Views

Details

Title: On the Acceleration of Wavefront Applications using Distributed Many-Core Architectures
Creators: S. J. Pennycook - University of Warwick
S. D. Hammond - University of Warwick
G. R. Mudalige - University of Oxford
S. A. Wright - University of Warwick
S. A. Jarvis - University of Warwick
Publication Details: Computer journal, Vol.55(2), pp.138-153
Publisher: Oxford Univ Press
Number of pages: 16
Publication Date: 01/02/2012
Grant note: CDK0660; CDK0724 / UK Atomic Weapons Establishment IF090020 / Royal Society; Royal Society UK KTP006740 / AWE IF090020/AM / Royal Society; Royal Society UK KTP006740 / TSB
Identifiers: 991103785402346; WOS:000300042300002
Academic Unit: President & VC's Office (VC01)
Language: English
Resource Type: Journal article

On the Acceleration of Wavefront Applications using Distributed Many-Core Architectures

Abstract

Metrics

Details

Usage Policy