Logo image
Towards automated kernel fusion for the optimisation of scientific applications
Conference proceeding

Towards automated kernel fusion for the optimisation of scientific applications

Andrew Lamzed-Short, Timothy R Law, Andrew Mallinson, Gihan R Mudalige and Stephen A Jarvis
11/01/2021

Abstract

HPC LLVM kernel-fusion loop-fusion optimisation
In this paper we introduce a novel transformation pass written using LLVM that performs kernel fusion. We demonstrate the correctness and performance of the pass on several example programs inspired by scientific applications of interest. The method achieves up to 4× speedup relative to unfused versions of the programs, and exact performance parity with manually fused versions. In contrast to previous work, it also requires minimal user intervention. Our approach is facilitated by a new loop fusion algorithm capable of interprocedurally fusing both skewed and unskewed loops in different kernels.

Metrics

1 Record Views

Details

Logo image

Usage Policy