Automatically Optimized GPU Acceleration of Element Subroutines in Finite Element Method

Investor logo

Warning

This publication doesn't include Faculty of Economics and Administration. It includes Faculty of Informatics. Official publication website can be found on muni.cz.
Authors

FILIPOVIČ Jiří FOUSEK Jan LAKOMÝ Bedřich MADZIN Matúš

Year of publication 2012
Type Article in Proceedings
Conference Symposium on Application Accelerators in High Performance Computing
MU Faculty or unit

Faculty of Informatics

Citation
Field Informatics
Keywords GPGPU; code optimization; kernel fusion; FEM
Description The element subroutines in finite element method (FEM) provides enough parallelism to be successfully accelerated by contemporary GPUs. However, their efficient implementation is not straightforward and requires time-consuming exploration of numerous implementation variants. In this paper, we present optimization by kernel fusion for element subroutines. Moreover, we show how the optimization is automated using our source-to-source compiler. We demonstrate the optimization of the element subroutines for FEM model using St.\,Venant-Kirchhoff material. The performance of code generated by our compiler outperforms our previously published hand-tuned implementation by factor of 1.32 -- 1.54 depending on used GPU architecture. Although the optimization technique is demonstrated on element subroutines for using St.\,Venant-Kirchhoff material, it is generally usable for wider area of computationally-demanding problems.
Related projects:

You are running an old browser version. We recommend updating your browser to its latest version.