Imperial College London

ProfessorPaulKelly

Faculty of EngineeringDepartment of Computing

Professor of Software Technology
 
 
 
//

Contact

 

+44 (0)20 7594 8332p.kelly Website

 
 
//

Location

 

Level 3 (upstairs), William Penney Building, room 304William Penney LaboratorySouth Kensington Campus

//

Summary

 

Publications

Citation

BibTex format

@inproceedings{Bisbas:2021:10.1109/IPDPS49936.2021.00058,
author = {Bisbas, G and Luporini, F and Louboutin, M and Nelson, R and Gorman, GJ and Kelly, PHJ},
doi = {10.1109/IPDPS49936.2021.00058},
pages = {497--506},
publisher = {IEEE COMPUTER SOC},
title = {Temporal blocking of finite-difference stencil operators with sparse "off-the-grid" sources},
url = {http://dx.doi.org/10.1109/IPDPS49936.2021.00058},
year = {2021}
}

RIS format (EndNote, RefMan)

TY  - CPAPER
AB - Stencil kernels dominate a range of scientific applications, including seismic and medical imaging, image processing, and neural networks. Temporal blocking is a performance optimization that aims to reduce the required memory bandwidth of stencil computations by re-using data from the cache for multiple time steps. It has already been shown to be beneficial for this class of algorithms. However, applying temporal blocking to practical applications' stencils remains challenging. These computations often consist of sparsely located operators not aligned with the computational grid (“off-the-grid”). Our work is motivated by modelling problems in which source injections result in wavefields that must then be measured at receivers by interpolation from the grided wavefield. The resulting data dependencies make the adoption of temporal blocking much more challenging. We propose a methodology to inspect these data dependencies and reorder the computation, leading to performance gains in stencil codes where temporal blocking has not been applicable. We implement this novel scheme in the Devito domain-specific compiler toolchain. Devito implements a domain-specific language embedded in Python to generate optimized partial differential equation solvers using the finite-difference method from high-level symbolic problem definitions. We evaluate our scheme using isotropic acoustic, anisotropic acoustic, and isotropic elastic wave propagators of industrial significance. After auto-tuning, performance evaluation shows that this enables substantial performance improvement through temporal blocking over highly-optimized vectorized spatially-blocked code of up to 1.6x.
AU - Bisbas,G
AU - Luporini,F
AU - Louboutin,M
AU - Nelson,R
AU - Gorman,GJ
AU - Kelly,PHJ
DO - 10.1109/IPDPS49936.2021.00058
EP - 506
PB - IEEE COMPUTER SOC
PY - 2021///
SN - 1530-2075
SP - 497
TI - Temporal blocking of finite-difference stencil operators with sparse "off-the-grid" sources
UR - http://dx.doi.org/10.1109/IPDPS49936.2021.00058
UR - http://gateway.webofknowledge.com/gateway/Gateway.cgi?GWVersion=2&SrcApp=PARTNER_APP&SrcAuth=LinksAMR&KeyUT=WOS:000695273000050&DestLinkType=FullRecord&DestApp=ALL_WOS&UsrCustomerID=1ba7043ffcc86c417c072aa74d649202
UR - https://ieeexplore.ieee.org/document/9460483
UR - http://hdl.handle.net/10044/1/92275
ER -