Imperial College London

ProfessorPaulKelly

Faculty of EngineeringDepartment of Computing

Professor of Software Technology
 
 
 
//

Contact

 

+44 (0)20 7594 8332p.kelly Website

 
 
//

Location

 

Level 3 (upstairs), William Penney Building, room 304William Penney LaboratorySouth Kensington Campus

//

Summary

 

Publications

Citation

BibTex format

@article{Kelly:2015:10.1109/TPDS.2015.2453972,
author = {Kelly, PHJ and Reguly, IZ and Mudalige, GR and Bertolli, C and Giles, MB and Betts, A and Radford, D},
doi = {10.1109/TPDS.2015.2453972},
journal = {IEEE Transactions on Parallel and Distributed Systems},
pages = {1265--1278},
title = {Acceleration of a Full-scale Industrial CFD Application with OP2},
url = {http://dx.doi.org/10.1109/TPDS.2015.2453972},
volume = {27},
year = {2015}
}

RIS format (EndNote, RefMan)

TY  - JOUR
AB - Hydra is a full-scale industrial CFD application used for the design of turbomachinery at Rolls Royce plc., capable ofperforming complex simulations over highly detailed unstructured mesh geometries. Hydra presents major challenges in dataorganization and movement that need to be overcome for continued high performance on emerging platforms. We present research inachieving this goal through the OP2 domain-specific high-level framework, demonstrating the viability of such a high-level programmingapproach. OP2 targets the domain of unstructured mesh problems and enables execution on a range of back-end hardware platforms.We chart the conversion of Hydra to OP2, and map out the key difficulties encountered in the process. Specifically we show howdifferent parallel implementations can be achieved with an active library framework, even for a highly complicated industrial applicationand how different optimizations targeting contrasting parallel architectures can be applied to the whole application, seamlessly,reducing developer effort and increasing code longevity. Performance results demonstrate that not only the same runtime performanceas that of the hand-tuned original code could be achieved, but it can be significantly improved on conventional processor systems, andmany-core systems. Our results provide evidence of how high-level frameworks such as OP2 enable portability across a wide range ofcontrasting platforms and their significant utility in achieving near-optimal performance without the intervention of the applicationprogrammer.
AU - Kelly,PHJ
AU - Reguly,IZ
AU - Mudalige,GR
AU - Bertolli,C
AU - Giles,MB
AU - Betts,A
AU - Radford,D
DO - 10.1109/TPDS.2015.2453972
EP - 1278
PY - 2015///
SN - 1558-2183
SP - 1265
TI - Acceleration of a Full-scale Industrial CFD Application with OP2
T2 - IEEE Transactions on Parallel and Distributed Systems
UR - http://dx.doi.org/10.1109/TPDS.2015.2453972
UR - http://hdl.handle.net/10044/1/25176
VL - 27
ER -