Imperial College London

Dr Lluis Vilanova

Faculty of Engineering, Department of Computing

Senior Lecturer

Contact

 

+44 (0)20 7594 8328

 
 

Location

 

556, Huxley Building, South Kensington Campus



 

Publications

Citation

BibTeX format

@inproceedings{Jordà:2013:10.1109/MCSoC.2013.40,
author = {Jordà, M and Tanasic, I and Cabezas, J and Vilanova, L and Gelado, I and Navarro, N},
doi = {10.1109/MCSoC.2013.40},
pages = {135--140},
title = {Auto-tuning of data communication on heterogeneous systems},
url = {http://dx.doi.org/10.1109/MCSoC.2013.40},
year = {2013}
}

RIS format (EndNote, RefMan)

TY  - CPAPER
AB  - Heterogeneous systems formed by traditional CPUs and compute accelerators, such as GPUs, are becoming widely used to build modern supercomputers. However, many different system topologies (i.e., how CPUs, accelerators, and I/O devices are interconnected) are being deployed. Each system organization presents different trade-offs when transferring data between CPUs, accelerators, and nodes within a cluster, requiring different software implementations to achieve optimal data communication bandwidth. In this paper we explore the potential impact of two optimizations to achieve optimal data transfer bandwidth: topology-aware process placement policies and double-buffering. We design a set of experiments to evaluate all possible alternatives, and run each of them on different hardware configurations. We show that optimal data transfer mechanisms depend on both the hardware topology and the application dataset size. Our experimental evaluation shows that auto-tuning applications to match the hardware topology and to find the best double-buffering configuration can improve data transfer bandwidth by up to 70% for local communication and is key to achieving optimal bandwidth in remote communication for data transfers larger than 128 KB. © 2013 IEEE.
AU  - Jordà, M
AU  - Tanasic, I
AU  - Cabezas, J
AU  - Vilanova, L
AU  - Gelado, I
AU  - Navarro, N
DO  - 10.1109/MCSoC.2013.40
EP  - 140
PY  - 2013///
SP  - 135
TI  - Auto-tuning of data communication on heterogeneous systems
UR  - http://dx.doi.org/10.1109/MCSoC.2013.40
ER  -