Imperial College London

ProfessorChristos-SavvasBouganis

Faculty of EngineeringDepartment of Electrical and Electronic Engineering

Professor of Intelligent Digital Systems
 
 
 
//

Contact

 

+44 (0)20 7594 6144christos-savvas.bouganis Website

 
 
//

Location

 

904Electrical EngineeringSouth Kensington Campus

//

Summary

 

Publications

Citation

BibTex format

@inproceedings{Rajagopal:2021:10.1109/ICCVW54120.2021.00112,
author = {Rajagopal, A and Bouganis, C-S},
doi = {10.1109/ICCVW54120.2021.00112},
pages = {963--971},
publisher = {IEEE COMPUTER SOC},
title = {perf4sight: a toolflow to model CNN training performance on Edge GPUs},
url = {http://dx.doi.org/10.1109/ICCVW54120.2021.00112},
year = {2021}
}

RIS format (EndNote, RefMan)

TY  - CPAPER
AB - The increased memory and processing capabilities of today’s edge devices create opportunities for greater edge intelligence. In the domain of vision, the ability to adapt a Convolutional Neural Network’s (CNN) structure and parameters to the input data distribution leads to systems with lower memory footprint, latency and power consumption. However, due to the limited compute resources and memory budget on edge devices, it is necessary for the system to be able to predict the latency and memory footprint of the training process in order to identify favourable training configurations of the network topology and device combination for efficient network adaptation. This work proposes perf4sight, an automated methodology for developing accurate models that predict CNN training memory footprint and latency given a target device and network. This enables rapid identification of network topologies that can be retrained on the edge device with low resource consumption. With PyTorch as the framework and NVIDIA Jetson TX2 as the target device, the developed models predict training memory footprint and latency with 95% and 91% accuracy respectively for a wide range of networks, opening the path towards efficient network adaptation on edge GPUs.
AU - Rajagopal,A
AU - Bouganis,C-S
DO - 10.1109/ICCVW54120.2021.00112
EP - 971
PB - IEEE COMPUTER SOC
PY - 2021///
SN - 2473-9936
SP - 963
TI - perf4sight: a toolflow to model CNN training performance on Edge GPUs
UR - http://dx.doi.org/10.1109/ICCVW54120.2021.00112
UR - https://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=PARTNER_APP&SrcAuth=LinksAMR&KeyUT=WOS:000739651101005&DestLinkType=FullRecord&DestApp=ALL_WOS&UsrCustomerID=a2bf6146997ec60c407a63945d4e92bb
UR - https://ieeexplore.ieee.org/document/9607636
UR - http://hdl.handle.net/10044/1/110555
ER -