Imperial College London

Professor Peter Y. K. Cheung

Faculty of EngineeringDyson School of Design Engineering

Professor of Digital Systems
 
 
 
//

Contact

 

+44 (0)20 7594 6200p.cheung Website

 
 
//

Assistant

 

Mrs Wiesia Hsissen +44 (0)20 7594 6261

 
//

Location

 

910BElectrical EngineeringSouth Kensington Campus

//

Summary

 

Publications

Citation

BibTex format

@inproceedings{Davis:2014:10.1109/FPL.2014.6927447,
author = {Davis, J and Cheung, PYK},
doi = {10.1109/FPL.2014.6927447},
pages = {1--6},
publisher = {IEEE},
title = {Achieving Low-overhead Fault Tolerance for Parallel Accelerators with Dynamic Partial Reconfiguration},
url = {http://dx.doi.org/10.1109/FPL.2014.6927447},
year = {2014}
}

RIS format (EndNote, RefMan)

TY  - CPAPER
AB - While allowing for the fabrication of increasingly complex and efficient circuitry, transistor shrinkage and count-per-device expansion have major downsides: chiefly increased variation, degradation and fault susceptibility. For this reason, design-time consideration of fault tolerance will have to be given to increasing numbers of electronic systems in the future to ensure yields, reliabilities and lifetimes remain acceptably high. Many commonly implemented operators are suited to modification resulting in datapath error detection capabilities with low area overheads. FPGAs are uniquely placed to allow further area savings to be made when incorporating fault avoidance mechanisms thanks to their dynamic reconfigurability. In this paper, we examine the practicalities and costs involved in implementing hardware-software fault tolerance on a test platform: a parallel matrix multiplication accelerator in hardware, with controller in software, running on a Xilinx Zynq system-on-chip. A combination of `bolt-on' error detection logic and software-triggered routing reconfiguration serve to provide low-overhead datapath fault tolerance at runtime. Rapid yet accurate fault diagnoses along with low hardware (area), software (configuration storage) and performance penalties are achieved.
AU - Davis,J
AU - Cheung,PYK
DO - 10.1109/FPL.2014.6927447
EP - 6
PB - IEEE
PY - 2014///
SN - 1946-147X
SP - 1
TI - Achieving Low-overhead Fault Tolerance for Parallel Accelerators with Dynamic Partial Reconfiguration
UR - http://dx.doi.org/10.1109/FPL.2014.6927447
UR - http://ieeexplore.ieee.org/document/6927447/
UR - http://hdl.handle.net/10044/1/23357
ER -