Imperial College London

Professor Peter Harrison

Faculty of Engineering, Department of Computing

Emeritus Professor in Mathematical Modelling
 
 
 

Contact

 

+44 (0)20 7594 8363
p.harrison
Website

 
 

Location

 

353 Huxley Building, South Kensington Campus



Citation

BibTeX format

@inproceedings{Harrison:2016:10.1109/INFOCOM.2016.7524365,
author = {Harrison, PG and Qiu, Z and Perez, JF},
doi = {10.1109/INFOCOM.2016.7524365},
pages = {1--9},
publisher = {IEEE},
title = {Variability-aware request replication for latency curtailment},
url = {http://dx.doi.org/10.1109/INFOCOM.2016.7524365},
year = {2016}
}

RIS format (EndNote, RefMan)

TY  - CPAPER
AB  - Processing time variability is commonplace in distributed systems, where resources display disparate performance due to, e.g., different workload levels, background processes, and contention in virtualized environments. However, it is paramount for service providers to keep variability in response time under control in order to offer responsive services. We investigate how request replication can be used to exploit processing time variability to reduce response times, considering not only mean values but also the tail of the response time distribution. We focus on the distributed setup, where replication is achieved by running copies of requests on multiple servers that otherwise evolve independently, and waiting for the first replica to complete service. We construct models that capture the evolution of a system with replicated requests using approximate methods and observe that highly variable service times offer the best opportunities for replication, reducing the response time tail in particular. Further, the effect of replication is non-uniform over the response time distribution: gains in one metric, e.g., the mean, can be at the cost of another, e.g., the tail percentiles. This is demonstrated in a wide range of numerical virtual experiments. It can be seen that capturing service time variability is key to the evaluation of latency tolerance strategies and in their design.
AU  - Harrison, PG
AU  - Qiu, Z
AU  - Perez, JF
DO  - 10.1109/INFOCOM.2016.7524365
EP  - 9
PB  - IEEE
PY  - 2016///
SP  - 1
TI  - Variability-aware request replication for latency curtailment
UR  - http://dx.doi.org/10.1109/INFOCOM.2016.7524365
UR  - http://hdl.handle.net/10044/1/29904
ER  - 