Imperial College London

Professor Dan Graham

Faculty of EngineeringDepartment of Civil and Environmental Engineering

Professor of Statistical Modelling
 
 
 
//

Contact

 

+44 (0)20 7594 6088d.j.graham Website

 
 
//

Assistant

 

Ms Maya Mistry +44 (0)20 7594 6100

 
//

Location

 

611Skempton BuildingSouth Kensington Campus

//

Summary

 

Publications

Citation

BibTex format

@article{Bansal:2021:10.1016/j.csda.2020.107152,
author = {Bansal, P and Krueger, R and Graham, DJ},
doi = {10.1016/j.csda.2020.107152},
journal = {Computational Statistics & Data Analysis},
pages = {1--19},
title = {Fast Bayesian estimation of spatial count data models},
url = {http://dx.doi.org/10.1016/j.csda.2020.107152},
volume = {157},
year = {2021}
}

RIS format (EndNote, RefMan)

TY  - JOUR
AB - Spatial count data models are used to explain and predict the frequency of phenomena such as traffic accidents in geographically distinct entities such as census tracts or road segments. These models are typically estimated using Bayesian Markov chain Monte Carlo (MCMC) simulation methods, which, however, are computationally expensive and do not scale well to large datasets. Variational Bayes (VB), a method from machine learning, addresses the shortcomings of MCMC by casting Bayesian estimation as an optimisation problem instead of a simulation problem. Considering all these advantages of VB, a VB method is derived for posterior inference in negative binomial models with unobserved parameter heterogeneity and spatial dependence. Pólya-Gamma augmentation is used to deal with the non-conjugacy of the negative binomial likelihood and an integrated non-factorised specification of the variational distribution is adopted to capture posterior dependencies. The benefits of the proposed approach are demonstrated in a Monte Carlo study and an empirical application on estimating youth pedestrian injury counts in census tracts of New York City. The VB approach is around 45 to 50 times faster than MCMC on a regular eight-core processor in a simulation and an empirical study, while offering similar estimation and predictive accuracy. Conditional on the availability of computational resources, the embarrassingly parallel architecture of the proposed VB method can be exploited to further accelerate its estimation by up to 20 times.
AU - Bansal,P
AU - Krueger,R
AU - Graham,DJ
DO - 10.1016/j.csda.2020.107152
EP - 19
PY - 2021///
SN - 0167-9473
SP - 1
TI - Fast Bayesian estimation of spatial count data models
T2 - Computational Statistics & Data Analysis
UR - http://dx.doi.org/10.1016/j.csda.2020.107152
UR - https://www.sciencedirect.com/science/article/pii/S0167947320302437?via%3Dihub
UR - http://hdl.handle.net/10044/1/85180
VL - 157
ER -