Imperial College London

DrVahidShahrezaei

Faculty of Natural SciencesDepartment of Mathematics

Reader in Biomathematics
 
 
 
//

Contact

 

+44 (0)20 7594 8516v.shahrezaei Website

 
 
//

Location

 

301BSir Ernst Chain BuildingSouth Kensington Campus

//

Summary

 

Publications

Citation

BibTex format

@article{Tang:2020:bioinformatics/btz726,
author = {Tang, W and Bertaux, F and Thomas, P and Stefanelli, C and Saint, M and Marguerat, S and Shahrezaei, V},
doi = {bioinformatics/btz726},
journal = {Bioinformatics},
pages = {1174--1181},
title = {bayNorm: Bayesian gene expression recovery, imputation and normalisation for single cell RNA-sequencing data},
url = {http://dx.doi.org/10.1093/bioinformatics/btz726},
volume = {36},
year = {2020}
}

RIS format (EndNote, RefMan)

TY  - JOUR
AB - Motivation:Normalisation of single cell RNA sequencing (scRNA-seq) data is a prerequisite to theirinterpretation. The marked technical variability, high amounts of missing observations and batch effecttypical of scRNA-seq datasets make this task particularly challenging. There is a need for an efficient andunified approach for normalisation, imputation and batch effect correction.Results:Here, we introduce bayNorm, a novel Bayesian approach for scaling and inference of scRNA-seq counts. The method’s likelihood function follows a binomial model of mRNA capture, while priorsare estimated from expression values across cells using an empirical Bayes approach. We first validateour assumptions by showing this model can reproduce different statistics observed in real scRNA-seqdata. We demonstrate using publicly-available scRNA-seq datasets and simulated expression data thatbayNorm allows robust imputation of missing values generating realistic transcript distributions that matchsingle molecule FISH measurements. Moreover, by using priors informed by dataset structures, bayNormimproves accuracy and sensitivity of differential expression analysis and reduces batch effect comparedto other existing methods. Altogether, bayNorm provides an efficient, integrated solution for global scalingnormalisation, imputation and true count recovery of gene expression measurements from scRNA-seqdata.Availability:The R package “bayNorm” is available at https://github.com/WT215/bayNorm. The code foranalysing data in this paper is available at https://github.com/WT215/bayNorm_papercode.Contact:samuel.marguerat@imperial.ac.uk or v.shahrezaei@imperial.ac.ukSupplementary information:Supplementary data are available atBioinformaticsonline.
AU - Tang,W
AU - Bertaux,F
AU - Thomas,P
AU - Stefanelli,C
AU - Saint,M
AU - Marguerat,S
AU - Shahrezaei,V
DO - bioinformatics/btz726
EP - 1181
PY - 2020///
SN - 1367-4803
SP - 1174
TI - bayNorm: Bayesian gene expression recovery, imputation and normalisation for single cell RNA-sequencing data
T2 - Bioinformatics
UR - http://dx.doi.org/10.1093/bioinformatics/btz726
UR - https://academic.oup.com/bioinformatics/article/36/4/1174/5581401
UR - http://hdl.handle.net/10044/1/73640
VL - 36
ER -