Imperial College London

DrVahidShahrezaei

Faculty of Natural SciencesDepartment of Mathematics

Reader in Biomathematics
 
 
 
//

Contact

 

+44 (0)20 7594 8516v.shahrezaei Website

 
 
//

Location

 

301BSir Ernst Chain BuildingSouth Kensington Campus

//

Summary

 

Publications

Citation

BibTex format

@article{Tang:2023:bioinformatics/btad395,
author = {Tang, W and Jørgensen, ACS and Marguerat, S and Thomas, P and Shahrezaei, V},
doi = {bioinformatics/btad395},
journal = {Bioinformatics},
pages = {1--9},
title = {Modelling capture efficiency of single-cell RNA-sequencing data improves inference of transcriptome-wide burst kinetics},
url = {http://dx.doi.org/10.1093/bioinformatics/btad395},
volume = {39},
year = {2023}
}

RIS format (EndNote, RefMan)

TY  - JOUR
AB - MOTIVATION: Gene expression is characterised by stochastic bursts of transcription that occur at brief and random periods of promoter activity. The kinetics of gene expression burstiness differs across the genome and is dependent on the promoter sequence, among other factors. Single-cell RNA sequencing (scRNA-seq) has made it possible to quantify the cell-to-cell variability in transcription at a global genome-wide level. However, scRNA-seq data is prone to technical variability, including low and variable capture efficiency of transcripts from individual cells. RESULTS: Here, we propose a novel mathematical theory for the observed variability in scRNA-seq data. Our method captures burst kinetics and variability in both the cell size and capture efficiency, which allows us to propose several likelihood-based and simulation-based methods for the inference of burst kinetics from scRNA-seq data. Using both synthetic and real data, we show that the simulation-based methods provide an accurate, robust and flexible tool for inferring burst kinetics from scRNA-seq data. In particular, in a supervised manner, a simulation-based inference method based on neural networks proves to be accurate and useful when applied to both allele and non-allele-specific scRNA-seq data. AVAILABILITY: The code for Neural Network and Approximate Bayesian Computation inference is available at https://github.com/WT215/nnRNA and https://github.com/WT215/Julia_ABC respectively. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
AU - Tang,W
AU - Jørgensen,ACS
AU - Marguerat,S
AU - Thomas,P
AU - Shahrezaei,V
DO - bioinformatics/btad395
EP - 9
PY - 2023///
SN - 1367-4803
SP - 1
TI - Modelling capture efficiency of single-cell RNA-sequencing data improves inference of transcriptome-wide burst kinetics
T2 - Bioinformatics
UR - http://dx.doi.org/10.1093/bioinformatics/btad395
UR - https://www.ncbi.nlm.nih.gov/pubmed/37354494
UR - https://academic.oup.com/bioinformatics/article/39/7/btad395/7206880
UR - http://hdl.handle.net/10044/1/105251
VL - 39
ER -