DrOliverRatmann

Faculty of Natural Sciences, Department of Mathematics

Reader in Statistics and Machine Learning for Public Good

Contact

oliver.ratmann05 Website

Location

525Huxley BuildingSouth Kensington Campus

Summary

Publications

Le Vu S, Ratmann O, Delpech V, Brown AE, Gill ON, Tostevin A, Dunn D, Fraser C, Volz Eet al., 2019, HIV-1 transmission patterns in men who have sex with men: insights from genetic source attribution analysis, AIDS Research and Human Retroviruses, Vol: 39, Pages: 805-813, ISSN: 0889-2229

BACKGROUND: Near 60% of new HIV infections in the United Kingdom are estimated to occur in men who have sex with men (MSM). Age-disassortative partnerships in MSM have been suggested to spread the HIV epidemics in many Western developed countries and to contribute to ethnic disparities in infection rates. Understanding these mixing patterns in transmission can help to determine which groups are at a greater risk and guide public health interventions. METHODS: We analyzed combined epidemiologic data and viral sequences from MSM diagnosed with HIV at the national level. We applied a phylodynamic source attribution model to infer patterns of transmission between groups of patients. RESULTS: From pair probabilities of transmission between 14 603 MSM patients, we found that potential transmitters of HIV subtype B were on average 8 months older than recipients. We also found a moderate overall assortativity of transmission by ethnic group and a stronger assortativity by region. CONCLUSIONS: Our findings suggest that there is only a modest net flow of transmissions from older to young MSM in subtype B epidemics and that young MSM, both for Black or White groups, are more likely to be infected by one another than expected in a sexual network with random mixing.

Journal article

Abeler-Dorner L, Grabowski MK, Rambaut A, Pillay D, Fraser C, Ayles H, Bonsall D, Bowden R, Calvez V, Essex M, Fidler S, Golubchik T, Hayes R, Herbeck JT, Kagaayi J, Kaleebu P, Lingappa JR, Novitsky V, Quinn TC, Ratmann O, Seeley J, Ssemwanga D, Tanser F, Wawer MJet al., 2019, PANGEA-HIV 2: Phylogenetics and networks for generalised epidemics in Africa, Current Opinion in HIV and AIDS, Vol: 14, Pages: 173-180, ISSN: 1746-630X

Purpose of review The HIV epidemic in sub-Saharan Africa is far from being under control and the ambitious UNAIDS targets are unlikely to be met by 2020 as declines in per-capita incidence being largely offset by demographic trends. There is an increasing number of proven and specific HIV prevention tools, but little consensus on how best to deploy them.Recent findings Traditionally, phylogenetics has been used in HIV research to reconstruct the history of the epidemic and date zoonotic infections, whereas more recent publications focus on HIV diversity and drug resistance. However, it is also the most powerful method of source attribution available for the study of HIV transmission. The PANGEA (Phylogenetics And Networks for Generalized Epidemics in Africa) consortium has generated over 18 000 NGS HIV sequences from five countries in sub-Saharan Africa. Using phylogenetic methods, we will identify characteristics of individuals or groups, which are most likely to be at risk of infection or at risk of infecting others.Summary Combining phylogenetics, phylodynamics and epidemiology will allow PANGEA to highlight where prevention efforts should be focussed to reduce the HIV epidemic most effectively. To maximise the public health benefit of the data, PANGEA offers accreditation to external researchers, allowing them to access the data and join the consortium. We also welcome submissions of other HIV sequences from sub-Saharan Africa to the database.

Journal article

Ratmann O, Grabowski MK, Hall M, Golubchik T, Wymant C, Abeler-Dörner L, Bonsall D, Hoppe A, Brown AL, de Oliveira T, Gall A, Kellam P, Pillay D, Kagaayi J, Kigozi G, Quinn TC, Wawer MJ, Laeyendecker O, Serwadda D, Gray RH, Fraser Cet al., 2019, Inferring HIV-1 transmission networks and sources of epidemic spread in Africa with deep-sequence phylogenetic analysis, Nature Communications, Vol: 10, ISSN: 2041-1723

To prevent new infections with human immunodeficiency virus type 1 (HIV-1) in sub-Saharan Africa, UNAIDS recommends targeting interventions to populations that are at high risk of acquiring and passing on the virus. Yet it is often unclear who and where these ‘source’ populations are. Here we demonstrate how viral deep-sequencing can be used to reconstruct HIV-1 transmission networks and to infer the direction of transmission in these networks. We are able to deep-sequence virus from a large population-based sample of infected individuals in Rakai District, Uganda, reconstruct partial transmission networks, and infer the direction of transmission within them at an estimated error rate of 16.3% [8.8–28.3%]. With this error rate, deep-sequence phylogenetics cannot be used against individuals in legal contexts, but is sufficiently low for population-level inferences into the sources of epidemic spread. The technique presents new opportunities for characterizing source populations and for targeting of HIV-1 prevention interventions in Africa.

Journal article

Metzig C, Ratmann O, Bezemer D, Colijn Cet al., 2019, Phylogenies from dynamic networks, PLoS Computational Biology, Vol: 15, ISSN: 1553-734X

The relationship between the underlying contact network over which a pathogen spreads and the pathogen phylogenetic trees that are obtained presents an opportunity to use sequence data to learn about contact networks that are difficult to study empirically. However, this relationship is not explicitly known and is usually studied in simulations, often with the simplifying assumption that the contact network is static in time, though human contact networks are dynamic. We simulate pathogen phylogenetic trees on dynamic Erdős-Renyi random networks and on two dynamic networks with skewed degree distribution, of which one is additionally clustered. We use tree shape features to explore how adding dynamics changes the relationships between the overall network structure and phylogenies. Our tree features include the number of small substructures (cherries, pitchforks) in the trees, measures of tree imbalance (Sackin index, Colless index), features derived from network science (diameter, closeness), as well as features using the internal branch lengths from the tip to the root. Using principal component analysis we find that the network dynamics influence the shapes of phylogenies, as does the network type. We also compare dynamic and time-integrated static networks. We find, in particular, that static network models like the widely used Barabasi-Albert model can be poor approximations for dynamic networks. We explore the effects of mis-specifying the network on the performance of classifiers trained identify the transmission rate (using supervised learning methods). We find that both mis-specification of the underlying network and its parameters (mean degree, turnover rate) have a strong adverse effect on the ability to estimate the transmission parameter. We illustrate these results by classifying HIV trees with a classifier that we trained on simulated trees from different networks, infection rates and turnover rates. Our results point to the importance of correctly est

Journal article

Ratmann O, Camacho A, Hu S, Colijn Cet al., 2019, Informed Choices: How to Calibrate ABC with Hypothesis Testing, HANDBOOK OF APPROXIMATE BAYESIAN COMPUTATION, Editors: Sisson, Fan, Beaumont, Publisher: CRC PRESS-TAYLOR & FRANCIS GROUP, Pages: 289-319, ISBN: 978-1-4398-8150-7

Book chapter

Lachlan RF, Ratmann O, Nowicki S, 2018, Cultural conformity generates extremely stable traditions in bird song, Nature Communications, Vol: 9, ISSN: 2041-1723

Cultural traditions have been observed in a wide variety of animal species. It remains unclear, however, what is required for social learning to give rise to stable traditions: what level of precision and what learning strategies are required. We address these questions by fitting models of cultural evolution to learned bird song. We recorded 615 swamp sparrow (Melospiza georgiana) song repertoires, and compared syllable frequency distributions to the output of individual-based simulations. We find that syllables are learned with an estimated error rate of 1.85% and with a conformist bias in learning. This bias is consistent with a simple mechanism of overproduction and selective attrition. Finally, we estimate that syllable types could frequently persist for more than 500 years. Our results demonstrate conformist bias in natural animal behaviour and show that this, along with moderately precise learning, may support traditions whose stability rivals those of humans.

Journal article

Le Vu SOK, Ratmann O, Delpech V, Brown AE, Gill ON, Tostevin A, Fraser C, Volz EMet al., 2018, Comparison of cluster-based and source-attribution methods for estimating transmission risk using large HIV sequence databases, Epidemics, Vol: 23, Pages: 1-10, ISSN: 1755-4365

Phylogenetic clustering of HIV sequences from a random sample of patients can reveal epidemiological transmission patterns, but interpretation is hampered by limited theoretical support and statistical properties of clustering analysis remain poorly understood. Alternatively, source attribution methods allow fitting of HIV transmission models and thereby quantify aspects of disease transmission.A simulation study was conducted to assess error rates of clustering methods for detecting transmission risk factors. We modeled HIV epidemics among men having sex with men and generated phylogenies comparable to those that can be obtained from HIV surveillance data in the UK. Clustering and source attribution approaches were applied to evaluate their ability to identify patient attributes as transmission risk factors.We find that commonly used methods show a misleading association between cluster size or odds of clustering and covariates that are correlated with time since infection, regardless of their influence on transmission. Clustering methods usually have higher error rates and lower sensitivity than source attribution method for identifying transmission risk factors. But neither methods provide robust estimates of transmission risk ratios. Source attribution method can alleviate drawbacks from phylogenetic clustering but formal population genetic modeling may be required to estimate quantitative transmission risk factors.

Journal article

Volz E, Le Vu S, Ratmann O, Tostevin A, Orkin C, O'Shea S, Delpech V, Brown A, Fraser NGCet al., 2018, Molecular epidemiology of HIV-1 subtype B reveals heterogeneous transmission risk: Implications for intervention and control, Journal of Infectious Diseases, Vol: 217, Pages: 1522-1529, ISSN: 0022-1899

BackgroundThe impact of HIV pre-exposure prophylaxis (PrEP) depends on infections averted by protecting vulnerable individuals as well as infections averted by preventing transmission by those who would have been infected if not receiving PrEP. Analysis of HIV phylogenies reveals risk factors for transmission, which we examine as potential criteria for allocating PrEP.MethodsWe analyzed 6912 HIV-1 partial pol sequences from men who have sex with men (MSM) in the United Kingdom combined with global reference sequences and patient-level metadata. Population genetic models were developed that adjust for stage of infection, global migration of HIV lineages, and changing incidence of infection through time. Models were extended to simulate the effects of providing susceptible MSM with PrEP.ResultsWe found that young age <25 years confers higher risk of HIV transmission (relative risk = 2.52 [95% confidence interval, 2.32–2.73]) and that young MSM are more likely to transmit to one another than expected by chance. Simulated interventions indicate that 4-fold more infections can be averted over 5 years by focusing PrEP on young MSM.ConclusionsConcentrating PrEP doses on young individuals can avert more infections than random allocation.

Journal article

Wymant C, Hall M, Ratmann O, Bonsall D, Golubchik T, de Cesare M, Gall A, Cornelissen M, Fraser C, STOP-HCV Consortium, The Maela Pneumococcal Collaboration, and The BEEHIVE Collaborationet al., 2018, PHYLOSCANNER: Inferring Transmission from Within- and Between-Host Pathogen Genetic Diversity., Mol Biol Evol, Vol: 35, Pages: 719-733

A central feature of pathogen genomics is that different infectious particles (virions and bacterial cells) within an infected individual may be genetically distinct, with patterns of relatedness among infectious particles being the result of both within-host evolution and transmission from one host to the next. Here, we present a new software tool, phyloscanner, which analyses pathogen diversity from multiple infected hosts. phyloscanner provides unprecedented resolution into the transmission process, allowing inference of the direction of transmission from sequence data alone. Multiply infected individuals are also identified, as they harbor subpopulations of infectious particles that are not connected by within-host evolution, except where recombinant types emerge. Low-level contamination is flagged and removed. We illustrate phyloscanner on both viral and bacterial pathogens, namely HIV-1 sequenced on Illumina and Roche 454 platforms, HCV sequenced with the Oxford Nanopore MinION platform, and Streptococcus pneumoniae with sequences from multiple colonies per individual. phyloscanner is available from https://github.com/BDI-pathogens/phyloscanner.

Journal article

Wymant C, Blanquart F, Golubchik T, Gall A, Bakker M, Bezemer D, Croucher NJ, Hall M, Hillebregt M, Ong SH, Ratmann O, Albert J, Bannert N, Fellay J, Fransen K, Gourlay A, Grabowski MK, Gunsenheimer-Bartmeyer B, Günthard HF, Kivelä P, Kouyos R, Laeyendecker O, Liitsola K, Meyer L, Porter K, Ristola M, van Sighem A, Berkhout B, Cornelissen M, Kellam P, Reiss P, Fraser C, BEEHIVE Collaborationet al., 2018, Easy and accurate reconstruction of whole HIV genomes from short-read sequence data with shiver, Virus Evolution, Vol: 4, ISSN: 2057-1577

Studying the evolution of viruses and their molecular epidemiology relies on accurate viral sequence data, so that small differences between similar viruses can be meaningfully interpreted. Despite its higher throughput and more detailed minority variant data, next-generation sequencing has yet to be widely adopted for HIV. The difficulty of accurately reconstructing the consensus sequence of a quasispecies from reads (short fragments of DNA) in the presence of large between- and within-host diversity, including frequent indels, may have presented a barrier. In particular, mapping (aligning) reads to a reference sequence leads to biased loss of information; this bias can distort epidemiological and evolutionary conclusions. De novo assembly avoids this bias by aligning the reads to themselves, producing a set of sequences called contigs. However contigs provide only a partial summary of the reads, misassembly may result in their having an incorrect structure, and no information is available at parts of the genome where contigs could not be assembled. To address these problems we developed the tool shiver to pre-process reads for quality and contamination, then map them to a reference tailored to the sample using corrected contigs supplemented with the user's choice of existing reference sequences. Run with two commands per sample, it can easily be used for large heterogeneous data sets. We used shiver to reconstruct the consensus sequence and minority variant information from paired-end short-read whole-genome data produced with the Illumina platform, for sixty-five existing publicly available samples and fifty new samples. We show the systematic superiority of mapping to shiver's constructed reference compared with mapping the same reads to the closest of 3,249 real references: median values of 13 bases called differently and more accurately, 0 bases called differently and less accurately, and 205 bases of missing sequence recovered. We also successfully applied sh

Journal article

Ratmann O, Ha Minh Lam, Boni MF, 2017, Improved algorithmic complexity for the 3SEQ recombination detection algorithm, Molecular Biology and Evolution, Vol: 35, Pages: 247-251, ISSN: 1537-1719

Identifying recombinant sequences in an era of large genomic databases is challenging as it requires an efficient algorithm to identify candidate recombinants and parents, as well as appropriate statistical methods to correct for the large number of comparisons performed. In 2007, a computation was introduced for an exact nonparametric mosaicism statistic that gave high-precision p-values for putative recombinants. This exact computation meant that multiple-comparisons corrected p-values also had high precision, which is crucial when performing millions or billions of tests in large databases. Here, we introduce an improvement to the algorithmic complexity of this computation from O(mn3) to O(mn2), where m and n are the numbers of recombination-informative sites in the candidate recombinant. This new computation allows for recombination analysis to be performed in alignments with thousands of polymorphic sites. Benchmark runs are presented on viral genome sequence alignments, new features are introduced, and applications outside recombination analysis are discussed.

Journal article

Wymant C, Hall M, Ratmann O, Bonsall D, Golubchik T, de Cesare M, Gall A, Cornelissen M, Fraser Cet al., 2017, PHYLOSCANNER: Inferring Transmission from Within‐ and Between-Host Pathogen Genetic Diversity

<jats:title>Abstract</jats:title><jats:p>A central feature of pathogen genomics is that different infectious particles (virions, bacterial cells, etc.) within an infected individual may be genetically distinct, with patterns of relatedness amongst infectious particles being the result of both within-host evolution and transmission from one host to the next. Here we present a new software tool, phyloscanner, which analyses pathogen diversity from multiple infected hosts. phyloscanner provides unprecedented resolution into the transmission process, allowing inference of the direction of transmission from sequence data alone. Multiply infected individuals are also identified, as they harbour subpopulations of infectious particles that are not connected by within-host evolution, except where recombinant types emerge. Low-level contamination is flagged and removed. We illustrate phyloscanner on both viral and bacterial pathogens, namely HIV-1 sequenced on Illumina and Roche 454 platforms, HCV sequenced with the Oxford Nanopore MinION platform, and <jats:italic>Streptococcus pneumoniae</jats:italic> with sequences from multiple colonies per individual. phyloscanner is available from <jats:underline><jats:ext-link xmlns:xlink="http://www.w3.org/1999/xlink" ext-link-type="uri" xlink:href="https://github.com/BDI-pathogens/phyloscanner">https://github.com/BDI-pathogens/phyloscanner</jats:ext-link></jats:underline>.</jats:p>

Journal article

Ratmann O, Wymant C, Colijn C, Danaviah S, Essex M, Frost S, Gall A, von Haeseler A, Kaleebu P, Kendall M, Kozlov A, Manasa J, Quang Minh B, Moyo S, Novitsky V, Nsubuga R, Pillay S, Quinn TC, Serwadda D, Ssemwanga D, Stamatakis A, Trininopoulos J, Wawer M, Leigh Brown A, de Oliveira T, Kellam P, Pillay D, Fraser Cet al., 2017, HIV-1 full-genome phylogenetics of generalized epidemics in sub-Saharan Africa: impact of missing nucleotide characters in next-generation sequences, Aids Research and Human Retroviruses, Vol: 33, Pages: 1083-1098, ISSN: 1931-8405

To characterize HIV-1 transmission dynamics in regions where the burden of HIV-1 is greatest, the “Phylogenetics and Networks for Generalised HIV Epidemics in Africa” consortium (PANGEA-HIV) is sequencing full-genome viral isolates from across sub-Saharan Africa. We report the first 3,985 PANGEA-HIV consensus sequences from four cohort sites (Rakai Community Cohort Study, n = 2,833; MRC/UVRI Uganda, n = 701; Mochudi Prevention Project, n = 359; Africa Health Research Institute Resistance Cohort, n = 92). Next-generation sequencing success rates varied: more than 80% of the viral genome from the gag to the nef genes could be determined for all sequences from South Africa, 75% of sequences from Mochudi, 60% of sequences from MRC/UVRI Uganda, and 22% of sequences from Rakai. Partial sequencing failure was primarily associated with low viral load, increased for amplicons closer to the 3′ end of the genome, was not associated with subtype diversity except HIV-1 subtype D, and remained significantly associated with sampling location after controlling for other factors. We assessed the impact of the missing data patterns in PANGEA-HIV sequences on phylogeny reconstruction in simulations. We found a threshold in terms of taxon sampling below which the patchy distribution of missing characters in next-generation sequences (NGS) has an excess negative impact on the accuracy of HIV-1 phylogeny reconstruction, which is attributable to tree reconstruction artifacts that accumulate when branches in viral trees are long. The large number of PANGEA-HIV sequences provides unprecedented opportunities for evaluating HIV-1 transmission dynamics across sub-Saharan Africa and identifying prevention opportunities. Molecular epidemiological analyses of these data must proceed cautiously because sequence sampling remains below the identified threshold and a considerable negative impact of missing characters on phyloge

Journal article

Ratmann O, Hodcroft EB, Pickles M, Cori A, Hall M, Lycett S, Colijn C, Dearlove B, Didelot X, Frost S, Hossain M, Joy JB, Kendall M, Kühnert D, Leventhal GE, Liang R, Plazzotta G, Poon A, Rasmussen DA, Stadler T, Volz E, Weis C, Leigh Brown AJ, Fraser Cet al., 2017, Phylogenetic tools for generalized HIV-1 epidemics: findings from the PANGEA-HIV methods comparison, Molecular Biology and Evolution, Vol: 34, Pages: 185-203, ISSN: 1537-1719

Viral phylogenetic methods contribute to understanding how HIV spreads in populations, and thereby help guide the design of prevention interventions. So far, most analyses have been applied to well-sampled concentrated HIV-1 epidemics in wealthy countries. To direct the use of phylogenetic tools to where the impact of HIV-1 is greatest, the Phylogenetics And Networks for Generalized HIV Epidemics in Africa (PANGEA-HIV) consortium generates full-genome viral sequences from across sub-Saharan Africa. Analyzing these data presents new challenges, since epidemics are principally driven by heterosexual transmission and a smaller fraction of cases is sampled. Here, we show that viral phylogenetic tools can be adapted and used to estimate epidemiological quantities of central importance to HIV-1 prevention in sub-Saharan Africa. We used a community-wide methods comparison exercise on simulated data, where participants were blinded to the true dynamics they were inferring. Two distinct simulations captured generalized HIV-1 epidemics, before and after a large community-level intervention that reduced infection levels. Five research groups participated. Structured coalescent modeling approaches were most successful: phylogenetic estimates of HIV-1 incidence, incidence reductions, and the proportion of transmissions from individuals in their first 3 months of infection correlated with the true values (Pearson correlation > 90%), with small bias. However, on some simulations, true values were markedly outside reported confidence or credibility intervals. The blinded comparison revealed current limits and strengths in using HIV phylogenetics in challenging settings, provided benchmarks for future methods’ development, and supports using the latest generation of phylogenetic tools to advance HIV surveillance and prevention.

Journal article

Yebra G, Hodcroft EB, Ragonnet-Cronin ML, Pillay D, Brown AJL, PANGEAHIV Consortium, ICONIC Projectet al., 2016, Using nearly full-genome HIV sequence data improves phylogeny reconstruction in a simulated epidemic., Scientific Reports, Vol: 6, ISSN: 2045-2322

HIV molecular epidemiology studies analyse viral pol gene sequences due to their availability, but whole genome sequencing allows to use other genes. We aimed to determine what gene(s) provide(s) the best approximation to the real phylogeny by analysing a simulated epidemic (created as part of the PANGEA_HIV project) with a known transmission tree. We sub-sampled a simulated dataset of 4662 sequences into different combinations of genes (gag-pol-env, gag-pol, gag, pol, env and partial pol) and sampling depths (100%, 60%, 20% and 5%), generating 100 replicates for each case. We built maximum-likelihood trees for each combination using RAxML (GTR + Γ), and compared their topologies to the corresponding true tree's using CompareTree. The accuracy of the trees was significantly proportional to the length of the sequences used, with the gag-pol-env datasets showing the best performance and gag and partial pol sequences showing the worst. The lowest sampling depths (20% and 5%) greatly reduced the accuracy of tree reconstruction and showed high variability among replicates, especially when using the shortest gene datasets. In conclusion, using longer sequences derived from nearly whole genomes will improve the reliability of phylogenetic reconstruction. With low sample coverage, results can be highly variable, particularly when based on short sequences.

Journal article

Lamers SL, Barbier A, Ratmann O, Fraser C, Rose R, Laeyendecker O, Grabowski Met al., 2016, HIV-1 Sequence Data Coverage in Central East Africa from 1959-2013, AIDS Research and Human Retroviruses, ISSN: 0889-2229

Central and Eastern African HIV sequence data has been most critical in understanding the establishment and evolution of the global HIV pandemic. Here we report on the extent of publically available HIV genetic sequence data in the Los Alamos National Laboratory Sequence Database sampled from 1959-2013 from six African countries: Uganda, Kenya, Tanzania, Burundi, the Democratic Republic of Congo, and Rwanda. We have summarized these data, including HIV subtypes, the years sampled, and the genomic regions sequenced. We also provide curated alignments for this important geographic area in five HIV genomic regions with substantial coverage.

Abstract
Cite

Journal article

Wilkinson E, Rasmussen D, Ratmann O, Stadler T, Engelbrecht S, de Oliveira Tet al., 2016, Origin, imports and exports of HIV-1 subtype C in South Africa: a historical perspective, Infection, Genetics and Evolution, Vol: 46, Pages: 200-208, ISSN: 1567-7257

BACKGROUND: While the HIV epidemic in South Africa had a later onset than epidemics in other southern African countries, prevalence grew rapidly during the 1990's when the country was going through socio-political changes with the end of Apartheid. South Africa currently has the largest number of people living with HIV in the world and the epidemic is dominated by a unique subtype, HIV-1 subtype C. This large epidemic is also characterized by high level of genetic diversity. We hypothesize that this diversity is due to multiple introductions of the virus during the period of change. In this paper, we apply novel phylogeographic methods to estimate the number of viral imports and exportsfrom the start of the epidemic to the present. METHODS: We assembled 11,289 unique subtype C pol sequences from southern Africa. These represent one of the largest sequence datasets ever analyzed in the region. Sequences were stratified based on country of sampling and levels of genetic diversity were estimated for each country. Sequences were aligned and a maximum-likelihood evolutionary tree was inferred. Least-Squares Dating was then used to obtain a dated phylogeny from which we estimated the number of introductions into and exports out of South Africa using parsimony-based ancestral location reconstructions. RESULTS: Our results identified 189 viral introductions into South Africa with the largest number of introductions attributed to Zambia (n=109), Botswana (n=32), Malawi (n=26) and Zimbabwe (n=13). South Africa also exported many viral lineages to its neighbours. The bulk viral imports and exports appear to have occurred between 1985 and 2000, coincident with the period of socio-political transition. CONCLUSION: The high level of subtype C genetic diversity in South Africa is related to multiple introductions of the virus to the country. While the number of viral imports and exports we identified was highly sensitive to the number of samples included from each country, they mo

Journal article

Nakagawa F, van Sighem A, Thiebaut R, Smith C, Ratmann O, Cambiano V, Albert J, Amato-Gauci A, Bezemer D, Campbell C, Commenges D, Donoghoe M, Ford D, Kouyos R, Lodwick R, Lundgren J, Pantazis N, Pharris A, Quinten C, Thorne C, Touloumi G, Delpech V, Philips Aet al., 2016, A method to estimate the size and characteristics of HIV-positive populations using an individual-based stochastic simulation model, Epidemiology, Vol: 27, Pages: 247-256, ISSN: 1531-5487

It is important not only to collect epidemiologic data onHIV but to also fully utilize such information to understand the epidemicover time and to help inform and monitor the impact of policiesand interventions. We describe and apply a novel method to estimatethe size and characteristics of HIV-positive populations. The methodwas applied to data on men who have sex with men living in the UKand to a pseudo dataset to assess performance for different data availability.The individual-based simulation model was calibrated using an approximate Bayesian computation-based approach. In 2013,48,310 (90% plausibility range: 39,900–45,560) men who have sexwith men were estimated to be living with HIV in the UK, of whom10,400 (6,160–17,350) were undiagnosed. There were an estimated3,210 (1,730–5,350) infections per year on average between 2010and 2013. Sixty-two percent of the total HIV-positive population arethought to have viral load <500 copies/ml. In the pseudo-epidemicexample, HIV estimates have narrower plausibility ranges and arecloser to the true number, the greater the data availability to calibratethe model. We demonstrate that our method can be applied to settingswith less data, however plausibility ranges for estimates will be widerto reflect greater uncertainty of the data used to fit the model.

Journal article

Ratmann O, van Sighem A, Bezemer D, Gavryushkina A, Jurriaans S, Wensing A, de Wolf F, Reiss P, Fraser Cet al., 2016, Sources of HIV infection among men having sex with men and implications for prevention, Science Translational Medicine, Vol: 8, ISSN: 1946-6242

Journal article

Bezemer D, Cori A, Ratmann O, van Sighem A, Hermanides HS, Dutilh BE, Gras L, Rodrigues Faria N, van den Hengel R, Duits AJ, Reiss P, de Wolf F, Fraser C, ATHENA observational cohortet al., 2015, Dispersion of the HIV-1 Epidemic in Men Who Have Sex with Men in the Netherlands: A Combined Mathematical Model and Phylogenetic Analysis., PLOS Medicine, Vol: 12, Pages: e1001898-e1001898, ISSN: 1549-1277

BACKGROUND: The HIV-1 subtype B epidemic amongst men who have sex with men (MSM) is resurgent in many countries despite the widespread use of effective combination antiretroviral therapy (cART). In this combined mathematical and phylogenetic study of observational data, we aimed to find out the extent to which the resurgent epidemic is the result of newly introduced strains or of growth of already circulating strains. METHODS AND FINDINGS: As of November 2011, the ATHENA observational HIV cohort of all patients in care in the Netherlands since 1996 included HIV-1 subtype B polymerase sequences from 5,852 patients. Patients who were diagnosed between 1981 and 1995 were included in the cohort if they were still alive in 1996. The ten most similar sequences to each ATHENA sequence were selected from the Los Alamos HIV Sequence Database, and a phylogenetic tree was created of a total of 8,320 sequences. Large transmission clusters that included ≥10 ATHENA sequences were selected, with a local support value ≥ 0.9 and median pairwise patristic distance below the fifth percentile of distances in the whole tree. Time-varying reproduction numbers of the large MSM-majority clusters were estimated through mathematical modeling. We identified 106 large transmission clusters, including 3,061 (52%) ATHENA and 652 Los Alamos sequences. Half of the HIV sequences from MSM registered in the cohort in the Netherlands (2,128 of 4,288) were included in 91 large MSM-majority clusters. Strikingly, at least 54 (59%) of these 91 MSM-majority clusters were already circulating before 1996, when cART was introduced, and have persisted to the present. Overall, 1,226 (35%) of the 3,460 diagnoses among MSM since 1996 were found in these 54 long-standing clusters. The reproduction numbers of all large MSM-majority clusters were around the epidemic threshold value of one over the whole study period. A tendency towards higher numbers was visible in recent years, especially in the more recently

Journal article

Pillay D, Herbeck J, Cohen MS, de Oliveira T, Fraser C, Ratmann O, Brown AL, Kellam Pet al., 2015, PANGEA-HIV: phylogenetics for generalised epidemics in Africa, LANCET INFECTIOUS DISEASES, Vol: 15, Pages: 259-261, ISSN: 1473-3099

Journal article

Jombart T, Aanensen DM, Baguelin M, Birrell P, Cauchemez S, Camacho A, Colijn C, Collins C, Cori A, Didelot X, Fraser C, Frost S, Hens N, Hugues J, Hohle M, Opatowski L, Rambautm A, Ratmann O, Soubeyrand S, Suchard MA, Wallinga J, Ypma R, Ferguso Net al., 2014, OutbreakTools: A new platform for disease outbreak analysis using the R software, Epidemics, Vol: 7, Pages: 28-34, ISSN: 1755-4365

The investigation of infectious disease outbreaks relies on the analysis of increasingly complex and diverse data, which offer new prospects for gaining insights into disease transmission processes and informing public health policies. However, the potential of such data can only be harnessed using a number of different, complementary approaches and tools, and a unified platform for the analysis of disease outbreaks is still lacking. In this paper, we present the new R package OutbreakTools, which aims to provide a basis for outbreak data management and analysis in R. OutbreakTools is developed by a community of epidemiologists, statisticians, modellers and bioinformaticians, and implements classes and methods for storing, handling and visualizing outbreak data. It includes real and simulated outbreak datasets. Together with a number of tools for infectious disease epidemiology recently made available in R, OutbreakTools contributes to the emergence of a new, free and open-source platform for the analysis of disease outbreaks.

Journal article

Ratmann O, Donker G, Meijer A, Fraser C, Koelle Ket al., 2012, Phylodynamic Inference and Model Assessment with Approximate Bayesian Computation: Influenza as a Case Study, PLoS Computational Biology, Vol: 8, ISSN: 1553-7358

A key priority in infectious disease research is to understand the ecological and evolutionary drivers of viral diseases from data on disease incidence as well as viral genetic and antigenic variation. We propose using a simulation-based, Bayesian method known as Approximate Bayesian Computation (ABC) to fit and assess phylodynamic models that simulate pathogen evolution and ecology against summaries of these data. We illustrate the versatility of the method by analyzing two spatial models describing the phylodynamics of interpandemic human influenza virus subtype A(H3N2). The first model captures antigenic drift phenomenologically with continuously waning immunity, and the second epochal evolution model describes the replacement of major, relatively long-lived antigenic clusters. Combining features of long-term surveillance data from the Netherlands with features of influenza A (H3N2) hemagglutinin gene sequences sampled in northern Europe, key phylodynamic parameters can be estimated with ABC. Goodness-of-fit analyses reveal that the irregularity in interannual incidence and H3N2's ladder-like hemagglutinin phylogeny are quantitatively only reproduced under the epochal evolution model within a spatial context. However, the concomitant incidence dynamics result in a very large reproductive number and are not consistent with empirical estimates of H3N2's population level attack rate. These results demonstrate that the interactions between the evolutionary and ecological processes impose multiple quantitative constraints on the phylodynamic trajectories of influenza A(H3N2), so that sequence and surveillance data can be used synergistically. ABC, one of several data synthesis approaches, can easily interface a broad class of phylodynamic models with various types of data but requires careful calibration of the summaries and tolerance parameters.

Journal article

Arinaminpathy N, Ratmann O, Koelle K, Epstein SL, Price GE, Viboud C, Miller MA, Grenfell BTet al., 2012, Impact of cross-protective vaccines on epidemiological and evolutionary dynamics of influenza, Proceedings of the National Academy of Sciences of USA, Vol: 109, Pages: 3173-3177, ISSN: 0027-8424

Large-scale immunization has profoundly impacted control of many infectious diseases such as measles and smallpox because of the ability of vaccination campaigns to maintain long-term herd immunity and, hence, indirect protection of the unvaccinated. In the case of human influenza, such potential benefits of mass vaccination have so far proved elusive. The central difficulty is a considerable viral capacity for immune escape; new pandemic variants, as well as viral escape mutants in seasonal influenza, compromise the buildup of herd immunity from natural infection or deployment of current vaccines. Consequently, most current influenza vaccination programs focus mainly on protection of specific risk groups, rather than mass prophylactic protection. Here, we use epidemiological models to show that emerging vaccine technologies, aimed at broad-spectrum protection, could qualitatively alter this picture. We demonstrate that sustained immunization with such vaccines could—through potentially lowering transmission rates and improving herd immunity—significantly moderate both influenza pandemic and seasonal epidemics. More subtly, phylodynamic models indicate that widespread cross-protective immunization could slow the antigenic evolution of seasonal influenza; these effects have profound implications for a transition to mass vaccination strategies against human influenza, and for the management of antigenically variable viruses in general.

Journal article

Koelle K, Ratmann O, Rasmussen DA, Pasour V, Mattingly Jet al., 2011, A dimensionless number for understanding the evolutionary dynamics of antigenically variable RNA viruses, PROCEEDINGS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, Vol: 278, Pages: 3723-3730, ISSN: 0962-8452

Journal article

Camacho A, Ballesteros S, Graham AL, Carrat F, Ratmann O, Cazelles Bet al., 2011, Explaining rapid reinfections in multiple-wave influenza outbreaks: Tristan da Cunha 1971 epidemic as a case study, PROCEEDINGS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, Vol: 278, Pages: 3635-3643, ISSN: 0962-8452

Journal article

Rasmussen DA, Ratmann O, Koelle K, 2011, Inference for nonlinear epidemiological models using genealogies and time series, PLOS Computational Biology, Vol: 7, ISSN: 1553-734X

Phylodynamics - the field aiming to quantitatively integrate the ecological and evolutionary dynamics of rapidly evolving populations like those of RNA viruses – increasingly relies upon coalescent approaches to infer past population dynamics from reconstructed genealogies. As sequence data have become more abundant, these approaches are beginning to be used on populations undergoing rapid and rather complex dynamics. In such cases, the simple demographic models that current phylodynamic methods employ can be limiting. First, these models are not ideal for yielding biological insight into the processes that drive the dynamics of the populations of interest. Second, these models differ in form from mechanistic and often stochastic population dynamic models that are currently widely used when fitting models to time series data. As such, their use does not allow for both genealogical data and time series data to be considered in tandem when conducting inference. Here, we present a flexible statistical framework for phylodynamic inference that goes beyond these current limitations. The framework we present employs a recently developed method known as particle MCMC to fit stochastic, nonlinear mechanistic models for complex population dynamics to gene genealogies and time series data in a Bayesian framework. We demonstrate our approach using a nonlinear Susceptible-Infected-Recovered (SIR) model for the transmission dynamics of an infectious disease and show through simulations that it provides accurate estimates of past disease dynamics and key epidemiological parameters from genealogies with or without accompanying time series data.

Journal article

Ratmann O, Andrieu C, Wiuf C, Richardson Set al., 2010, Reply to Robert et al.: Model criticism informs model choice and model comparison, Proceedings of the National Academy of Sciences of the United States of America, Vol: 107, Pages: E6-E7, ISSN: 0027-8424

In their letter to PNAS and a comprehensive set of notes on arXiv[arXiv:0909.5673v2], Christian Robert, Kerrie Mengersen and Carla Chen (RMC)represent our approach to model criticism in situations when the likelihoodcannot be computed as a way to "contrast several models with each other". Inaddition, RMC argue that model assessment with Approximate Bayesian Computationunder model uncertainty (ABCmu) is unduly challenging and question its Bayesianfoundations. We disagree, and clarify that ABCmu is a probabilistically soundand powerful too for criticizing a model against aspects of the observed data,and discuss further the utility of ABCmu.

Journal article

Ratmann O, Andrieu C, Wiuf C, Richardson Set al., 2009, Model criticism based on likelihood-free inference, with an application to protein network evolution, PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, Vol: 106, Pages: 10576-10581, ISSN: 0027-8424

Author Web Link
Cite
Citations: 107

Journal article

Wiuf C, Ratmann O, 2009, Evolutionary analysis of protein interaction networks, Statistical and Evolutionary Analysis of Biological Networks, Pages: 17-43, ISBN: 9781848164338

Systems approaches to understanding the structure, organisation and functioning of organisms and cells are now becoming commonplace. In this chapter we focus on protein interaction networks and their potential use for inference on the evolutionary processes that have shaped the interactome, the collection of all proteins in a cell together with their physical interactions. We demonstrate that simple mathematical models may capture essential aspects of the processes and use these to develop a Bayesian likelihood-free scheme for inference on three small organisms T. pallidum, H. pylori and P. falciparum.

Abstract
Cite
Citations: 2

Book chapter

This data is extracted from the Web of Science and reproduced under a licence from Thomson Reuters. You may not copy or re-distribute this data in whole or in part without the written consent of the Science business of Thomson Reuters.

Request URL: http://wlsprd.imperial.ac.uk:80/respub/WEB-INF/jsp/search-html.jsp Request URI: /respub/WEB-INF/jsp/search-html.jsp Query String: id=00459135&limit=30&person=true&page=3&respub-action=search.html