Evolutionary Dynamics and Molecular Epidemiology of Viruses

Meeting Times:

  • Wednesday, July 24, 12:00 PM – 5:00 PM
  • Thursday July 25, 8:30 AM – 5:00 PM
  • Friday July 26, 8:30 AM – 5:00 PM

Classroom: TBA

Module Summary:
Genetic sequence data is increasingly being used to track or characterize how diseases spread. During the current COVID-19 pandemic, in particular, genetic sequence data has become an important source of information. Initially, it gave us evidence for human-to-human transmission. It was then used to learn about how outbreaks in different parts of the world are related and to track the speed at which COVID-19 spreads. The reason why we can use genetic sequence data to do so is because random errors occur when SARS-CoV-2 (the virus that causes COVID-19) infects individuals, and those errors are then transmitted to other hosts. The further apart in the transmission history, pathogens isolated from different hosts are, the more divergent their genetic sequences will be. This in turn, means that we can use genetic sequences of pathogens to learn something about the transmission dynamics of pathogens. The most popular way to do so is by using phylogenetic and phylodynamic methods. These allow us to reconstruct how individual pathogens isolated from different patients are related and to reconstruct past transmission dynamics.

In this module, we will learn how to go from genetic sequences to learning something about transmission dynamics. To do so, we will look at how to use phylogenetic and bioinformatic tools to reconstruct the spread of pathogens from genetic sequence data. In particular, we will be focusing on Bayesian phylogenetics.

We will first briefly cover the different components of  Bayesian phylogenetic analyses, such as different evolutionary models. Additionally, we will introduce different phylodynamic models, such as coalescent and birth-death models, that allow us to extract information about past population dynamics from genetic sequence data.

As the main software, we will be using BEAST (Bayesian Evolutionary Analysis by Sampling Trees) and BEAST2. We will be covering how to set up and interpret analyses using lectures and tutorials that are focused on estimating evolutionary rates and population dynamics through time. Additionally, we will look into evolutionary processes including recombination and reassortment.

Prerequisites:
This module assumes knowledge of the material in Module 1: Probability and Statistical Inference, though not necessarily from taking that module.

Module Content:

  • Introduction to the BEAST2 workflow
  • How evolution is modeled in Bayesian phylogenetics (clock and site models)
  • Phylodynamics: How can we retrieve population dynamics from phylogenetics (tree priors)
  • How can we quantify the spatial spread of infectious diseases (structured tree priors)
  • How can we account for recombination or reassortment in the evolutionary history (Bayesian phylogenetic network inference, phylodynamics with recombination)

Instructors

Nicola Müller, PhD

Nicola Müller, PhD

Assistant Professor, Division of HIV, ID, and Global Medicine, University of California San Francisco

Dr. Müller is an Assistant Professor at UCSF in the EPPIcenter and the Division of HIV, ID and Global Medicine. He uses phylodynamics to quantify the transmission dynamics of bacterial and viral infectious diseases, ultimately to inform public health interventions. In doing so, he developed multiple phylodynamics approaches to study population structure, recombination and reassortment. He works on several aspects of Bayesian phylogenetics and phylodynamics. He's developed several BEAST2 software packages and has published multiple BEAST2 addons, including MASCOT, CoupledMCMC, CoalRe, Recombination etc.

Learn More >>

Julia Palacios, PhD

Julia Palacios, PhD

Assistant Professor, Department of Statistics Stanford University

In her research, Professor Palacios seeks to provide statistically rigorous answers to concrete, data-driven questions in population genetics, epidemiology, and comparative genomics, often involving probabilistic modeling of evolutionary forces and the development of computationally tractable methods that are applicable to big data problems. Past and current research relies heavily on the theory of stochastic processes and recent developments in machine learning and statistical theory for big data; future research plans are aimed at incorporating the effects of selection and population structure in Bayesian inference of evolutionary parameters such as effective population size and recombination rates, and development of more realistic and computationally efficient methods for phylodynamic methods of infectious diseases.

Learn More >>

Required Software:

Recommended Reading:
Primary research and tutorial articles will be provided for additional reading.