University of Connecticut University of UC Title Fallback Connecticut


Adiantum (maidenhair fern) pinnule with inlaid phylogeny

Phylogenetics (EEB 5349)

This is a graduate-level course in phylogenetics, emphasizing primarily maximum likelihood and Bayesian approaches to estimating phylogenies, which are genealogies at or above the species level. A primary goal is to provide an accessible introduction to the theory so that by the end of the course students should be able to understand much of the primary literature on modern phylogenetic methods and know how to intelligently apply these methods to their own problems. The laboratory provides hands-on experience with several important phylogenetic software packages (PAUP*, GARLI, RAxML, MrBayes, RevBayes, BEAST) and introduces students to the use of remote high performance computing resources to perform phylogenetic analyses.

EEB 5349 is being taught Spring Semester 2018:

Lecture: Tuesday/Thursday 11-12:15  (lecture instructor: Paul O. Lewis)
Lab: Friday 1:25-3:20 (laboratory instructor: Kevin Keegan)
Room: Torrey Life Science (TLS) 181, Storrs Campus
Text: none required, registered students will receive PDF copies of a textbook I am currently writing (see list of optional texts below)


Important! The syllabus below is left over from the Spring 2016 version of the course, and will change somewhat before (and during) the Spring 2018 version. I have removed links and will reinstate them as the semester progresses. Homeworks are due 1 week after the date they are assigned in the syllabus.

Date Lecture topics Lab/Homework
Jan. 16
The jargon of phylogenetic trees (edges, vertices, leaves, cherries, degree, split, polytomy, taxon, clade); types of genealogies; rooted vs. unrooted trees; newick descriptions; monophyletic, paraphyletic, and polyphyletic groups; why are phylogenies useful?
Homework 1: trees from splits (due in lecture Tuesday Jan 23)
Jan. 18
Optimality criteria, search strategies
Exhaustive enumeration, branch-and-bound search, algorithmic methods (star decomposition, stepwise addition, NJ), heuristic search strategies (NNI, SPR, TBR), evolutionary algorithms
Lab: Using the UConn Bioinformatics Facility cluster; Introduction to PAUP*NEXUS format
Jan. 23
Consensus trees, the parsimony criterion
Strict, semi-strict, and majority-rule consensus trees; maximum agreement subtrees; Camin-Sokal, Wagner, Fitch, Dollo, and transversion parsimony; step matrices and generalized parsimony
Homework 2: Parsimony (due in class Tuesday, Jan. 30)
Jan. 25
Bootstrapping, distance methods
Bootstrapping; Distance methods: split decomposition, quartet puzzling, neighbor-joining, least squares criterion, minimum evolution criterion
Lab: Python Primer
Jan. 30
Substitution models (updated after lecture)
Transition probability, instantaneous rates, Poisson processes, JC69 model, K2P model, F81 model, F84 model, HKY85 model, GTR model
Homework 3: Distances
Feb. 1
Maximum likelihood criterion
Likelihood: the probability of data given a model, maximum likelihood estimates (MLEs) of model parameters, likelihood of a tree, likelihood ratio test
Lab: Searching
Feb. 6
Rate heterogeneity
Proportion of invariable sites, discrete gamma, site-specific rates
Homework 4: Likelihood
Feb. 8
Codon, amino acid, secondary structure models
Empirical amino acid rate matrices, transition probabilities by exponentiating the rate matrix, RNA stem/loop structure, compensatory substitutions, stem models, nonsynonymous vs. synonymous rates, codon models.
 Lab: Likelihood
Feb. 13
Model selection
Likelihood ratio test (LRT), Akaike Information criterion (AIC), Bayesian Information Criterion (BIC)
Expected number of substitutions
An example derivation for the F81 model
Homework 5: Rate heterogeneity
Feb. 15
How to simulate nucleotide sequence data, and why it’s done
Long branch attraction
Statistical consistency, long branch attraction
Lab: ML analyses of large data sets using RAxML and GARLI
Feb. 20
Topology tests
ILD, KH, SH, AU and SOWH tests
Homework 6: Simulation
Feb. 22
Bayesian statistics
Conditional/joint probabilities, Bayes rule, prior vs. posterior distributions, probability mass vs. probability density
Lab: Exploring probability distributions using R
Feb. 27
Markov chain Monte Carlo
Metropolis algorithm, MCMC, mixing, heated chains, Hastings ratio
Homework 7: MCMC
Mar. 1
Priors used in Bayesian phylogenetics
Commonly-used prior distributions: Beta, Gamma, Lognormal, Dirichlet
Lab: MrBayes 3.2
Mar. 5
Prior miscellany
Hierarchical models and hyperpriors, Empirical Bayes, Dirichlet process priors, MCMC without data
Confidence vs. credible intervals
Frequentist confidence intervals differ from Bayesian credible intervals
Homework 8: LOCAL move
Mar. 7
Bayesian model selection
Marginal likelihoods and Bayes factors
 Lab: Morphology, partitioning and model selection in MRBAYES
Mar. 13
Mar. 15
Mar. 20
Discrete morphological characters
DNA sequences vs. morphological characters, Symmetric vs. asymmetric 2-state models, Mk model
Mar. 22
 TBA Lab: HyPhy
Mar. 27
Discussion of study guide questions Homework 9: Independent contrasts
Mar. 29
Correlated discrete character evolution
Pagel’s likelihood ratio test
Correlated continuous character evolution
Felsenstein’s independent contrasts
(simulator shown in class)
Apr. 3
Phylogenetic Generalized Least Squares (PGLS)
Linear regression with correlation structure of residuals determined by the phylogeny
Read O’Meara (2012) before Tuesday Apr. 10
Apr. 5
Stochastic character mapping
Introduction to the use of stochastic character mapping for estimating ancestral states and character correlation
Lab: APE
Apr. 10
Discussion of O’Meara (2012)
O’Meara, B. C. 2012. Evolutionary inferences from phylogenies: a review of methods. Ann. Rev. Ecol. Evol. Syst. 43:267-285.
Mixture models
Mixture of rate matrices, rjMCMC, heterotachy models, covarion models, Dirichlet process models.
Read Maddison and FitzJohn (2015) before Tuesday Apr. 19
Apr. 12
Divergence time estimation
Thorne/Kishino autocorrelated log-normal model; BEAST uncorrelated log-normal model; Yule tree priors; Fossilized Birth-Death Prior
Apr. 17
Discussion of Maddison & FitzJohn (2015)
Maddison, W. P., and FitzJohn, R. G. 2015. The unsolved challenge to phylogenetic correlation tests for categorical characters. Syst. Biol. 64(1):127-136.
Just enough coalescent theory
Coalescent theory needed for understanding the multispecies coalescent model
Final exam (take-home) will be handed out (due May 5 at 5:30pm)
Apr. 19
Gene trees within species trees
 Lab: *BEAST
Apr. 24
Medley of topics
Polytomy priors and community phylogenetics
No homework (use time to work on final)
Apr. 26
Medley (cont.)
Bayesian information content
Astral-2, SVDQuartets
(GARLI bootstrap trees)
Thursday May 3 Final exam due by 5:30pm

Books on phylogenetics

This is a list of books that you should know about, but none are required texts for this course. Listed in reverse chronological order.

Yang, Z. 2014. Molecular evolution: a statistical approach. Oxford University Press.

Baum, D. A., and S. D. Smith. 2013. Tree thinking: an introduction to phylogenetic biology. Roberts and Company Publishers, Greenwood Village, Colorado. (This book is probably the most useful companion volume for this course, introducing the methods in a very accessible way but also providing lots of practice interpreting phylogenies correctly.)

Hall, B. G. 2011. Phylogenetic trees made easy: a how-to manual (4th edition). Sinauer Associates, Sunderland. (A guide to running some of the most important phylogenetic software packages.)

Lemey, P., Salemi, M., and Vandamme, A.-M. 2009. The phylogenetic handbook: a practical approach to phylogenetic analysis and hypothesis testing (2nd edition). Cambridge University Press, Cambridge, UK (Chapters on theory are paired with practical chapters on software related to the theory.)

Felsenstein, J. 2004. Inferring phylogenies. Sinauer Associates, Sunderland. (Comprehensive overview of both history and methods of phylogenetics.)

Page, R., and Holmes, E. 1998. Molecular evolution: a phylogenetic approach. Blackwell Science (Very nice and accessible pre-Bayesian-era introduction to the field.)

Hillis, D., Moritz, C., and Mable, B. 1996. Molecular systematics (2nd ed.). Sinauer Associates, Sunderland. Chapters 11 (“Phylogenetic inference”) and 12 (“Applications of molecular systematics”). (Still a very valuable compendium of pre-Bayesian-era phylogenetic methods.)