Estimate combinations of the tree topology + parameters that are plausible given a phylogenetic model and data
Estimate combinations of the tree topology + parameters that are plausible given a phylogenetic model and data
Estimate combinations of the tree topology + parameters that are plausible given a phylogenetic model and data
Estimate combinations of the tree topology + parameters that are plausible given a phylogenetic model and data
Estimate combinations of the tree topology + parameters that are plausible given a phylogenetic model and data
Estimate combinations of the tree topology + parameters that are plausible given a phylogenetic model and data
Estimate combinations of the tree topology + parameters that are plausible given a phylogenetic model and data
Estimate combinations of the tree topology + parameters that are plausible given a phylogenetic model and data
This is a contrast with other methods
Ultimately, we are often estimating long-ago events from biased and scarce data.
The best thing we can be is humble
The best thing we can be is humble
and that means becoming friendly with uncertainty
In 2018, Sansom et al. did a comparison between Bayesian and parsimony trees using measures of stratigraphic congruence
Concluded parsimony trees have better stratigraphic congruence
There are many ways to measure stratigraphic congruence
Metric | Meaning | Range |
---|---|---|
Stratigraphic Consistency Index (SCI) | Proportion of nodes for which the oldest descendent of that node is younger than the oldest descendent of that node’s ancestor | 0 to 1, with one being perfectly consistent |
Minimum Implied Gap (MIG) | The sum of the branch lengths excluding tip durations | Positive numbers in millions of years |
Relative Completeness Index (RCI) | MIG score proportional to the summed length of tip durations | All real numbers |
Manhattan Stratigraphic Measure (MSM*) | MIG for the maximally stratigraphically consistent possible tree divided by the actual MIG | 0 to 1, with one being the most consistent |
Gap Excess Ratio (GER) | MIG minus the best possible stratigraphic fit, scaled by the contrast between the best and worst fit values | 0 to 1, with one being the most consistent |
These metrics quantify how consistent a phylogeny is with the rock record
Metric | Meaning | Range |
---|---|---|
Stratigraphic Consistency Index (SCI) | Proportion of nodes for which the oldest descendent of that node is younger than the oldest descendent of that node’s ancestor | 0 to 1, with one being perfectly consistent |
Minimum Implied Gap (MIG) | The sum of the branch lengths excluding tip durations | Positive numbers in millions of years |
Relative Completeness Index (RCI) | MIG score proportional to the summed length of tip durations | All real numbers |
Manhattan Stratigraphic Measure (MSM*) | MIG for the maximally stratigraphically consistent possible tree divided by the actual MIG | 0 to 1, with one being the most consistent |
Gap Excess Ratio (GER) | MIG minus the best possible stratigraphic fit, scaled by the contrast between the best and worst fit values | 0 to 1, with one being the most consistent |
These metrics quantify how consistent a phylogeny is with the rock record
Metric | Meaning | Range |
---|---|---|
Stratigraphic Consistency Index (SCI) | Proportion of nodes for which the oldest descendent of that node is younger than the oldest descendent of that node’s ancestor | 0 to 1, with one being perfectly consistent |
Minimum Implied Gap (MIG) | The sum of the branch lengths excluding tip durations | Positive numbers in millions of years |
Relative Completeness Index (RCI) | MIG score proportional to the summed length of tip durations | All real numbers |
Manhattan Stratigraphic Measure (MSM*) | MIG for the maximally stratigraphically consistent possible tree divided by the actual MIG | 0 to 1, with one being the most consistent |
Gap Excess Ratio (GER) | MIG minus the best possible stratigraphic fit, scaled by the contrast between the best and worst fit values | 0 to 1, with one being the most consistent |
Sansom et al. took 500 samples from the posterior and compared those with the most parsimonious tree
But is there more information in the posterior than a random sample can show us?
Graphics that display trees in 2-D space based on their proximity to one another
Could these be a valuable tool for looking at the posterior sample?
Is this how we see our forest?
Estimated parsimony trees (TNT, Goloboff and Catalano 2016) and Bayesian trees (Höhna et al 2016) for 127 published paleontological matrices
Calculated stratigraphic congruence for all equally-parsimonious trees and the Bayesian posterior sample in the R package Strap (Bell and Lloyd 2015)
Estimated parsimony trees (TNT, Goloboff and Catalano 2016) and Bayesian trees (Hoehna et al 2016) for 127 published paleontological matrices
Calculated stratigraphic congruence for all equally-parsimonious trees and the Bayesian posterior sample in the R package Strap (Bell and Lloyd 2015)
Modified the RWTY (Warren, Geneva and Lanfear 2017) to color points in the treespace by MIG score
Estimated parsimony trees (TNT, Goloboff and Catalano 2016) and Bayesian trees (Hoehna et al 2016) for 127 published paleontological matrices
Calculated stratigraphic congruence for all equally-parsimonious trees and the Bayesian posterior sample in the R package Strap (Bell and Lloyd 2015)
Modified the RWTY (Warren, Geneva and Lanfear 2017) to color points in the treespace by MIG score
Also calculated some basic summary tables across thee datasets using tidyverse
Example dataset: Yates (2003)
Example dataset: Yates (2003)
Example dataset: Demar 2013
Example dataset: Demar 2013
Bayesian methods estimate a sample of solutions
Bayesian methods estimate a sample of solutions
Unlike other methods, the distribution itself may be important
Bayesian methods estimate a sample of solutions
Unlike other methods, the distribution itself may be important
Looking at the full distribution of trees can provide us with information that one solution itself may not
We shouldn’t be asking “Is Bayes or parsimony better?”
We shouldn’t be asking “Is Bayes or parsimony better?”
We should instead be looking for ways to comfortably visualize variation in large datasets
We shouldn’t be asking “Is Bayes or parsimony better?”
We should instead be looking for ways to comfortably visualize variation in large datasets
Treespace visualizations provide an easy and intuitive way to do this
A sincere thank you to the organizers of this excellent event!
To Dan Warren and Rob Lanfear for the endlessly hackable RWTY software
And to my co-author and partner in crime, Dr. Graeme Lloyd