2014
DOI: 10.1093/molbev/msu200
|View full text |Cite
|
Sign up to set email alerts
|

Prospects for Building Large Timetrees Using Molecular Data with Incomplete Gene Coverage among Species

Abstract: Scientists are assembling sequence data sets from increasing numbers of species and genes to build comprehensive timetrees. However, data are often unavailable for some species and gene combinations, and the proportion of missing data is often large for data sets containing many genes and species. Surprisingly, there has not been a systematic analysis of the effect of the degree of sparseness of the species-gene matrix on the accuracy of divergence time estimates. Here, we present results from computer simulat… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

2
36
0

Year Published

2015
2015
2020
2020

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 54 publications
(38 citation statements)
references
References 28 publications
(46 reference statements)
2
36
0
Order By: Relevance
“…Zheng and Wiens, 2015) reinforces the idea that large amounts of missing data need not be problematic for divergencetime estimation (e.g. Filipski et al, 2014;Zheng and Wiens, 2015).…”
Section: Time-calibrated Phylogenysupporting
confidence: 67%
“…Zheng and Wiens, 2015) reinforces the idea that large amounts of missing data need not be problematic for divergencetime estimation (e.g. Filipski et al, 2014;Zheng and Wiens, 2015).…”
Section: Time-calibrated Phylogenysupporting
confidence: 67%
“…However, when each gene was 75% incomplete, the error was more substantial, with mean error close to 17 Myr and with a strong bias towards underestimated clade ages (relative to the complete dataset of 20 genes). These results emphasize the benefits of including genes that are sampled for all taxa (see also Filipski et al, 2014). However, in some cases, the errors were relatively small without these ''backbone'' genes, even when data were missing in 50% of the taxa in all genes.…”
Section: Resultsmentioning
confidence: 55%
“…This was intended to mimic the situation explored by Filipski et al (2014) in which there are no complete ''backbone'' genes, a situation in which the divergence-dating method that they examined sometimes performed poorly. Here, three different levels of missing data were again explored: (a) missing data in 8 taxa, with 25% missing data in the matrix overall; (b) missing data in 16 taxa, 50% overall; and (c) missing data in 24 taxa, 75% overall.…”
Section: Design Of Subsampling Experimentsmentioning
confidence: 99%
See 2 more Smart Citations