2004
DOI: 10.1093/molbev/msh182
|View full text |Cite
|
Sign up to set email alerts
|

Phylogenomics of Eukaryotes: Impact of Missing Data on Large Alignments

Abstract: Resolving the relationships between Metazoa and other eukaryotic groups as well as between metazoan phyla is central to the understanding of the origin and evolution of animals. The current view is based on limited data sets, either a single gene with many species (e.g., ribosomal RNA) or many genes but with only a few species. Because a reliable phylogenetic inference simultaneously requires numerous genes and numerous species, we assembled a very large data set containing 129 orthologous proteins ( approxima… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

21
263
1

Year Published

2006
2006
2022
2022

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 361 publications
(285 citation statements)
references
References 96 publications
21
263
1
Order By: Relevance
“…These results are supported by simulation studies, which suggest that the mere presence of missing data may not itself be misleading, beyond the reduced amount of data included (e.g. Wiens, 2003;Philippe et al, 2004;Wiens and Morrill, 2011).…”
Section: Introductionmentioning
confidence: 55%
“…These results are supported by simulation studies, which suggest that the mere presence of missing data may not itself be misleading, beyond the reduced amount of data included (e.g. Wiens, 2003;Philippe et al, 2004;Wiens and Morrill, 2011).…”
Section: Introductionmentioning
confidence: 55%
“…This conclusion is reinforced when the phylogenetic tree for the family in question conforms to the topology of the eukaryotic tree. However, none of the best phylogenetic trees for Ago-Piwi, Dicer-like, or RdRP polypeptides strictly coincided with the consensus tree of Eukaryota (Medina 2005) nor could reconstruct the monophyly of some higher-order eukaryotic groups, as previously reported for individual gene/protein trees (Arisue et al 2004;Philippe et al 2004). Besides the usual problems of weakness of phylogenetic signal, lateral gene transfers, hidden paralogy, and tree reconstruction artefacts (Philippe et al 2004), incorrectly predicted protein models (since some sequences were extracted from draft genomes) may have contributed to greater than usual divergences making the relationship among deep branches difficult to resolve.…”
Section: Phylogeny Of Argonaute-piwi Dicer-like and Rdrp Proteinsmentioning
confidence: 66%
“…We built upon previous phylogenomic datasets assembled in the Philippe lab Jimenez-Guri et al, 2007;Lartillot and Philippe, 2008;Philippe et al, 2004Philippe et al, , 2005b to select a set of 179 orthologous markers showing sufficient conservation across metazoans to be useful for inferring the phylogeny of metazoans. Alignments were built and updated with available sequences downloaded from the Trace Archive (http://www.ncbi.nlm.nih.gov/Traces/) and the EST Database (http://www.ncbi.nlm.nih.gov/dbEST/) of GenBank at the National Center for Biotechnology Information (http://www.ncbi.nlm.nih.gov/) using the program ED from the MUST package (Philippe, 1993).…”
Section: Phylogenomic Dataset Assemblymentioning
confidence: 99%
“…The complete alignment consists of 179 genes and 51 taxa for 53,799 unambiguously aligned amino-acid sites with 32% missing data. To study the potential impact of missing data on phylogenetic inference (Hartmann and Vision, 2008;Philippe et al, 2004;Wiens, 2006), a concatenation of the 106 genes with sequences available for at least 41 of the 51 OTUs was also constructed with SCAFOS. This reduced alignment consists of 106 genes and 51 taxa for 25,321 amino-acid sites and contains only 20% of missing data.…”
Section: Phylogenomic Dataset Assemblymentioning
confidence: 99%
See 1 more Smart Citation