2008
DOI: 10.1101/gr.073585.107
|View full text |Cite
|
Sign up to set email alerts
|

EnsemblCompara GeneTrees: Complete, duplication-aware phylogenetic trees in vertebrates

Abstract: We have developed a comprehensive gene orientated phylogenetic resource, EnsemblCompara GeneTrees, based on a computational pipeline to handle clustering, multiple alignment, and tree generation, including the handling of large gene families. We developed two novel non-sequence-based metrics of gene tree correctness and benchmarked a number of tree methods. The TreeBeST method from TreeFam shows the best performance in our hands. We also compared this phylogenetic approach to clustering approaches for ortholog… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

2
1,132
0
2

Year Published

2010
2010
2017
2017

Publication Types

Select...
6
3

Relationship

0
9

Authors

Journals

citations
Cited by 1,092 publications
(1,158 citation statements)
references
References 28 publications
2
1,132
0
2
Order By: Relevance
“…Our automated pipeline started with gene families previously inferred for eight plant genomes (homologs: i.e., all the paralogs and orthologs; Vilella et al. 2009), including two Panicoideae grasses ( Setaria italica and Sorghum bicolor ), two non‐Panicoideae grasses ( Brachypodium distachyon and Oryza sativa ), and four nongrass species ( Amborella trichopoda, A. thaliana , Populus trichocarpa , and Selaginella moellendorffii ). To ensure accurate annotation, we restricted the analysis to gene families that included at least one A. thaliana sequence.…”
Section: Methodsmentioning
confidence: 99%
“…Our automated pipeline started with gene families previously inferred for eight plant genomes (homologs: i.e., all the paralogs and orthologs; Vilella et al. 2009), including two Panicoideae grasses ( Setaria italica and Sorghum bicolor ), two non‐Panicoideae grasses ( Brachypodium distachyon and Oryza sativa ), and four nongrass species ( Amborella trichopoda, A. thaliana , Populus trichocarpa , and Selaginella moellendorffii ). To ensure accurate annotation, we restricted the analysis to gene families that included at least one A. thaliana sequence.…”
Section: Methodsmentioning
confidence: 99%
“…This means that if exon divergent paralogs are enriched in young duplicates, the influence of duplication age may make it appear as though exon structure divergence between paralogs results in lower levels of alternative splicing. Figure 3B plots the proportions of exon divergent and nondivergent paralogs created during a variety of evolutionary epochs as indicated by Ensembl Compara (Vilella et al 2009). Within each plot, the proportions of divergent and nondivergent paralogs produced at each epoch are strikingly similar.…”
Section: Exon Divergent Paralogs Undergo Less Alternative Splicingmentioning
confidence: 99%
“…We constructed phylogenetic trees based on 3679 human paralogous families extracted from the Ensembl database [21]. We used the multiple sequence alignment (MSA) algorithm ClustalW2 [22] to align the nucleotide sequences.…”
Section: Resultsmentioning
confidence: 99%