Deep avian evolutionary relationships have been difficult to resolve as a result of a putative explosive radiation. Our study examined approximately 32 kilobases of aligned nuclear DNA sequences from 19 independent loci for 169 species, representing all major extant groups, and recovered a robust phylogeny from a genome-wide signal supported by multiple analytical methods. We documented well-supported, previously unrecognized interordinal relationships (such as a sister relationship between passerines and parrots) and corroborated previously contentious groupings (such as flamingos and grebes). Our conclusions challenge current classifications and alter our understanding of trait evolution; for example, some diurnal birds evolved from nocturnal ancestors. Our results provide a valuable resource for phylogenetic and comparative studies in birds.
Phylogenomics, the use of large-scale data matrices in phylogenetic analyses, has been viewed as the ultimate solution to the problem of resolving difficult nodes in the tree of life. However, it has become clear that analyses of these large genomic data sets can also result in conflicting estimates of phylogeny. Here, we use the early divergences in Neoaves, the largest clade of extant birds, as a "model system" to understand the basis for incongruence among phylogenomic trees. We were motivated by the observation that trees from two recent avian phylogenomic studies exhibit conflicts. Those studies used different strategies: 1) collecting many characters [$\sim$ 42 mega base pairs (Mbp) of sequence data] from 48 birds, sometimes including only one taxon for each major clade; and 2) collecting fewer characters ($\sim$ 0.4 Mbp) from 198 birds, selected to subdivide long branches. However, the studies also used different data types: the taxon-poor data matrix comprised 68% non-coding sequences whereas coding exons dominated the taxon-rich data matrix. This difference raises the question of whether the primary reason for incongruence is the number of sites, the number of taxa, or the data type. To test among these alternative hypotheses we assembled a novel, large-scale data matrix comprising 90% non-coding sequences from 235 bird species. Although increased taxon sampling appeared to have a positive impact on phylogenetic analyses the most important variable was data type. Indeed, by analyzing different subsets of the taxa in our data matrix we found that increased taxon sampling actually resulted in increased congruence with the tree from the previous taxon-poor study (which had a majority of non-coding data) instead of the taxon-rich study (which largely used coding data). We suggest that the observed differences in the estimates of topology for these studies reflect data-type effects due to violations of the models used in phylogenetic analyses, some of which may be difficult to detect. If incongruence among trees estimated using phylogenomic methods largely reflects problems with model fit developing more "biologically-realistic" models is likely to be critical for efforts to reconstruct the tree of life. [Birds; coding exons; GTR model; model fit; Neoaves; non-coding DNA; phylogenomics; taxon sampling.].
Avian diversification has been influenced by global climate change, plate tectonic movements, and mass extinction events. However, the impact of these factors on the diversification of the hyperdiverse perching birds (passerines) is unclear because family level relationships are unresolved and the timing of splitting events among lineages is uncertain. We analyzed DNA data from 4,060 nuclear loci and 137 passerine families using concatenation and coalescent approaches to infer a comprehensive phylogenetic hypothesis that clarifies relationships among all passerine families. Then, we calibrated this phylogeny using 13 fossils to examine the effects of different events in Earth history on the timing and rate of passerine diversification. Our analyses reconcile passerine diversification with the fossil and geological records; suggest that passerines originated on the Australian landmass ∼47 Ma; and show that subsequent dispersal and diversification of passerines was affected by a number of climatological and geological events, such as Oligocene glaciation and inundation of the New Zealand landmass. Although passerine diversification rates fluctuated throughout the Cenozoic, we find no link between the rate of passerine diversification and Cenozoic global temperature, and our analyses show that the increases in passerine diversification rate we observe are disconnected from the colonization of new continents. Taken together, these results suggest more complex mechanisms than temperature change or ecological opportunity have controlled macroscale patterns of passerine speciation.
Ratites (ostriches, emus, rheas, cassowaries, and kiwis) are large, flightless birds that have long fascinated biologists. Their current distribution on isolated southern land masses is believed to reflect the breakup of the paleocontinent of Gondwana. The prevailing view is that ratites are monophyletic, with the flighted tinamous as their sister group, suggesting a single loss of flight in the common ancestry of ratites. However, phylogenetic analyses of 20 unlinked nuclear genes reveal a genome-wide signal that unequivocally places tinamous within ratites, making ratites polyphyletic and suggesting multiple losses of flight. Phenomena that can mislead phylogenetic analyses, including long branch attraction, base compositional bias, discordance between gene trees and species trees, and sequence alignment errors, have been eliminated as explanations for this result. The most plausible hypothesis requires at least three losses of flight and explains the many morphological and behavioral similarities among ratites by parallel or convergent evolution. Finally, this phylogeny demands fundamental reconsideration of proposals that relate ratite evolution to continental drift.convergence ͉ flightlessness ͉ Paleognath ͉ homoplasy ͉ vicariance biogeography
We improve the taxon sampling for avian phylogeny by analyzing 7 new mitochondrial genomes (a toucan, woodpecker, osprey, forest falcon, American kestrel, heron, and a pelican). This improves inference of the avian tree, and it supports 3 major conclusions. The first is that some birds (including a parrot, a toucan, and an osprey) exhibit a complete duplication of the control region (CR) meaning that there are at least 4 distinct gene orders within birds. However, it appears that there are regions of continued gene conversion between the duplicate CRs, resulting in duplications that can be stable for long evolutionary periods. Because of this stable duplicated state, gene order can eventually either revert to the original order or change to the new gene order. The existence of this stable duplicate state explains how an apparently unlikely event (finding the same novel gene order) can arise multiple times. Although rare genomic changes have theoretical advantages for tree reconstruction, they can be compromised if these apparently rare events have a stable intermediate state. Secondly, the toucan and woodpecker improve the resolution of the 6-way split within Neoaves that has been called an "explosive radiation." An explosive radiation implies that normal microevolutionary events are insufficient to explain the observed macroevolution. By showing the avian tree is, in principle, resolvable, we demonstrate that the radiation of birds is amenable to standard evolutionary analysis. Thirdly, and as expected from theory, additional taxa breaking up long branches stabilize the position of some problematic taxa (like the falcon). In addition, we report that within the birds of prey and allies, we did not find evidence pairing New World vultures with storks or accipitrids (hawks, eagles, and osprey) with Falconids.
Production of massive DNA sequence data sets is transforming phylogenetic inference, but best practices for analyzing such data sets are not well established. One uncertainty is robustness to missing data, particularly in coalescent frameworks. To understand the effects of increasing matrix size and loci at the cost of increasing missing data, we produced a 90 taxon, 2.2 megabase, 4,800 locus sequence matrix of landfowl using target capture of ultraconserved elements. We then compared phylogenies estimated with concatenated maximum likelihood, quartet-based methods executed on concatenated matrices and gene tree reconciliation methods, across five thresholds of missing data. Results of maximum likelihood and quartet analyses were similar, well resolved, and demonstrated increasing support with increasing matrix size and sparseness. Conversely, gene tree reconciliation produced unexpected relationships when we included all informative loci, with certain taxa placed toward the root compared with other approaches. Inspection of these taxa identified a prevalence of short average contigs, which potentially biased gene tree inference and caused erroneous results in gene tree reconciliation. This suggests that the more problematic missing data in gene tree-based analyses are partial sequences rather than entire missing sequences from locus alignments. Limiting gene tree reconciliation to the most informative loci solved this problem, producing well-supported topologies congruent with concatenation and quartet methods. Collectively, our analyses provide a well-resolved phylogeny of landfowl, including strong support for previously problematic relationships such as those among junglefowl (Gallus), and clarify the position of two enigmatic galliform genera (Lerwa, Melanoperdix) not sampled in previous molecular phylogenetic studies.
It has long been appreciated that analyses of genomic data (e.g., whole genome sequencing or sequence capture) have the potential to reveal the tree of life, but it remains challenging to move from sequence data to a clear understanding of evolutionary history, in part due to the computational challenges of phylogenetic estimation using genome-scale data. Supertree methods solve that challenge because they facilitate a divide-and-conquer approach for large-scale phylogeny inference by integrating smaller subtrees in a computationally efficient manner. Here, we combined information from sequence capture and whole-genome phylogenies using supertree methods. However, the available phylogenomic trees had limited overlap so we used taxon-rich (but not phylogenomic) megaphylogenies to weave them together. This allowed us to construct a phylogenomic supertree, with support values, that included 707 bird species (~7% of avian species diversity). We estimated branch lengths using mitochondrial sequence data and we used these branch lengths to estimate divergence times. Our time-calibrated supertree supports radiation of all three major avian clades (Palaeognathae, Galloanseres, and Neoaves) near the Cretaceous-Paleogene (K-Pg) boundary. The approach we used will permit the continued addition of taxa to this supertree as new phylogenomic data are published, and it could be applied to other taxa as well.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.