The method to infer the phylogenetic relationship among quartets is implemented in the software SVDquartets, available at www.stat.osu.edu/∼lkubatko/software/SVDquartets.
Although multiple gene sequences are becoming increasingly available for molecular phylogenetic inference, the analysis of such data has largely relied on inference methods designed for single genes. One of the common approaches to analyzing data from multiple genes is concatenation of the individual gene data to form a single supergene to which traditional phylogenetic inference procedures - e.g., maximum parsimony (MP) or maximum likelihood (ML) - are applied. Recent empirical studies have demonstrated that concatenation of sequences from multiple genes prior to phylogenetic analysis often results in inference of a single, well-supported phylogeny. Theoretical work, however, has shown that the coalescent can produce substantial variation in single-gene histories. Using simulation, we combine these ideas to examine the performance of the concatenation approach under conditions in which the coalescent produces a high level of discord among individual gene trees and show that it leads to statistically inconsistent estimation in this setting. Furthermore, use of the bootstrap to measure support for the inferred phylogeny can result in moderate to strong support for an incorrect tree under these conditions. These results highlight the importance of incorporating variation in gene histories into multilocus phylogenetics.
BackgroundThe near exclusive use of praziquantel (PZQ) for treatment of human schistosomiasis has raised concerns about the possible emergence of drug-resistant schistosomes.Methodology/Principal FindingsWe measured susceptibility to PZQ of isolates of Schistosoma mansoni obtained from patients from Kisumu, Kenya continuously exposed to infection as a consequence of their occupations as car washers or sand harvesters. We used a) an in vitro assay with miracidia, b) an in vivo assay targeting adult worms in mice and c) an in vitro assay targeting adult schistosomes perfused from mice. In the miracidia assay, in which miracidia from human patients were exposed to PZQ in vitro, reduced susceptibility was associated with previous treatment of the patient with PZQ. One isolate (“KCW”) that was less susceptible to PZQ and had been derived from a patient who had never fully cured despite multiple treatments was studied further. In an in vivo assay of adult worms, the KCW isolate was significantly less susceptible to PZQ than two other isolates from natural infections in Kenya and two lab-reared strains of S. mansoni. The in vitro adult assay, based on measuring length changes of adults following exposure to and recovery from PZQ, confirmed that the KCW isolate was less susceptible to PZQ than the other isolates tested. A sub-isolate of KCW maintained separately and tested after three years was susceptible to PZQ, indicative that the trait of reduced sensitivity could be lost if selection was not maintained.Conclusions/SignificanceIsolates of S. mansoni from some patients in Kisumu have lower susceptibility to PZQ, including one from a patient who was never fully cured after repeated rounds of treatment administered over several years. As use of PZQ continues, continued selection for worms with diminished susceptibility is possible, and the probability of emergence of resistance will increase as large reservoirs of untreated worms diminish. The potential for rapid emergence of resistance should be an important consideration of treatment programs.
The inference of the evolutionary history of a collection of organisms is a problem of fundamental importance in evolutionary biology. The abundance of DNA sequence data arising from genome sequencing projects has led to significant challenges in the inference of these phylogenetic relationships. Among these challenges is the inference of the evolutionary history of a collection of species based on sequence information from several distinct genes sampled throughout the genome. It is widely accepted that each individual gene has its own phylogeny, which may not agree with the species tree. Many possible causes of this gene tree incongruence are known. The best studied is incomplete lineage sorting, which is commonly modeled by the coalescent process. Numerous methods based on the coalescent process have been proposed for estimation of the phylogenetic species tree given DNA sequence data. However, use of these methods assumes that the phylogenetic species tree can be identified from DNA sequence data at the leaves of the tree, although this has not been formally established. We prove that the unrooted topology of the n-leaf phylogenetic species tree is generically identifiable given observed data at the leaves of the tree that are assumed to have arisen from the coalescent process under a time-reversible substitution process with the possibility of site-specific rate variation modeled by the discrete gamma distribution and a proportion of invariable sites.
The analysis of hybridization and gene flow among closely related taxa is a common goal for researchers studying speciation and phylogeography. Many methods for hybridization detection use simple site pattern frequencies from observed genomic data and compare them to null models that predict an absence of gene flow. The theory underlying the detection of hybridization using these site pattern probabilities exploits the relationship between the coalescent process for gene trees within population trees and the process of mutation along the branches of the gene trees. For certain models, site patterns are predicted to occur in equal frequency (i.e., their difference is 0), producing a set of functions called phylogenetic invariants. In this article, we introduce HyDe, a software package for detecting hybridization using phylogenetic invariants arising under the coalescent model with hybridization. HyDe is written in Python and can be used interactively or through the command line using pre-packaged scripts. We demonstrate the use of HyDe on simulated data, as well as on two empirical data sets from the literature. We focus in particular on identifying individual hybrids within population samples and on distinguishing between hybrid speciation and gene flow. HyDe is freely available as an open source Python package under the GNU GPL v3 on both GitHub (https://github.com/pblischak/HyDe) and the Python Package Index (PyPI: https://pypi.python.org/pypi/phyde). [ABBA-BABA; coalescence; gene flow; hybridization; phylogenetic invariants.]
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.