A recent slew of ENCyclopedia Of DNA Elements (ENCODE) Consortium publications, specifically the article signed by all Consortium members, put forward the idea that more than 80% of the human genome is functional. This claim flies in the face of current estimates according to which the fraction of the genome that is evolutionarily conserved through purifying selection is less than 10%. Thus, according to the ENCODE Consortium, a biological function can be maintained indefinitely without selection, which implies that at least 80 − 10 = 70% of the genome is perfectly invulnerable to deleterious mutations, either because no mutation can ever occur in these “functional” regions or because no mutation in these regions can ever be deleterious. This absurd conclusion was reached through various means, chiefly by employing the seldom used “causal role” definition of biological function and then applying it inconsistently to different biochemical properties, by committing a logical fallacy known as “affirming the consequent,” by failing to appreciate the crucial difference between “junk DNA” and “garbage DNA,” by using analytical methods that yield biased errors and inflate estimates of functionality, by favoring statistical sensitivity over specificity, and by emphasizing statistical significance rather than the magnitude of the effect. Here, we detail the many logical and methodological transgressions involved in assigning functionality to almost every nucleotide in the human genome. The ENCODE results were predicted by one of its authors to necessitate the rewriting of textbooks. We agree, many textbooks dealing with marketing, mass-media hype, and public relations may well have to be rewritten.
The pronouncements of the ENCODE Project Consortium regarding “junk DNA” exposed the need for an evolutionary classification of genomic elements according to their selected-effect function. In the classification scheme presented here, we divide the genome into “functional DNA,” that is, DNA sequences that have a selected-effect function, and “rubbish DNA,” that is, sequences that do not. Functional DNA is further subdivided into “literal DNA” and “indifferent DNA.” In literal DNA, the order of nucleotides is under selection; in indifferent DNA, only the presence or absence of the sequence is under selection. Rubbish DNA is further subdivided into “junk DNA” and “garbage DNA.” Junk DNA neither contributes to nor detracts from the fitness of the organism and, hence, evolves under selective neutrality. Garbage DNA, on the other hand, decreases the fitness of its carriers. Garbage DNA exists in the genome only because natural selection is neither omnipotent nor instantaneous. Each of these four functional categories can be 1) transcribed and translated, 2) transcribed but not translated, or 3) not transcribed. The affiliation of a DNA segment to a particular functional category may change during evolution: Functional DNA may become junk DNA, junk DNA may become garbage DNA, rubbish DNA may become functional DNA, and so on; however, determining the functionality or nonfunctionality of a genomic sequence must be based on its present status rather than on its potential to change (or not to change) in the future. Changes in functional affiliation are divided into pseudogenes, Lazarus DNA, zombie DNA, and Jekyll-to-Hyde DNA.
BackgroundWe evaluated the sensitivity of the D-statistic, a parsimony-like method widely used to detect gene flow between closely related species. This method has been applied to a variety of taxa with a wide range of divergence times. However, its parameter space and thus its applicability to a wide taxonomic range has not been systematically studied. Divergence time, population size, time of gene flow, distance of outgroup and number of loci were examined in a sensitivity analysis.ResultThe sensitivity study shows that the primary determinant of the D-statistic is the relative population size, i.e. the population size scaled by the number of generations since divergence. This is consistent with the fact that the main confounding factor in gene flow detection is incomplete lineage sorting by diluting the signal. The sensitivity of the D-statistic is also affected by the direction of gene flow, size and number of loci. In addition, we examined the ability of the f-statistics, and , to estimate the fraction of a genome affected by gene flow; while these statistics are difficult to implement to practical questions in biology due to lack of knowledge of when the gene flow happened, they can be used to compare datasets with identical or similar demographic background.ConclusionsThe D-statistic, as a method to detect gene flow, is robust against a wide range of genetic distances (divergence times) but it is sensitive to population size. The D-statistic should only be applied with critical reservation to taxa where population sizes are large relative to branch lengths in generations.Electronic supplementary materialThe online version of this article (10.1186/s12859-017-2002-4) contains supplementary material, which is available to authorized users.
Transposable elements (TEs) are genomic parasites that impose fitness costs on their hosts by producing deleterious mutations and disrupting gametogenesis. Host genomes avoid these costs by regulating TE activity, particularly in germline cells where new insertions are heritable and TEs are exceptionally active. However, the capacity of different TE-associated fitness costs to select for repression in the host, and the role of selection in the evolution of TE regulation more generally remain controversial. In this study, we use forward, individual-based simulations to examine the evolution of small-RNA-mediated TE regulation, a conserved mechanism for TE repression that is employed by both prokaryotes and eukaryotes. To design and parameterize a biologically realistic model, we drew on an extensive survey of empirical studies of the transposition and regulation of P -element DNA transposons in Drosophila melanogaster . We observed that even under conservative assumptions, where small-RNA-mediated regulation reduces transposition only, repression evolves rapidly and adaptively after the genome is invaded by a new TE in simulated populations. We further show that the spread of repressor alleles through simulated populations is greatly enhanced by two additional TE-imposed fitness costs: dysgenic sterility and ectopic recombination. Finally, we demonstrate that the adaptive mutation rate to repression is a critical parameter that influences both the evolutionary trajectory of host repression and the associated proliferation of TEs after invasion in simulated populations. Our findings suggest that adaptive evolution of TE regulation may be stronger and more prevalent than previously appreciated, and provide a framework for interpreting empirical data.
Transposable elements (TEs) are genomic parasites that impose fitness costs on their hosts by producing deleterious mutations and disrupting gametogenesis. Host genomes avoid these costs by regulating TE activity, particularly in germline cells where new insertions are heritable and TEs are exceptionally active. However, the capacity of different TE-associated fitness costs to select for repression in the host, and the role of selection in the evolution of TE regulation more generally, remain controversial. In this study, we use individual-based simulations to examine the evolution of TE regulation through small RNA-mediated silencing. Small RNA silencing is a common mechanism for TE regulation employed by both prokaryotes and eukaryotes. We observed that even under conservative assumptions, where small RNA-mediated regulation reduces transposition only, repression evolves rapidly and adaptively after the genome is invaded by a new TE. We further show that the spread of repressor alleles is greatly enhanced by two additional TE-imposed fitness costs: dysgenic sterility and ectopic recombination. Finally, we demonstrate that the mutation rate to repression (i.e., the size of the mutational target) is a critical parameter that influences both the evolutionary trajectory of host repression and the associated proliferation of TEs. Our findings suggest that adaptive evolution of TE regulation may be stronger and more prevalent than previously suggested, and complement recent empirical observations of positive selection on small RNA-mediated repressor alleles. Both prokaryotic and eukaryotic genomes minimize the fitness costs of TEs by controlling their activity through small RNA mediated silencing (reviewed in Blumenstiel 2011). In eukaryotic germlines, small RNA mediated silencing of TEs is enacted by Argonaute proteins that are found in complex with small interfering RNAs (siRNAs) and Piwi-interacting RNAs (piRNAs). Regulatory siRNAs and piRNAs are produced from anti-sense TE transcripts, and identify TE-derived mRNAs by base complementarity. Complexed Argonaute proteins then silence the targeted TE transcriptionally by inducing heterochromatin formation, or post-transcriptionally by degrading the transcript. Consistent with prevention of germline DNA damage, small RNA-mediated silencing suppresses TE-associated dysgenic sterility in Drosophila (Blumenstiel and Hartl 2005;Brennecke et al. 2008;Chambeyron et al. 2008;Rozhkov et al.
The first synthesis of N-substituted lactams via an acceptorless dehydrogenative coupling of diols with primary amines in one step was enabled by combining Ru3(CO)12 with a hybrid N-heterocyclic carbene–phosphine–phosphine ligand as the catalyst.
The iconic Australasian kangaroos and wallabies represent a successful marsupial radiation. However, the evolutionary relationship within the two genera, Macropus and Wallabia, is controversial: mitochondrial and nuclear genes, and morphological data have produced conflicting scenarios regarding the phylogenetic relationships, which in turn impact the classification and taxonomy. We sequenced and analyzed the genomes of 11 kangaroos to investigate the evolutionary cause of the observed phylogenetic conflict. A multilocus coalescent analysis using ∼14,900 genome fragments, each 10 kb long, significantly resolved the species relationships between and among the sister-genera Macropus and Wallabia. The phylogenomic approach reconstructed the swamp wallaby (Wallabia) as nested inside Macropus, making this genus paraphyletic. However, the phylogenomic analyses indicate multiple conflicting phylogenetic signals in the swamp wallaby genome. This is interpreted as at least one introgression event between the ancestor of the genus Wallabia and a now extinct ghost lineage outside the genus Macropus. Additional phylogenetic signals must therefore be caused by incomplete lineage sorting and/or introgression, but available statistical methods cannot convincingly disentangle the two processes. In addition, the relationships inside the Macropus subgenus M. (Notamacropus) represent a hard polytomy. Thus, the relationships between tammar, red-necked, agile, and parma wallabies remain unresolvable even with whole-genome data. Even if most methods resolve bifurcating trees from genomic data, hard polytomies, incomplete lineage sorting, and introgression complicate the interpretation of the phylogeny and thus taxonomy.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.