Abstract:BackgroundIn this paper, we address the evidence for the Ambush Hypothesis. Proposed by Seligmann and Pollock, this hypothesis posits that there exists a selection for off-frame stop codons (OSCs) to counteract the possible deleterious effects of translational frameshifts, including the waste of resources and potential cytotoxicity. Two main types of study have been used to support the hypothesis. Some studies analyzed codon usage and showed that codons with more potential to create OSCs seem to be favored ove… Show more
“…Furthermore, incorrectly folded mistranslated proteins may have an adverse effect on cellular interactions or form toxic aggregates ( Tank and True 2009 ). The possible evolutionary advantage of capturing these frameshifts is conjectured to be reflected by an overrepresentation of out of frame stop codons, termed the “ambush hypothesis” ( Seligmann and Pollock 2004 ; Singh and Pardasani 2009 ; Tse et al 2010 ), although the frequency with which codons that form out of frame stops are used is largely predictable from the underlying GC pressure ( Morgens et al 2013 ). Alternatively, selection to reduce costs in genomes where frameshifting is most deleterious (notably GC rich ones) can explain the richer tRNA repertoire found in such genomes ( Warnecke et al 2010 ).…”
Beyond selection for optimal protein functioning, coding sequences (CDSs) are under selection at the RNA and DNA levels. Here, we identify a possible signature of “dual-coding,” namely extensive adenine (A) enrichment at bacterial CDS fourth sites. In 99.07% of studied bacterial genomes, fourth site A use is greater than expected given genomic A-starting codon use. Arguing for nucleotide level selection, A-starting serine and arginine second codons are heavily utilized when compared with their non-A starting synonyms. Several models have the ability to explain some of this trend. In part, A-enrichment likely reduces 5′ mRNA stability, promoting translation initiation. However T/U, which may also reduce stability, is avoided. Further, +1 frameshifts on the initiating ATG encode a stop codon (TGA) provided A is the fourth residue, acting either as a frameshift “catch and destroy” or a frameshift stop and adjust mechanism and hence implicated in translation initiation. Consistent with both, genomes lacking TGA stop codons exhibit weaker fourth site A-enrichment. Sequences lacking a Shine–Dalgarno sequence and those without upstream leader genes, that may be more error prone during initiation, have greater utilization of A, again suggesting a role in initiation. The frameshift correction model is consistent with the notion that many genomic features are error-mitigation factors and provides the first evidence for site-specific out of frame stop codon selection. We conjecture that the NTG universal start codon may have evolved as a consequence of TGA being a stop codon and the ability of NTGA to rapidly terminate or adjust a ribosome.
“…Furthermore, incorrectly folded mistranslated proteins may have an adverse effect on cellular interactions or form toxic aggregates ( Tank and True 2009 ). The possible evolutionary advantage of capturing these frameshifts is conjectured to be reflected by an overrepresentation of out of frame stop codons, termed the “ambush hypothesis” ( Seligmann and Pollock 2004 ; Singh and Pardasani 2009 ; Tse et al 2010 ), although the frequency with which codons that form out of frame stops are used is largely predictable from the underlying GC pressure ( Morgens et al 2013 ). Alternatively, selection to reduce costs in genomes where frameshifting is most deleterious (notably GC rich ones) can explain the richer tRNA repertoire found in such genomes ( Warnecke et al 2010 ).…”
Beyond selection for optimal protein functioning, coding sequences (CDSs) are under selection at the RNA and DNA levels. Here, we identify a possible signature of “dual-coding,” namely extensive adenine (A) enrichment at bacterial CDS fourth sites. In 99.07% of studied bacterial genomes, fourth site A use is greater than expected given genomic A-starting codon use. Arguing for nucleotide level selection, A-starting serine and arginine second codons are heavily utilized when compared with their non-A starting synonyms. Several models have the ability to explain some of this trend. In part, A-enrichment likely reduces 5′ mRNA stability, promoting translation initiation. However T/U, which may also reduce stability, is avoided. Further, +1 frameshifts on the initiating ATG encode a stop codon (TGA) provided A is the fourth residue, acting either as a frameshift “catch and destroy” or a frameshift stop and adjust mechanism and hence implicated in translation initiation. Consistent with both, genomes lacking TGA stop codons exhibit weaker fourth site A-enrichment. Sequences lacking a Shine–Dalgarno sequence and those without upstream leader genes, that may be more error prone during initiation, have greater utilization of A, again suggesting a role in initiation. The frameshift correction model is consistent with the notion that many genomic features are error-mitigation factors and provides the first evidence for site-specific out of frame stop codon selection. We conjecture that the NTG universal start codon may have evolved as a consequence of TGA being a stop codon and the ability of NTGA to rapidly terminate or adjust a ribosome.
“…When a larger data set was analyzed, Morgens et al found that only a slight overexpression of PSCs in close to 2000 bacterial genomes. They also found that some out-of-frame sense codons are even more overexpressed than the PSCs[155]. Bacteria with high GC content or metabolically versatile bacteria were found to contain very few PSCs in their genome raising questions regarding the validity of the ambush hypothesis[31,155].…”
Protein termination is an important cellular process. Protein termination relies on the stop-codons in the mRNA interacting properly with the releasing factors on the ribosome. One third of inherited diseases, including cancers, are associated with the mutation of the stop-codons. Many pathogens and viruses are able to manipulate their stop-codons to express their virulence. The influence of stop-codons is not limited to the primary reading frame of the genes. Stop-codons in the second and third reading frames are referred as premature stop signals (PSC). Stop-codons and PSCs together are collectively referred as stop-signals. The ratios of the stop-signals (referred as translation stop-signals ratio or TSSR) of genetically related bacteria, despite their great differences in gene contents, are much alike. This nearly identical Genomic-TSSR value of genetically related bacteria may suggest that bacterial genome expansion is limited by their unique stop-signals bias. We review the protein termination process and the different types of stop-codon mutation in plants, animals, microbes, and viruses, with special emphasis on the role of PSCs in directing bacterial evolution in their natural environments. Knowing the limit of genomic boundary could facilitate the formulation of new strategies in controlling the spread of diseases and combat antibiotic-resistant bacteria.
“…For example, the polyketide synthase (PKS) genes presented with significantly lower level of hidden stop codons than expected, suggesting both non-adherence to the ambush hypothesis and a suppression of hidden stop codon evolution [8]. In addition, it was reported that some sense codons have a more significant excess than stop codons [4]. These controversial results can be well explained if the emerging of the hidden stops after the occurrence of a frameshift is considered to be a signal to trigger the cell machine for readthrough and then for reading frame recovery, rather than a signal for translational termination.…”
Section: Frameshift Tolerating and The Ambush Hypothesismentioning
confidence: 99%
“…The "Ambush Hypothesis" [3] presumed that most frameshifts would yield non-functional proteins, lead to waste of energy, resources and activity of the biosynthetic machinery, and some peptides synthesized after frameshifts were thought to be cytotoxic [3][4][5][6][7][8][9][10][11].…”
The genetic code defines the relationship between a protein and its coding DNA sequence. It was presumed that most frameshifts would yield non-functional, truncated or cytotoxic products. In this study, we report that in E. coli a frameshift β-lactamase (bla) gene is still functional if all of the inner stop codons were readthrough or replaced by a sense codon. By analyzing a large dataset including all available protein coding genes in major model organisms, it is demonstrated that in any species, and in any protein-coding genes, the three translational products from the three different reading frames, are always similar to each other and with constant ~50% similarities and ~100% coverages, and the similarities is predefined by the genetic code rather than the sequences themselves, suggesting that the genetic code was optimized for frameshift tolerating in the early evolution, which endows every protein coding gene a shiftability, an inherent and everlasting ability to tolerate frameshift mutations, and serves as an innate mechanism for cells to deal with the frameshift problem. In addition, it is likely that every protein-coding gene can be translated into three isoforms from the three different reading frames, we proposed a new gene expression paradigm, "one gene, three translations", which is an amendment to the "one gene, one/multiple peptides" hypotheses.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.