We purge large databases of animal, plant, and fungal introncontaining genes to a 20% similarity level and then identify the most similar animal-plant, animal-fungal, and plant-fungal protein pairs. We identify the introns in each BLAST 2.0 alignment and score matched intron positions and slid (near-matched, within six nucleotides) intron positions automatically. Overall we find that 10% of the animal introns match plant positions, and a further 7% are ''slides.'' Fifteen percent of fungal introns match animal positions, and 13% match plant positions. Furthermore, the number of alignments with high numbers of matches deviates greatly from the Poisson expectation. The 30 animal-plant alignments with the highest matches (for which 44% of animal introns match plant positions) when aligned with fungal genes are also highly enriched for triple matches: 39% of the fungal introns match both animal and plant positions. This is strong evidence for ancestral introns predating the animal-plant-fungal divergence, and in complete opposition to any expectations based on random insertion. In examining the slid introns, we show that at least half are caused by imperfections in the alignments, and are most likely to be actual matches at common positions. Thus, our final estimates are that Ϸ14% of animal introns match plant positions, and that Ϸ17-18% of fungal introns match animal or plant positions, all of these being likely to be ancestral in the eukaryotes.exon ͉ phase distribution ͉ evolution ͉ eukaryote ͉ prokaryote I ntrons are prevalent in the complex eukaryotes but rare in the simple ones. Are these introns ancestral in all of the eukaryotes or do they arise as the organisms become more complex? Introns can be acquired by or eliminated from a gene during evolution, but what is the balance?An introns-late view argues that introns arise as ''selfish'' elements that play no constructive role in evolution. On this picture, introns appear relatively late in the evolution of eukaryotes (1-3) and spread as mobile elements that invade genes by insertion into short Ϸ4-to 5-nt-long ''proto-splice sites'' (4) (although the notion of proto-splice sites has been challenged; refs. 5 and 6).An introns-early theory suggests that introns made an essential contribution to the evolution of genes via ''exon shuffling,'' which created genes from exon ''pieces'' by recombination within the introns (7-12). In this view, introns existed before any eukaryote-prokaryote divergence, and since that time, the prokaryotic lineage completely lost its introns, whereas they were retained in the eukaryotes.The sequences within the introns change during evolution, far more rapidly than those of the exons. The only conserved elements are the short sequences at the 5Ј and 3Ј termini, which are very similar for all introns. The rest of the intron sequence appears neutral to selection, and the length of the intron sequence can change by orders of magnitude. However, the position of an intron in a gene's coding sequence is well conserved. If one compares the e...
BackgroundMachine learning techniques are becoming useful as an alternative approach to conventional medical diagnosis or prognosis as they are good for handling noisy and incomplete data, and significant results can be attained despite a small sample size. Traditionally, clinicians make prognostic decisions based on clinicopathologic markers. However, it is not easy for the most skilful clinician to come out with an accurate prognosis by using these markers alone. Thus, there is a need to use genomic markers to improve the accuracy of prognosis. The main aim of this research is to apply a hybrid of feature selection and machine learning methods in oral cancer prognosis based on the parameters of the correlation of clinicopathologic and genomic markers.ResultsIn the first stage of this research, five feature selection methods have been proposed and experimented on the oral cancer prognosis dataset. In the second stage, the model with the features selected from each feature selection methods are tested on the proposed classifiers. Four types of classifiers are chosen; these are namely, ANFIS, artificial neural network, support vector machine and logistic regression. A k-fold cross-validation is implemented on all types of classifiers due to the small sample size. The hybrid model of ReliefF-GA-ANFIS with 3-input features of drink, invasion and p63 achieved the best accuracy (accuracy = 93.81%; AUC = 0.90) for the oral cancer prognosis.ConclusionsThe results revealed that the prognosis is superior with the presence of both clinicopathologic and genomic markers. The selected features can be investigated further to validate the potential of becoming as significant prognostic signature in the oral cancer studies.
The Escherichia coli arginine repressor (ArgR) is an L-arginine-dependent DNA-binding protein that controls expression of the arginine biosynthetic genes and is required as an accessory protein in Xer site-specific recombination at cer and related recombination sites in plasmids. Site-directed mutagenesis was used to isolate two mutants of E. coli ArgR that were defective in arginine binding. Results from in vivo and in vitro experiments demonstrate that these mutants still act as repressors and bind their specific DNA sequences in an arginine-independent manner. Both mutants support Xer site-specific recombination at cer. One of the mutant proteins was purified and shown to bind to its DNA target sequences in vitro with different affinity and as a different molecular species to wild-type ArgR.
SummaryThe Escherichia coli arginine repressor (ArgR) controls expression of the arginine biosynthetic genes and acts as an accessory protein in Xer site-specific recombination at cer and related plasmid recombination sites. The hexameric wild-type protein shows L-arginine-dependent DNA binding. In this work, ArgR mutants that are defective in trimer-trimer interactions and bind DNA as trimers in an L-arginine-independent manner are isolated and characterized. Whereas the wild-type ArgR hexamer exhibits high-affinity binding to two repeated ARG boxes separated by 3 bp (each ARG box containing two identical dyad symmetrical 9 bp half-sites), the trimeric mutants bind to and footprint three adjacent half-sites of this 'idealized' substrate. Trimeric ArgR is impaired in its ability to repress the arginine biosynthetic genes and in Xer site-specific recombination. In the absence of L-arginine, residual wild-type ArgR-binding occurs as trimers. The binding of an N-terminal 77-amino-acid DNA-binding domain to idealized ARG boxes is also characterized.
Synonymous codon usage bias is an inevitable phenomenon in organismic taxa across the three domains of life. Though the frequency of codon usage is not equal across species and within genome in the same species, the phenomenon is non random and is tissue-specific. Several factors such as GC content, nucleotide distribution, protein hydropathy, protein secondary structure, and translational selection are reported to contribute to codon usage preference. The synonymous codon usage patterns can be helpful in revealing the expression pattern of genes as well as the evolutionary relationship between the sequences. In this study, synonymous codon usage bias patterns were determined for the evolutionarily close proteins of albumin superfamily, namely, albumin, α-fetoprotein, afamin, and vitamin D-binding protein. Our study demonstrated that the genes of the four albumin superfamily members have low GC content and high values of effective number of codons (ENC) suggesting high expressivity of these genes and less bias in codon usage preferences. This study also provided evidence that the albumin superfamily members are not subjected to mutational selection pressure.
Foot arch determines the shape of the foot, whether it is normal, flat or high. Excessive body weight is known to be the
This study presents the first large-scale initiative for local reference charts. The growth reference would enable the growth assessment of a Malaysian child compared to the average growth of children in the country. It is suggested that the use of WHO 2006 Child Growth Standards should be complemented with local reference charts for a more wholesome growth assessment.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.