In the process of data analysis, the investigator is often facing highly-volatile and random-appearing observed data. A vast body of literature shows that the assumption of underlying stochastic processes was not necessarily representing the nature of the processes under investigation and, when other tools were used, deterministic features emerged. Non Linear Time Series Analysis (NLTS) allows researchers to test whether observed volatility conceals systematic non linear behavior, and to rigorously characterize governing dynamics. Behavioral patterns detected by non linear time series analysis, along with scientific principles and other expert information, guide the specification of mechanistic models that serve to explain real-world behavior rather than merely reproducing it. Often there is a misconception regarding the complexity of the level of mathematics needed to understand and utilize the tools of NLTS (for instance Chaos theory). However, mathematics used in NLTS is much simpler than many other subjects of science, such as mathematical topology, relativity or particle physics. For this reason, the tools of NLTS have been confined and utilized mostly in the fields of mathematics and physics. However, many natural phenomena investigated I many fields have been revealing deterministic non linear structures. In this book we aim at presenting the theory and the empirical of NLTS to a broader audience, to make this very powerful area of science available to many scientific areas. This book targets students and professionals in physics, engineering, biology, agriculture, economy and social sciences as a textbook in Nonlinear Time Series Analysis (NLTS) using the R computer language.
The degeneracy of amino acid coding is one of the most crucial and enigmatic aspects of the genetic code. Different theories about the origin of the genetic code have been developed. However, to date, there is no comprehensive hypothesis on the mechanism that might have generated the degeneracy as we observe it. Here, we provide a new theory that explains the origin of the degeneracy based only on symmetry principles. The approach allows one to describe exactly the degeneracy of the early code (progenitor of the genetic code of LUCA, the last universal common ancestor) which is hypothesized to have the same degeneracy as the present vertebrate mitochondrial genetic code. The theory is based upon the tessera code, that fits as the progenitor of the early code. Moreover, we describe in detail the possible evolutionary transitions implied by our theory. The approach is supported by a unified mathematical framework that accounts for the degeneracy properties of both nuclear and mitochondrial genetic codes. Our work provides a new perspective to the understanding of the origin of the genetic code and the roles of symmetry principles in the organization of genetic information.
The study of correlation structures in DNA sequences is of great interest because it allows us to obtain structural and functional information about underlying genetic mechanisms. In this paper we present a study of the correlation structure of protein coding sequences of DNA based on a recently developed mathematical representation of the genetic code. A fundamental consequence of such representation is that codons can be assigned a parity class (odd-even). Such parity can be obtained by means of a nonlinear algorithm acting on the chemical character of the codon bases. In the same setting the Rumer's class can be naturally described and a new dichotomic class, the hidden class, can be defined. Moreover, we show that the set of DNA's base transformations associated to the three dichotomic classes can be put in a compact group-theoretic framework. We use the dichotomic classes as a coding scheme for DNA sequences and study the mutual dependence between such classes. The same analysis is carried out also on the chemical dichotomies of DNA bases. In both cases, the statistical analysis is performed by using an entropy-based dependence metric possessing many desirable properties. We obtain meaningful tests for mutual dependence by using suitable resampling techniques. We find strong short-range correlations between certain combinations of dichotomic codon classes. These results support our previous hypothesis that codon classes might play an active role in the organization of genetic information.
Summary:The origin of the genetic code represents one of the most challenging problems in molecular evolution. The genetic code is an important universal feature of extant organisms and indicates a common ancestry of different forms of life on earth. Known variants of the genetic code can be mainly divided in mitochondrial and nuclear classes. Here we provide a new insight on the origin of the mitochondrial genetic code: we found that its degeneracy distribution can be explained by using a mathematical approach recently developed for the description of the Euplotes nuclear variant of the genetic code. The results point to a primeval mitochondrial genetic code composed of four base codons, which we call tesserae, that, among other features, exhibit outstanding error detection capabilities. The theoretical description suggests also a formulation of a plausible biological theory about the origin of protein coding. Such theory is based on the symmetry properties of hypothetical primeval chemical adaptors between nucleic acids and amino acids (ancient tRNA's). Our paper provides a unified mathematical framework for different hypotheses on the origin of genetic coding. Also, it contributes to revisit our present view about the evolutionary steps that led to extant genetic codes by giving a new first-principles perspective on the difficult problem of the origin of the genetic code, and consequently, on the origin of life on earth.
In 1996 Arquès & Michel [(1996) J. Theor. Biol. 182, [45][46][47][48][49][50][51][52][53][54][55][56][57][58] discovered the existence of a common circular code in eukaryote and prokaryote genomes.Since then, circular code theory has provoked great interest and underwent a rapid development. In this paper we discuss some theoretical issues related to the synchronization properties of coding sequences and circular codes with particular emphasis on the problem of retrieval and maintenance of the reading frame. Motivated by the theoretical discussion, we adopt a rigorous statistical approach in order to try to answer to different questions. First, we investigate the covering capability of the whole class of 216 self-complementary, coding sequences are tailored as to maximize or minimize the coverage of circular codes on specific reading frames. Such optimization clearly relates the function of circular codes with reading frame synchronization.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.