Understanding the relationship between amino acid sequences and folding rates of proteins is an important challenge in computational and molecular biology. All existing algorithms for predicting protein folding rates have never taken into account the sequence coupling effects. In this work, a novel algorithm was developed for predicting the protein folding rates from amino acid sequences. The prediction was achieved on the basis of dipeptide composition, in which the sequence coupling effects are explicitly included through a series of conditional probability elements. Based on a non-redundant dataset of 99 proteins, the proposed method was found to provide an excellent agreement between the predicted and experimental folding rates of proteins when evaluated with the jackknife test. The correlation coefficient was 87.7% and the standard error was 2.04, which indicated the important contribution from sequence coupling effects to the determination of protein folding rates.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.