This paper presents two in-depth studies on RnaPredict, an evolutionary algorithm for RNA secondary structure prediction. The first study is an analysis of the performance of two thermodynamic models, Individual Nearest Neighbor (INN) and Individual Nearest Neighbor Hydrogen Bond (INN-HB). The correlation between the free energy of predicted structures and the sensitivity is analyzed for 19 RNA sequences. Although some variance is shown, there is a clear trend between a lower free energy and an increase in true positive base pairs. With increasing sequence length, this correlation generally decreases. In the second experiment, the accuracy of the predicted structures for these 19 sequences are compared against the accuracy of the structures generated by the mfold dynamic programming algorithm (DPA) and also to known structures. RnaPredict is shown to outperform the minimum free energy structures produced by mfold and has comparable performance when compared to sub-optimal structures produced by mfold.
RNA is central in several stages of protein synthesis, and also has structural, functional, and regulatory roles in the cell. The shape of organic molecules such as RNA largely determines their function within an organic system, thus methods for the computational prediction of structure are sought after. In the ab initio case where only the RNA sequence is known, the currently dominant structure prediction techniques employ minimization of the free energy of a given RNA molecule via a thermodynamic model. However, the minimum free energy structure is rarely the native structure; this is thought to be due to errors in the thermodynamic model parameters, which are experimentally determined.Cluster analysis performed by [6] on a sampling of structures from a Boltzmann weighted ensemble determined that the best cluster centroid had an improved sensitivity and significantly improved positive predictive value over the minimum free energy structure in the ensemble. Based on this result, we investigated the combination of an existing evolutionary algorithm for RNA secondary structure prediction with a clustering algorithm.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.