5-Methylcytosine (m5C) plays an extremely important role in the basic biochemical process. With the great increase of identified m5C sites in a wide variety of organisms, their epigenetic roles become largely unknown. Hence, accurate identification of m5C site is a key step in understanding its biological functions. Over the past several years, more attentions have been paid on the identification of m5C sites in multiple species. In this work, we firstly summarized the current progresses in computational prediction of m5C sites and then constructed a more powerful and reliable model for identifying m5C sites. To train the model, we collected experimentally confirmed m5C data from Homo sapiens, Mus musculus, Saccharomyces cerevisiae and Arabidopsis thaliana, and compared the performances of different feature extraction methods and classification algorithms for optimizing prediction model. Based on the optimal model, a novel predictor called iRNA-m5C was developed for the recognition of m5C sites. Finally, we critically evaluated the performance of iRNA-m5C and compared it with existing methods. The result showed that iRNA-m5C could produce the best prediction performance. We hope that this paper could provide a guide on the computational identification of m5C site and also anticipate that the proposed iRNA-m5C will become a powerful tool for large scale identification of m5C sites.
Simple sequence repeats (SSRs) are rare (approximately 1%) in most genomes and are generally considered to have no function. However, penaeid shrimp genomes have a high proportion of SSRs (>23%), raising the question of whether these SSRs play important functional and evolutionary roles in these SSR-rich species. Here, we show that SSRs drive genome plasticity and adaptive evolution in two penaeid shrimp species, Fenneropenaeus chinensis and Litopenaeus vannamei. Assembly and comparison of genomes of these two shrimp species at the chromosome-level revealed that transposable elements serve as carriers for SSR expansion, which is still occurring. The remarkable genome plasticity identified herein might have been shaped by significant SSR expansions. SSRs were also found to regulate gene expression by multi-omics analyses, and be responsible for driving adaptive evolution, such as the variable osmoregulatory capacities of these shrimp under low-salinity stress. These data provide strong evidence that SSRs are an important driver of the adaptive evolution in penaeid shrimp.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.