2019
DOI: 10.1038/s41598-018-36946-4
|View full text |Cite
|
Sign up to set email alerts
|

Nucleotide-level Convolutional Neural Networks for Pre-miRNA Classification

Abstract: Due to the biogenesis difference, miRNAs can be divided into canonical microRNAs and mirtrons. Compared to canonical microRNAs, mirtrons are less conserved and hard to be identified. Except stringent annotations based on experiments, many in silico computational methods have be developed to classify miRNAs. Although several machine learning classifiers delivered high classification performance, all the predictors depended heavily on the selection of calculated features. Here, we introduced nucleotide-level con… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
17
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 19 publications
(17 citation statements)
references
References 24 publications
0
17
0
Order By: Relevance
“…The results of this study have been obtained for amino acid sequences. It would be needed as a future work to investigate if this effect of padding on model performance can be translated to other biological sequences that are also one-hot encoded and padded, such as RNA 37 , 38 and miRNA 39 or DNA sequences 6 .…”
Section: Discussionmentioning
confidence: 99%
“…The results of this study have been obtained for amino acid sequences. It would be needed as a future work to investigate if this effect of padding on model performance can be translated to other biological sequences that are also one-hot encoded and padded, such as RNA 37 , 38 and miRNA 39 or DNA sequences 6 .…”
Section: Discussionmentioning
confidence: 99%
“…The results of this study have been obtained for amino acid sequences. It would be needed as a future work to check if this effect of padding on model performance can be translated to other biological sequences that are also one-hot encoded and padded, such as RNA [35,36] and miRNA [37] or DNA sequences [6].…”
Section: Discussionmentioning
confidence: 99%
“…Through machine learning, a sequence can be easily judged as a pre-miRNA or not. Based on the ratio between positive and negative data, these predictors can be divided into two types: one is the balanced data (or low unbalanced data) based predictors [32][33][34][35][36][37][38][39][40][41][42][43] and others are the highly unbalanced data based predictors [31,[44][45][46]. Moreover, these predictors also can be roughly divided into human pre-miRNA predictors [31-35, 37, 38, 40, 42, 43], plant pre-miRNA predictors [36,39] and multi-species pre-miRNA predictors [41,[44][45][46][47] according to the species.…”
Section: Introductionmentioning
confidence: 99%
“…In the same year, Xue et al proposed a descriptor that formulates local contiguous structuresequence characteristics from pre-miRNAs, and then combined with SVM to construct the triplet-SVM classifier [33]. neural networks to the prediction of pre-miRNA [42]. Fu et al achieved better results in human pre-miRNA prediction [43].…”
Section: Introductionmentioning
confidence: 99%