2014
DOI: 10.1109/tcbb.2013.146
|View full text |Cite
|
Sign up to set email alerts
|

Improved and Promising Identification of Human MicroRNAs by Incorporating a High-Quality Negative Set

Abstract: MicroRNA (miRNA) plays an important role as a regulator in biological processes. Identification of (pre-) miRNAs helps in understanding regulatory processes. Machine learning methods have been designed for pre-miRNA identification. However, most of them cannot provide reliable predictive performances on independent testing data sets. We assumed this is because the training sets, especially the negative training sets, are not sufficiently representative. To generate a representative negative set, we proposed a … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
153
0

Year Published

2015
2015
2022
2022

Publication Types

Select...
8
2

Relationship

2
8

Authors

Journals

citations
Cited by 217 publications
(154 citation statements)
references
References 28 publications
1
153
0
Order By: Relevance
“…Features derived from RNA sequences or their predicted secondary structures were proposed to capture the characteristics of miRNAs, for example, k-mer (sub-sequence of RNAs) is one of the main sequence-based features reflecting the local sequence composition of RNAs (Wei et al, 2014). Because most of the pre-miRNAs have the characteristic of stemloop hairpin structures (Xue et al, 2005), some features were constructed based on the predicted secondary structures so as to reflect this characteristic; e.g., a set of 32 local triplet sequence-structure features were used in the Triplet-SVM (Xue et al, 2005) to predict the human miRNAs (Xue et al, 2005).…”
Section: Introductionmentioning
confidence: 99%
“…Features derived from RNA sequences or their predicted secondary structures were proposed to capture the characteristics of miRNAs, for example, k-mer (sub-sequence of RNAs) is one of the main sequence-based features reflecting the local sequence composition of RNAs (Wei et al, 2014). Because most of the pre-miRNAs have the characteristic of stemloop hairpin structures (Xue et al, 2005), some features were constructed based on the predicted secondary structures so as to reflect this characteristic; e.g., a set of 32 local triplet sequence-structure features were used in the Triplet-SVM (Xue et al, 2005) to predict the human miRNAs (Xue et al, 2005).…”
Section: Introductionmentioning
confidence: 99%
“…In our future work, we will try other features considering long range sequence information, which is indicted to be useful for the enhancer classification in our current work. We will also try the imbalanced classifiers [52][53][54][55][56]on our dataset, which has been employed CD-HIT and random sampling strategy for the large negative data. Combined with some more sophisticated machine learning models and feature reduction methods [57], we anticipate better performance can be achieved.…”
Section: Resultsmentioning
confidence: 99%
“…mirnaDetect (http://datamining.xmu.edu.cn/main/~leyiwei/mirnaDetect.html) a tool for detecting potential pre-miRNAs from the genome-scale data was also used to predict the possible miRNAs in Niger [15]. The secondary structures of putative pre-miRNAs were predicted by Mfold (http://mfold.rna.albany.edu) [16].…”
Section: Identification Of Conserved Mirnamentioning
confidence: 99%