2014
DOI: 10.1186/1471-2105-15-124
|View full text |Cite
|
Sign up to set email alerts
|

The discriminant power of RNA features for pre-miRNA recognition

Abstract: BackgroundComputational discovery of microRNAs (miRNA) is based on pre-determined sets of features from miRNA precursors (pre-miRNA). Some feature sets are composed of sequence-structure patterns commonly found in pre-miRNAs, while others are a combination of more sophisticated RNA features. In this work, we analyze the discriminant power of seven feature sets, which are used in six pre-miRNA prediction tools. The analysis is based on the classification performance achieved with these feature sets for the trai… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

1
20
0

Year Published

2014
2014
2020
2020

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 36 publications
(21 citation statements)
references
References 48 publications
1
20
0
Order By: Relevance
“…The application of machine learning to biological data has become important and in pre-miRNA analysis it has become indispensable (Jiang et al, 2007; Lopes, Schliep & De Carvalho, 2014; Gudyś et al, 2013; Ding, Zhou & Guan, 2010; Bentwich, 2008; Batuwita & Palade, 2009; Van der Burgt et al, 2009; Gao et al, 2013). ML is a system which is influenced by different choices that can be made, for example, the selected training and testing datasets, feature selection, and the choice of classification algorithm.…”
Section: Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…The application of machine learning to biological data has become important and in pre-miRNA analysis it has become indispensable (Jiang et al, 2007; Lopes, Schliep & De Carvalho, 2014; Gudyś et al, 2013; Ding, Zhou & Guan, 2010; Bentwich, 2008; Batuwita & Palade, 2009; Van der Burgt et al, 2009; Gao et al, 2013). ML is a system which is influenced by different choices that can be made, for example, the selected training and testing datasets, feature selection, and the choice of classification algorithm.…”
Section: Resultsmentioning
confidence: 99%
“…The ‘pseudo’ dataset (8,492 hairpins) is a popular negative dataset used in various studies (Jiang et al, 2007; Chen, Wang & Liu, 2016; Lopes, Schliep & De Carvalho, 2014) on the detection of pre-miRNAs and it was downloaded from Ng & Mishra (2007). No other negative dataset has been used by more than one study on pre-miRNA detection.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…Thus, ML practitioners frequently perform feature selection procedures to select the most informative features to train useful models. The effects of feature selection in pre-miRNA prediction have been shown in the works of (13,14), where a large number of features are reduced to the most informative ones. Furthermore, the performance of ML models is also tied to the training and testing datasets.…”
Section: Introductionmentioning
confidence: 99%
“…The selection and validation of feature set and advances in machine learning techniques give way to the development algorithms with very high accuracy. Recently, studies are conducted exclusively to determine discriminant power of features selected in pre-microRNA identification [26]. A significant change arose due the advent of Next Generation Sequencing techniques, a quite a few algorithms based on enormous data output from sequencing techniques are also available.…”
Section: Introductionmentioning
confidence: 99%