2011
DOI: 10.1590/s1415-47572011000200031
|View full text |Cite
|
Sign up to set email alerts
|

Rules extraction from neural networks applied to the prediction and recognition of prokaryotic promoters

Abstract: Promoters are DNA sequences located upstream of the gene region and play a central role in gene expression. Computational techniques show good accuracy in gene prediction but are less successful in predicting promoters, primarily because of the high number of false positives that reflect characteristics of the promoter sequences. Many machine learning methods have been used to address this issue. Neural Networks (NN) have been successfully used in this field because of their ability to recognize imprecise and … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
6
0

Year Published

2013
2013
2019
2019

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 13 publications
(6 citation statements)
references
References 18 publications
0
6
0
Order By: Relevance
“…Among these methods, several machine learning algorithms have been used in developing prokaryotic promoter region prediction methods. For example, support vector machine (SVM) [ 7 , 8 , 12 , 25 , 26 ], artificial neural networks (ANNs) [ 16 , 17 , 20 , 27 29 ], partial least square (PLS) [ 18 ], and quadratic discriminant analysis (QDS) [ 14 ]. Some methods are based on probabilistic approaches (e.g., hidden Markov models (HMMs) [ 30 ] and a combination of HMMs and ANNs [ 31 ]).…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Among these methods, several machine learning algorithms have been used in developing prokaryotic promoter region prediction methods. For example, support vector machine (SVM) [ 7 , 8 , 12 , 25 , 26 ], artificial neural networks (ANNs) [ 16 , 17 , 20 , 27 29 ], partial least square (PLS) [ 18 ], and quadratic discriminant analysis (QDS) [ 14 ]. Some methods are based on probabilistic approaches (e.g., hidden Markov models (HMMs) [ 30 ] and a combination of HMMs and ANNs [ 31 ]).…”
Section: Introductionmentioning
confidence: 99%
“…Therefore, developers have to generate their non-promoter sequences. Several strategies for generating non-promoter sequences have been used including: randomly generated sequences [ 16 , 17 , 28 ]; sequences extracted from intergenic or coding regions [ 7 , 11 , 12 , 14 , 15 , 18 , 25 , 28 ]; ii) Feature extraction: Several sequence and structure-based feature representations have been used for developing prokaryotic promoter region prediction methods. Examples of sequence based features include: k-mer representation [ 7 , 12 , 28 , 32 ], variable-window Z-curve [ 18 ], and nucleotide identity (NID) [ 17 ].…”
Section: Introductionmentioning
confidence: 99%
“…Previous studies have confirmed that certain promoters can be identified or predicted based on ANN method [21], [22], [23], [24], [25], [26], [27], [28], but no further effort was reported for quantitative description of their strength. Here, we constructed a finely characterzied Trc promoter & RBS library for sufficient model training and greatly improved the prediction accuracy compared with previous reported methods (PLS-, PWM- and thermodynamics-based) [10], [11], [12].…”
Section: Discussionmentioning
confidence: 99%
“…It can be adapted to continuously change the network structure based on input/output information during learning phase, which could reflect the non-linear relationships between quantitative characteristics and related qualitative performance in complex phenomena. Thus, ANNs have been widely used to various biological research fields such as protein structure and stability prediction [17], [18], [19], RNA secondary structure prediction [20], as well as promoter recognition and structure analysis [21], [22], [23], [24], [25], [26], [27], [28]. In this work, we constructed a high-performance ANN model to directly predict the strength of regulatory element from its sequence.…”
Section: Introductionmentioning
confidence: 99%
“…To overcome this obstacle, several researchers have directly translated the nucleotides in promoter sequences into digits, resulting in digital vectors that resemble the DNA sequences. Different approaches have been adopted to accommodate the variable distances between motifs, including initial sequence alignment 20 and coupling SVM with a sequence alignment kernel to affine gaps in the input sequences 7 . In some studies, the DNA sequences were broken down into collections of oligomers tagged with information on their locations relative to TSSs 6 , 21 .…”
Section: Introductionmentioning
confidence: 99%