2008
DOI: 10.1007/978-3-540-85984-0_115
|View full text |Cite
|
Sign up to set email alerts
|

A Comparative Study of Machine Learning Methods for Detecting Promoters in Bacterial DNA Sequences

Abstract: Abstract. Machine Learning methods have been widely used in bioinformatics, mainly for data classification and pattern recognition. The detection of genes in DNA sequences is still an open problem. Identifying the promoter region laying prior the gene itself is an important aid to detect a gene. This paper aims at applying several Machine Learning methods to the construction of classifiers for detection of promoters in the DNA of Escherichia coli. A thorough comparison of methods was done. In general, probabil… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
12
0

Year Published

2009
2009
2023
2023

Publication Types

Select...
3
2
2

Relationship

1
6

Authors

Journals

citations
Cited by 11 publications
(12 citation statements)
references
References 9 publications
0
12
0
Order By: Relevance
“…The classifiers chosen are based on previous research in sequence classifications which had yielded better results. 38 Table 5 summarizes the comparative analysis of our approach with various traditional classification systems. The Appendix defines the parameters for each of the methods.…”
Section: Resultsmentioning
confidence: 99%
“…The classifiers chosen are based on previous research in sequence classifications which had yielded better results. 38 Table 5 summarizes the comparative analysis of our approach with various traditional classification systems. The Appendix defines the parameters for each of the methods.…”
Section: Resultsmentioning
confidence: 99%
“…The direct comparison of results obtained here with other classifiers for the same data sets (such as [21]) may lead to interpretation errors, since different partitions of the data sets and cross-validation procedures can be used. However, as shown in [19] probabilistic-like methods, such as WWAM and MDD, tend to achieve better predictive accuracy than other methods for biological sequences.…”
Section: Discussionmentioning
confidence: 94%
“…Such a measure, used for two-class problems, is the Matthews Correlation Coefficient (MCC) (Eq. 4), regarded as a balanced measure and frequently used in bioinformatics [1], [19].…”
Section: B Predictive Accuracy Measuresmentioning
confidence: 99%
“…The ROC plot is a useful technique for visualizing and comparing classifiers and is commonly used in decision making in ML, data mining and Bioinformatics (Sing et al, 2005;Tavares et al, 2008). It is constructed using the performance rate of the classifiers.…”
Section: Experiments and Resultsmentioning
confidence: 99%
“…HMMs offer the advantages of having strong statistical foundations that are well-suited to natural language domains and they are computationally efficient (Seymore et al, 1999). Therefore, HMMs have been used for pattern recognition in many domains and, in special, in Bioinformatics (Durbin et al, 1998;Tavares et al, 2008).…”
Section: Hidden Markov Modelsmentioning
confidence: 99%