2004
DOI: 10.1590/s1415-47572004000400031
|View full text |Cite
|
Sign up to set email alerts
|

Evaluation of noise reduction techniques in the splice junction recognition problem

Abstract: The Human Genome Project has generated a large amount of sequence data. A number of works are currently concerned with analyzing these data. One of the analyses carried out is the identification of genes' structures on the sequences obtained. As such, one can search for particular signals associated with gene expression. Splice junctions represent a type of signal present on eukaryote genes. Many studies have applied Machine Learning techniques in the recognition of such regions. However, most of the genetic d… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
10
0

Year Published

2009
2009
2019
2019

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 27 publications
(10 citation statements)
references
References 25 publications
(18 reference statements)
0
10
0
Order By: Relevance
“…A properly designed data filtering mechanism is considered to favor the system performance and efficiency [17]. In this paper, we adopt a filtering approach proposed in [33] to increase the signal-to-noise ratio of the investigated dataset by discarding background k-mers without false dismissals.…”
Section: B Search Space Reductionmentioning
confidence: 99%
See 1 more Smart Citation
“…A properly designed data filtering mechanism is considered to favor the system performance and efficiency [17]. In this paper, we adopt a filtering approach proposed in [33] to increase the signal-to-noise ratio of the investigated dataset by discarding background k-mers without false dismissals.…”
Section: B Search Space Reductionmentioning
confidence: 99%
“…Recently, the concept of negative data filtering (e.g. noise reduction) has been successfully applied for solving various problems in Bioinformatics [17]- [20]. Therefore, we believe a properly designed filtering mechanism by removing the biologically-irrelevant background sequences without touching any true motifs could increase the signal-to-noise ratio of the DNA sequences and potentially improve the predication accuracy.…”
Section: Introductionmentioning
confidence: 99%
“…It is generally accepted that inductive learning systems in the medical domain are dependent on the quality of the data (Pechenizkiy et al, 2006) and there has been significant research into data cleansing in bioinformatics (Malossini et al, 2006;Gamberger et al, 2000;Lorena and Carvalho, 2004;Tang and Chen, 2008a,b). Although instance based techniques such as k-NN are not generally used for classification in much of this research, noise reduction is an important element in the process as it can result in the simplification of the models created.…”
Section: Motivationmentioning
confidence: 99%
“…Although instance based techniques such as k-NN are not generally used for classification in much of this research, noise reduction is an important element in the process as it can result in the simplification of the models created. Lorena and Carvalho (2004), for example, found that preprocessing the training data to remove noise resulted in simplifications in induced SVM classifiers and higher comprehensiveness in induced decision tree classifiers.…”
Section: Motivationmentioning
confidence: 99%
See 1 more Smart Citation