2012
DOI: 10.1089/cmb.2011.0197
|View full text |Cite
|
Sign up to set email alerts
|

A Classification of Bioinformatics Algorithms from the Viewpoint of Maximizing Expected Accuracy (MEA)

Abstract: Many estimation problems in bioinformatics are formulated as point estimation problems in a high-dimensional discrete space. In general, it is difficult to design reliable estimators for this type of problem, because the number of possible solutions is immense, which leads to an extremely low probability for every solution-even for the one with the highest probability. Therefore, maximum score and maximum likelihood estimators do not work well in this situation although they are widely employed in a number of … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
18
0

Year Published

2013
2013
2018
2018

Publication Types

Select...
5
2

Relationship

3
4

Authors

Journals

citations
Cited by 12 publications
(18 citation statements)
references
References 88 publications
0
18
0
Order By: Relevance
“…Note that CentroidAlign employs an estimator based on maximum expected accuracy (MEA), which has been successfully applied in much software in the field of bioinformatics; see the review by Hamada and Asai [24] for details. In CentroidAlign, the sum-of-pair scores (SPS) [25] is optimized for predicting multiple alignments of RNA sequences (cf.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…Note that CentroidAlign employs an estimator based on maximum expected accuracy (MEA), which has been successfully applied in much software in the field of bioinformatics; see the review by Hamada and Asai [24] for details. In CentroidAlign, the sum-of-pair scores (SPS) [25] is optimized for predicting multiple alignments of RNA sequences (cf.…”
Section: Methodsmentioning
confidence: 99%
“…( a ) The input is two RNA sequences, ( x; x ′ ), to be aligned; ( b-1 ) The exact algorithm of CentroidAlign considers a probability distribution of structural alignments between x and x ′ , which gives simultaneously the alignments between nucleotides and those between base-pairs (e.g., Sankoff model [4]); ( b-2 ) The exact case can be approximated by factorizing the distribution of structural alignments into (i) a distribution of secondary structures of x (e.g., the CONTRAfold [27] or McCaskill [22] models); (ii) a distribution of pairwise alignments between x and x ′ (e.g., the CONTRAlign model [23]); and (iii) a distribution of secondary structures of x ′ ; ( c ) By marginalization of the distribution(s) in (b), we obtain a distribution of alignments (*) in which the information about secondary structures is included; ( d ) The best multiple alignment is estimated based on maximizing expected accuracy (MEA) [24] in which the SPS scores of predicted alignments are optimized with respect to the distribution (*) of pairwise alignments given in (c). It should be emphasized that the computational cost of the exact algorithm is ≈ O ( L 6 ), while it is reduced to ≈ O ( L 3 ) in the approximate algorithm, where L is the (maximum) length of two input sequences.…”
Section: Figure A1mentioning
confidence: 99%
“…where G(θ, y) is called the gain function, which returns the similarity between two solutions in Y . When the gain function is designed according to an accuracy or evaluation measure for the target problem, in which y and θ are considered as a prediction and reference, respectively, the estimator is often called a maximum expected accuracy (MEA) estimator [57,58,59] 6 . MEA estimators predict the solution by maximizing the expected accuracy when the solutions are distributed according to p(θ|D).…”
Section: Definitionmentioning
confidence: 99%
“…In addition to the above examples, many algorithms in bioinformatics can be classified, from the viewpoint of MEA/MEG, with respect to gain function and predictive space. See [59] for a review of MEA estimators.…”
Section: Other Examplesmentioning
confidence: 99%
“…Many other decoding criteria were proposed (Hamada and Asai, 2012). For example, we can assign labels to states of the HMM, and then search for the most probable sequence of labels instead of the most probable state path.…”
Section: Introductionmentioning
confidence: 99%