1995
DOI: 10.2307/2533318
|View full text |Cite
|
Sign up to set email alerts
|

Estimation and Reliability of Molecular Sequence Alignments

Abstract: SummaryThe problem of estimating the relatedness of a pa1r of biological sequences IS addressed. A stochastic model of sequence evolution is described that allows insertion and deletion as well as replacement of amino acid residues (or substitution of nucleotides) over time. An expectation-maximization (EM) algorithm that obtains maximum likelihood estimates of the model parameters is introduced. The method assumes the sequences are related by descent from a common ancestor but the alignment (i.e., the precise… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
14
0

Year Published

1995
1995
2005
2005

Publication Types

Select...
4
3
2

Relationship

2
7

Authors

Journals

citations
Cited by 26 publications
(14 citation statements)
references
References 22 publications
0
14
0
Order By: Relevance
“…The 18 regions were also aligned by an ML algorithm with a 95% arc probability (12,13). This method includes only those sections (or arcs on a directed graph) for which the sum of the probabilities of all alignments that contain that arc is >95% of the sum of probabilities of all possible alignments (13).…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…The 18 regions were also aligned by an ML algorithm with a 95% arc probability (12,13). This method includes only those sections (or arcs on a directed graph) for which the sum of the probabilities of all alignments that contain that arc is >95% of the sum of probabilities of all possible alignments (13).…”
Section: Methodsmentioning
confidence: 99%
“…The substitutions remaining after the application of these rules to the alignments were then analyzed. The second method was to use a maximum likelihood (ML) alignment algorithm with an arc probability of 95% (12,13).…”
mentioning
confidence: 99%
“…a conserved hexamer is expected to occur by chance about once in every eight promoters on average. We realize that these "back of the envelope" calculations must be interpreted cautiously because it is unlikely that even a probable optimal multiple sequence alignment will correspond precisely to the biological events that generated the sequences being aligned (Thorne and Churchill 1995), but in any case, four-way alignments seem to be the minimum required for efficient recognition of conserved regulatory sequences.…”
Section: Number Of Species For Genome Sequencingmentioning
confidence: 99%
“…However, a maximum likelihood distance method that retains the full information of sequences is also available. The program Statalign (Thorne, Kishino & Felsenstein 1991, Thorne & Kishino 1992, Thorne & Churchill 1995 does not require an alignment, it calculates pairwise maximum likelihood distances among all possible pairs of sequences. The resulting distance matrix, containing the distances and their standard deviations, is subjected to tree inference using the program Modfitch35, which is a modified version of the Fitch program (Felsenstein 1989), included in the Statalign package.…”
Section: Phylogenetic Analysesmentioning
confidence: 99%