2014
DOI: 10.1089/cmb.2013.0099
|View full text |Cite
|
Sign up to set email alerts
|

A Bayesian Sampler for Optimization of Protein Domain Hierarchies

Abstract: The process of identifying and modeling functionally divergent subgroups for a specific protein domain class and arranging these subgroups hierarchically has, thus far, largely been done via manual curation. How to accomplish this automatically and optimally is an unsolved statistical and algorithmic problem that is addressed here via Markov chain Monte Carlo sampling. Taking as input a (typically very large) multiple-sequence alignment, the sampler creates and optimizes a hierarchy by adding and deleting leaf… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
54
0

Year Published

2014
2014
2024
2024

Publication Types

Select...
5
1

Relationship

3
3

Authors

Journals

citations
Cited by 33 publications
(54 citation statements)
references
References 25 publications
0
54
0
Order By: Relevance
“…The MAP, which corresponds to the maximum LLR (denoted here as LLR MAP ), is the value to which the sampler (with simulated annealing) tends to converge, though this is not guaranteed. As illustrated by Neuwald (2014), however, obtaining the same maximum LLR in multiple sampling runs using distinct random seeds suggests that a nearly optimal hierarchy is typically found. Of course, when comparing such MAP-associated LLRs for distinct hierarchies, it is essential that the same input alignment be used throughout.…”
Section: Measures Of Hierarchy Qualitymentioning
confidence: 96%
See 4 more Smart Citations
“…The MAP, which corresponds to the maximum LLR (denoted here as LLR MAP ), is the value to which the sampler (with simulated annealing) tends to converge, though this is not guaranteed. As illustrated by Neuwald (2014), however, obtaining the same maximum LLR in multiple sampling runs using distinct random seeds suggests that a nearly optimal hierarchy is typically found. Of course, when comparing such MAP-associated LLRs for distinct hierarchies, it is essential that the same input alignment be used throughout.…”
Section: Measures Of Hierarchy Qualitymentioning
confidence: 96%
“…We define as corresponding those pairs of nodes or subtrees, one from each hierarchy, with ‡ 90% overlap in their sequence sets; as roughly corresponding such pairs with > 50% but < 90% overlap; and as noncorresponding the remaining nodes. We represent sequence sets using a previously described data structure (Neuwald and Green, 1994) facilitating efficient bitwise set operations.…”
Section: Comparing Two Hierarchiesmentioning
confidence: 99%
See 3 more Smart Citations