2011
DOI: 10.1534/genetics.110.122614
|View full text |Cite
|
Sign up to set email alerts
|

Distinguishing Positive Selection From Neutral Evolution: Boosting the Performance of Summary Statistics

Abstract: Summary statistics are widely used in population genetics, but they suffer from the drawback that no simple sufficient summary statistic exists, which captures all information required to distinguish different evolutionary hypotheses. Here, we apply boosting, a recent statistical method that combines simple classification rules to maximize their joint predictive performance. We show that our implementation of boosting has a high power to detect selective sweeps. Demographic events, such as bottlenecks, do not … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

2
153
0

Year Published

2013
2013
2021
2021

Publication Types

Select...
3
2
2

Relationship

1
6

Authors

Journals

citations
Cited by 104 publications
(155 citation statements)
references
References 47 publications
2
153
0
Order By: Relevance
“…We note that our feature set bears some conceptual similarity to that of Lin et al (2011). With the exception of the iHH features, Lin et al (2011) effectively learn from a small set of weighted linear combinations of the scaled SFS.…”
Section: Comparison With Previous Learning-based Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…We note that our feature set bears some conceptual similarity to that of Lin et al (2011). With the exception of the iHH features, Lin et al (2011) effectively learn from a small set of weighted linear combinations of the scaled SFS.…”
Section: Comparison With Previous Learning-based Methodsmentioning
confidence: 99%
“…In fact, our analysis shows that the small difference in observed power is explained by the iHH features ( Figure S4). Furthermore, the fundamental nature of our feature set enables our framework to potentially learn linear combinations that are not captured by the set used in Lin et al (2011). In addition, we emphasize that although SFselect-s and Lin et al (2011) show similar power in the parameter-specific case, in practice the parameters of a selective sweep are seldom known.…”
Section: Comparison With Previous Learning-based Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…This explains why we use boosting in connection with regression here. In Lin et al (2011) on the other hand, Y has been taken as a binary variable with 0 indicating neutral evolution and 1 positive selection. This led to a problem of classification.…”
Section: Regression and Boostingmentioning
confidence: 99%
“…A promising method for estimating the best possible model is boosting (Bühlmann and Hothorn 2007), a machine learning method, which can be used both for classification and regression. In another population genetic setup, Lin et al (2011) implemented boosting successfully to distinguish positive selection and neutral evolution, by solving a binary classification problem.…”
mentioning
confidence: 99%