2010
DOI: 10.1002/gepi.20543
|View full text |Cite
|
Sign up to set email alerts
|

SNP Selection in genome‐wide and candidate gene studies via penalized logistic regression

Abstract: Penalized regression methods offer an attractive alternative to single marker testing in genetic association analysis. Penalized regression methods shrink down to zero the coefficient of markers that have little apparent effect on the trait of interest, resulting in a parsimonious subset of what we hope are true pertinent predictors. Here we explore the performance of penalization in selecting SNPs as predictors in genetic association studies. The strength of the penalty can be chosen either to select a good p… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

4
209
1

Year Published

2011
2011
2014
2014

Publication Types

Select...
6
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 178 publications
(214 citation statements)
references
References 46 publications
4
209
1
Order By: Relevance
“…Yu and colleagues (2005) proposed a unified mixed model taking into account the genetic structure of the sample, based on single locus analysis. This model is being updated by integrating a multi-locus analysis (Ayers et al 2010).In autogamous crops, it is expected that large extent of LD will reduce the resolution and risks to lead to false positive associations. Nevertheless, successful results have been obtained in selfing crops (Atwell, Huang et al 2010;Ramsay, Comadran et al 2011).…”
Section: Association Genetics: New Valorization Of Natural Diversitymentioning
confidence: 99%
“…Yu and colleagues (2005) proposed a unified mixed model taking into account the genetic structure of the sample, based on single locus analysis. This model is being updated by integrating a multi-locus analysis (Ayers et al 2010).In autogamous crops, it is expected that large extent of LD will reduce the resolution and risks to lead to false positive associations. Nevertheless, successful results have been obtained in selfing crops (Atwell, Huang et al 2010;Ramsay, Comadran et al 2011).…”
Section: Association Genetics: New Valorization Of Natural Diversitymentioning
confidence: 99%
“…The second value (T = 0.3) is just below the threshold of "useful LD" (r = 0.316 or r 2 = 0.1) as defined by Kruglyak (1999) and Pritchard and Przeworski (2001) as the value at which the sample size is increased at most 10-fold. The last value (T = 0.25) is close to the threshold used in a previous comparison study (Ayers and Cordell 2010). For completeness, we also provide tFDR values at the remaining thresholds used for TPR2 (0.9, 0.7).…”
Section: Comparison Among Methodsmentioning
confidence: 82%
“…The "usual" approaches include CV and the use of model selection criteria such as AIC, BIC, and EBIC. While CV and AIC are generally useful for prediction rather than model/variable selection, BIC tends to select a true sparse model but does not achieve sufficient sparsity in high-dimensional very sparse settings such as linkage (QTL) mapping and GWAS for which modified BIC criteria have been proposed (e.g., Bogdan et al 2004 Ayers and Cordell (2010) used data permutation to determine the value of the tuning parameter. Multistage strategies have also been suggested as an approach to performing error control with PR (Meinshausen et al 2009;Wasserman and Roeder 2009).…”
Section: Mcp and Scadmentioning
confidence: 99%
See 1 more Smart Citation
“…We show that: (a) WGR behaves as a local smoother, (b) because of that, the problem of missing heritability and that of predictive performance are very distinct subjects: the former is largely dependent on the extent to which a WGR model can describe patterns of co-variability among true genetic values; on the other hand, prediction accuracy is largely dependent on local sample size, (c) that although imperfect LD between markers and QTL may explain part of the missing heritability, other factors such as miss-specified genetic architecture or familiar relationships play an important role on determining the extent of missing heritability and the predictive performance. Advocates of such models have postulated that they benefit from higher power and lower type-I error rates because they take into account all the available genetic information simultaneously (Hoggart et al, 2008;Ayers et al, 2010). Given the typical high dimensionality context of the studies, clear candidates for the joint modeling of genetic markers have been penalized regression methods as well as Bayesian methods (de los Campos et al, 2010).…”
mentioning
confidence: 99%