2018
DOI: 10.1142/s0219720018500105
|View full text |Cite
|
Sign up to set email alerts
|

New variable selection strategy for analysis of high-dimensional DNA methylation data

Abstract: In genetic association studies, regularization methods are often used due to their computational efficiency for analysis of high-dimensional genomic data. DNA methylation data generated from Infinium HumanMethylation450 BeadChip Kit have a group structure where an individual gene consists of multiple Cytosine-phosphate-Guanine (CpG) sites. Consequently, group-based regularization can precisely detect outcome-related CpG sites. Representative examples are sparse group lasso (SGL) and network-based regularizatio… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
9
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
6
1

Relationship

1
6

Authors

Journals

citations
Cited by 10 publications
(9 citation statements)
references
References 23 publications
0
9
0
Order By: Relevance
“…Therefore, computation of selection probability does not require to choose the optimal tuning parameter. For this reason, selection probability has been widely used to identify top ranked genetic variants or genes that are associated with a phenotype outcome for analysis of high-dimensional genomic data (Alexander and Lange, 2011;Sun et al, 2017;Choi et al, 2018;Kim and Sun, 2019). However, selection probability has never been applied to multivariate regularization methods.…”
Section: Methodsmentioning
confidence: 99%
See 3 more Smart Citations
“…Therefore, computation of selection probability does not require to choose the optimal tuning parameter. For this reason, selection probability has been widely used to identify top ranked genetic variants or genes that are associated with a phenotype outcome for analysis of high-dimensional genomic data (Alexander and Lange, 2011;Sun et al, 2017;Choi et al, 2018;Kim and Sun, 2019). However, selection probability has never been applied to multivariate regularization methods.…”
Section: Methodsmentioning
confidence: 99%
“…Note that the q-dimensional vectorβ j (I m ; λ, α) has either all zero values or all nonzero values because of a group lasso penalty in (2.2). Similarly, selection probability of the individual elastic-net method in (2.1) can be computed and details are described by Sun et al, 2017;Choi et al, 2018;Kim and Sun, 2019). However, the selection probability of the j th variant of the elastic-net can have up to q different values since the coefficient vector is estimated from each of q phenotype outcomes.…”
Section: Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…Feature selection methods and statistical learning methods such as sparse Group LASSO and network regularization have identified important CpGs in highly complex data. [29][30][31][32][33] More recent work has called for a greater understanding of the implications of DNAm-DNAm interactions through the incorporation of Gaussian Graphical Models, Canonical Correlation Analysis, and module discovery through weighted gene co-methylation networks. [34][35][36][37][38][39][40][41][42][43][44][45][46][47][48][49][50] There is growing support for the use of novel deep learning methods to aggregate, group, and select CpGs by their local context (e.g., genes) in an effort to connect and interpret the data with clinical outcomes.…”
Section: Introductionmentioning
confidence: 99%