2021
DOI: 10.1101/2021.08.04.455085
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

GEPSi: A Python Library to Simulate GWAS Phenotype Data

Abstract: MotivationMany computational methods aim to identify genetic variants associated with diseases and complex traits. Due to the absence of ground truth data, simulated genotype and phenotype data is needed to benchmark these methods. However, phenotypes are frequently simulated as an additive function of randomly selected variants, neglecting biological complexity such as non-random occurrence of causal SNPs, epistatic effects, heritability and dominance. Including such features would improve benchmarking studie… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 23 publications
(23 reference statements)
0
2
0
Order By: Relevance
“…For the Lasso and a few simple convex models, coordinate descent has been broadly disseminated to practitioners in off-the-shelf packages such as glmnet [Friedman et al, 2007] or scikit-learn [Pedregosa et al, 2011]. More recently, celer, a state-of-the-art convex working set algorithm [Massias et al, 2020] allowed for successful applications of the Lasso in very high dimensional problems in medicine [Reidenbach et al, 2021, Kim et al, 2021 or seismology [Muir and Zhan, 2021].…”
Section: Pred Rmse Left-outmentioning
confidence: 99%
“…For the Lasso and a few simple convex models, coordinate descent has been broadly disseminated to practitioners in off-the-shelf packages such as glmnet [Friedman et al, 2007] or scikit-learn [Pedregosa et al, 2011]. More recently, celer, a state-of-the-art convex working set algorithm [Massias et al, 2020] allowed for successful applications of the Lasso in very high dimensional problems in medicine [Reidenbach et al, 2021, Kim et al, 2021 or seismology [Muir and Zhan, 2021].…”
Section: Pred Rmse Left-outmentioning
confidence: 99%
“…Accordingly, the corresponding simulators have also been developed. For instance, many recent simulators support the generation of multiple correlated phenotypes (Meyer and Birney, 2018; O’Reilly et al ., 2012; Porter and O’Reilly, 2017), and some simulators support the epistatic effects between causal genetic factors (Fernandes and Lipka, 2020; Reidenbach et al ., 2021).…”
Section: Introductionmentioning
confidence: 99%