2019
DOI: 10.1186/s12859-019-2978-z
|View full text |Cite
|
Sign up to set email alerts
|

Stochastic Lanczos estimation of genomic variance components for linear mixed-effects models

Abstract: Background Linear mixed-effects models (LMM) are a leading method in conducting genome-wide association studies (GWAS) but require residual maximum likelihood (REML) estimation of variance components, which is computationally demanding. Previous work has reduced the computational burden of variance component estimation by replacing direct matrix operations with iterative and stochastic methods and by employing loose tolerances to limit the number of iterations in the REML optimization procedure. H… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
7
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
4

Relationship

1
3

Authors

Journals

citations
Cited by 4 publications
(7 citation statements)
references
References 25 publications
0
7
0
Order By: Relevance
“…We used GCTA v1.91.3b 20 to construct genomic related matrices and perform HE regression. We obtained using BOLT-LMM v2.3.4 51 for computational efficiency; though BOLT-LMM uses a randomized algorithm, its numerical accuracy is comparable to that of the exact algorithm implemented GCTA 52 .…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…We used GCTA v1.91.3b 20 to construct genomic related matrices and perform HE regression. We obtained using BOLT-LMM v2.3.4 51 for computational efficiency; though BOLT-LMM uses a randomized algorithm, its numerical accuracy is comparable to that of the exact algorithm implemented GCTA 52 .…”
Section: Methodsmentioning
confidence: 99%
“…We therefore utilized a modified Python implementation of the REML algorithm presented in ref. 52 (available at https://github.com/rborder/SL_REML 56 ). We used LDAK v5.0 to obtain adjusted HE regression estimates 53 .…”
Section: Methodsmentioning
confidence: 99%
“…This article presents a new software tool to address the challenge, which we refer to as SLEMM (short for Stochastic-Lanczos-Expedited Mixed Models). SLEMM can efficiently perform Stochastic Lanczos Restricted Maximum Likelihood (REML) analysis ( Border and Becker, 2019 ) for millions of genotyped individuals and weight SNPs based on proximity and minor allele frequency (MAF). Because of linkage disequilibrium (LD) between SNPs, effect sizes of SNPs that are close to each other tend to be similar in model fitting ( Yang and Tempelman 2012 ; Zeng et al 2018 ), which stimulates proximity-based SNP weighting.…”
Section: Introductionmentioning
confidence: 99%
“…On the other hand, the inversion of the covariance matrix is required to perform on each computation of likelihood, an operation that is proportional to the cube of individual number. As a result, improving the computational efficiency of REML for estimating variance components has become one of the research hotspots [8][9][10][11][12].…”
Section: Introductionmentioning
confidence: 99%
“…Ganjgahi et al [4] proposed a weighted least squares REML (WLS-REML) using a noniterative one-step random effect estimator to decrease the computational cost. Border and Becker [12] developed stochastic Lanczos derivative-free REML and Lanczos first-order Monte Carlo REML to further improve the computing speed. However, these existing methods for REML variance components estimation are mainly aimed at single-locus LMM, which is not effective enough to handle a complex genetic structure.…”
Section: Introductionmentioning
confidence: 99%