2021
DOI: 10.1101/2021.11.03.466843
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Biobank-scale inference of ancestral recombination graphs enables genealogy-based mixed model association of complex traits

Abstract: Accurate inference of gene genealogies from genetic data has the potential to facilitate a wide range of analyses. We introduce a method for accurately inferring biobank-scale genome-wide genealogies from sequencing or genotyping array data, as well as strategies to utilize genealogies within linear mixed models to perform association and other complex trait analyses. We use these new methods to build genome-wide genealogies using genotyping data for 337,464 UK Biobank individuals and to detect associations in… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
45
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
4
1

Relationship

0
5

Authors

Journals

citations
Cited by 22 publications
(62 citation statements)
references
References 69 publications
0
45
0
Order By: Relevance
“…In addition, while the accuracy of Gamma-SMC is comparable to current methods, it could be further improved. Directions include incorporating genetic maps for inference; using more accurate demographic models when building the flow field; incorporating a post-hoc TMRCA normalization step [29]; and using the conditioned sample frequency spectrum [22, 27], which utilizes allele frequencies in inference. In terms of speed, GPUs may offer additional acceleration.…”
Section: Discussionmentioning
confidence: 99%
See 2 more Smart Citations
“…In addition, while the accuracy of Gamma-SMC is comparable to current methods, it could be further improved. Directions include incorporating genetic maps for inference; using more accurate demographic models when building the flow field; incorporating a post-hoc TMRCA normalization step [29]; and using the conditioned sample frequency spectrum [22, 27], which utilizes allele frequencies in inference. In terms of speed, GPUs may offer additional acceleration.…”
Section: Discussionmentioning
confidence: 99%
“…One exciting potential application of Gamma-SMC is in constructing an ARG. Indeed, the clear hierarchical structure evidenced in the pairwise posterior TMRCAs (Figure 3) reflects the genealogical tree structure at a site which, when inferred along the genome, gives rise to the tree sequence representation of an ARG [29]. Gamma-SMC may improve branch placement and timing in the weaving step of ARG building methods such as ARGweaver [24] and ARG-Needle [29], as the continuous nature of the posterior distributions of coalescence times in Gamma-SMC may help resolve ambiguities present in discrete time ap-proximations.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…The Li-Stephens model 15 is widely used for imputation 54 and phasing, 55 and it is also an important component of scalable genealogy inference methods like tsinfer and relate 18,19 . Large-scale genealogies, in turn, have enabled a variety of powerful methods in statistical and population genetics 26,27,56 . In this study, we leveraged genome-wide genealogies to derive LDGMs and to address computational challenges associated with LD and ancestral diversity.…”
Section: Discussionmentioning
confidence: 99%
“…Capitalizing on the limited number of common ancestral haplotypes at most loci, tree sequences provide a highly compact representation of human genetic data 18,20 . Tree sequences, and the closely related ancestral recombination graph, have enabled powerful new methods for understanding ancestral relationships [21][22][23] , measuring selection 19,24,25 , and analyzing complex traits 26,27 .…”
Section: Introductionmentioning
confidence: 99%