2020
DOI: 10.1101/2020.10.26.20220004
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Discovery of runs-of-homozygosity diplotype clusters and their associations with diseases in UK Biobank

Abstract: Runs of homozygosity (ROH) segments, contiguous homozygous regions in a genome were traditionally linked to families and inbred populations. However, a growing literature suggests that ROHs are ubiquitous in outbred populations. Still, most existing genetic studies of ROH in populations are limited to aggregated ROH content across the genome, which does not offer the resolution for mapping causal loci. This limitation is mainly due to a lack of methods for efficient identification of shared ROH diplotypes. Her… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
8
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
3
2
1

Relationship

6
0

Authors

Journals

citations
Cited by 6 publications
(11 citation statements)
references
References 49 publications
(59 reference statements)
0
8
0
Order By: Relevance
“…We computed all haplotypes on autosomes, with length ≥ 0.5 cM shared by 500 or more individuals from the UK Biobank using a PBWT-block algorithm. 47 In each simulation replicate, we used a random sample of N = 400,000 individuals and randomly sampled a causal haplotype. We simulated a continuous phenotype Y i for individual i as Y i = 0.05 age i + 0.5 sex i + H i γ + b i + ε i , where age i , sex i , b i and ε i were simulated using the same parameter settings as in the type I error simulations, and H i was the number of causal haplotypes carried by individual i , with possible values 0, 1, 2.…”
Section: Methodsmentioning
confidence: 99%
“…We computed all haplotypes on autosomes, with length ≥ 0.5 cM shared by 500 or more individuals from the UK Biobank using a PBWT-block algorithm. 47 In each simulation replicate, we used a random sample of N = 400,000 individuals and randomly sampled a causal haplotype. We simulated a continuous phenotype Y i for individual i as Y i = 0.05 age i + 0.5 sex i + H i γ + b i + ε i , where age i , sex i , b i and ε i were simulated using the same parameter settings as in the type I error simulations, and H i was the number of causal haplotypes carried by individual i , with possible values 0, 1, 2.…”
Section: Methodsmentioning
confidence: 99%
“…PBWT facilitates an efficient approach to enumerate all pairwise haplotype matches longer than a given length. While Durbin's algorithm outputs all pairwise matches in O(N M + #matches), a block-based approach [14][15][16] can enumerate all matching blocks without explicitly outputting all pairs in O(N M ). By sorting the haplotype sequences based on their reversed prefix order, the longest match for each haplotype sequence is placed in the adjacent position.…”
Section: Preliminariesmentioning
confidence: 99%
“…Moreover, all pairwise haplotype sequences at the site k that are identical for at least L sites from k are separated by a haplotype sequence j with the condition d k [j] > k − L [13]. We define a L-block at a site k as a set of haplotype indices that share long matches with each other ending at site k with a minimum length of L. All L-blocks at any site k may be efficiently scanned by consensus PBWT algorithms [14][15][16].…”
Section: Preliminariesmentioning
confidence: 99%
“…In addition, we simulated haplotype effects and compared FiMAP with IBD segments called with length ≥ 3 cM, 5 cM, 10 cM with the GWAS single-variant test. Specifically, we estimated the start and end positions for all haplotypes with length ≥ 0.5 cM shared by at least 500 individuals in the UK Biobank using a PBWT-block algorithm [ 58 ]. In each simulation replicate, we used a random sample of N = 400,000 individuals and randomly sampled a causal haplotype from an autosome.…”
Section: Verification and Comparisonmentioning
confidence: 99%