2021
DOI: 10.1101/2021.04.06.438656
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Bonsai: An efficient method for inferring large human pedigrees from genotype data

Abstract: Pedigree inference from genotype data is a challenging problem, particularly when pedigrees are sparsely sampled and individuals may be distantly related to their closest genotyped relatives. We present a new method that infers small pedigrees of close relatives and then assembles them into larger pedigrees. To assemble large pedigrees, we introduce several new formulas and tools including a new likelihood for the degree separating two small pedigrees, a method for detecting individuals who share background id… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2021
2021
2021
2021

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 26 publications
(36 reference statements)
0
2
0
Order By: Relevance
“…PRIMUS [19] was excluded because it did not run successfully with the IBD values in real and simulated mice analyzed here. Finally, BONSAI [32] was also not evaluated, because it is intended for sparse pedigrees and contains hard-coded IBD and age distributions optimized for humans. We used three datasets to compare the methods: 1) a subset of a population of wild house mice from a long-term study, 2) five simulated datasets based on the genotypes of the founders of the long-term house mice study, and 3) a published dataset of a cattle pedigree and its genotypes [13,33].…”
Section: Comparison To Other Methodsmentioning
confidence: 99%
“…PRIMUS [19] was excluded because it did not run successfully with the IBD values in real and simulated mice analyzed here. Finally, BONSAI [32] was also not evaluated, because it is intended for sparse pedigrees and contains hard-coded IBD and age distributions optimized for humans. We used three datasets to compare the methods: 1) a subset of a population of wild house mice from a long-term study, 2) five simulated datasets based on the genotypes of the founders of the long-term house mice study, and 3) a published dataset of a cattle pedigree and its genotypes [13,33].…”
Section: Comparison To Other Methodsmentioning
confidence: 99%
“…One notable example is the UK Biobank wherein roughly 30% of genotyped individuals have a third degree (e.g., first cousin) or closer relative in the study 2 . Applications that make use of genetic relatives are numerous and varied and include pedigree reconstruction 3,4 , pedigree-based linkage analysis for disease and trait mapping 5 , heritability estimation 6,7 , forensic genetics 8 , and genetic genealogy 9 -a popular tool among direct-to-consumer genetic testing customers. On the other hand, traditional genome-wide association study tests and many population genetic models assume that the study samples are unrelated, and, as such, must exclude inferred relatives to avoid spurious signals or inaccurate parameter estimates 10 .…”
Section: Introductionmentioning
confidence: 99%