2020
DOI: 10.1101/2020.08.10.245043
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Why are rare variants hard to impute? Coalescent models reveal theoretical limits in existing algorithms

Abstract: Genotype imputation is an indispensable step in human genetic studies. Large reference panels with deeply sequenced genomes now allow interrogating variants with minor allele frequency < 1% without sequencing. While it is critical to consider limits of this approach, imputation methods for rare variants have only done so empirically; the theoretical basis of their imputation accuracy has not been explored. To provide theoretical consideration of imputation accuracy under the current imputation framework, we… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(3 citation statements)
references
References 35 publications
0
3
0
Order By: Relevance
“…Our results are consistent with findings from a previous study 45 , which showed across 492 traits a strong colocalization between common and rare coding variants associated with the same trait. Nevertheless, our conclusions remain limited by the relatively low performances of imputation in this MAF regime 46,47 . Therefore, large samples with wholegenome sequences will be required to robustly address this question.…”
Section: Discussionmentioning
confidence: 77%
“…Our results are consistent with findings from a previous study 45 , which showed across 492 traits a strong colocalization between common and rare coding variants associated with the same trait. Nevertheless, our conclusions remain limited by the relatively low performances of imputation in this MAF regime 46,47 . Therefore, large samples with wholegenome sequences will be required to robustly address this question.…”
Section: Discussionmentioning
confidence: 77%
“…Third, although we have focused on leveraging an ARG inferred from array data alone, ARG-Needle currently enables building an ARG using a mixture of sequencing and array data. This approach may be used to perform ARG-based genotype imputation, which is likely to improve upon approaches that do not accurately model the TMRCA between target and reference samples [52]. We performed preliminary simulation-based analyses using this strategy, and obtained promising results (see Methods, Supplementary Fig.…”
Section: Discussionmentioning
confidence: 99%
“…In fact, rare genetic variants with minor allele frequency (MAF) of ≤1% may significantly contribute to stroke heritability. In GWASs, the imputation accuracy of rare variants may be limited, and largely depends on the minor allele count (MAC) in the reference sample 26 . Rare variants can be reliably studied with nextgeneration sequencing-based techniques such as whole-genome sequencing (WGS) and whole-exome sequencing (WES).…”
Section: Introductionmentioning
confidence: 99%