2021
DOI: 10.1016/j.ajhg.2021.03.012
|View full text |Cite
|
Sign up to set email alerts
|

Low-coverage sequencing cost-effectively detects known and novel variation in underrepresented populations

Abstract: Background : Genetic studies of biomedical phenotypes in underrepresented populations identify disproportionate numbers of novel associations. However, current genomics infrastructure--including most genotyping arrays and sequenced reference panels--best serves populations of European descent. A critical step for facilitating genetic studies in underrepresented populations is to ensure that genetic technologies accurately capture variation in all populations. Here, we quantify the accuracy of low-coverage sequ… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
42
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
7
2

Relationship

1
8

Authors

Journals

citations
Cited by 50 publications
(42 citation statements)
references
References 38 publications
0
42
0
Order By: Relevance
“…In particular, we found that ancestry scores were imputed with near perfect accuracy, providing a framework to easily incorporate genetic ancestry into the study of tumors from existing large-scale datasets and expand our knowledge of population-specific mechanisms[44]. Multiple emerging studies have demonstrated the utility of low-coverage sequencing of normal samples[17,21,22] and our work extends these findings to tumor-only sequencing data.…”
Section: Discussionmentioning
confidence: 67%
See 1 more Smart Citation
“…In particular, we found that ancestry scores were imputed with near perfect accuracy, providing a framework to easily incorporate genetic ancestry into the study of tumors from existing large-scale datasets and expand our knowledge of population-specific mechanisms[44]. Multiple emerging studies have demonstrated the utility of low-coverage sequencing of normal samples[17,21,22] and our work extends these findings to tumor-only sequencing data.…”
Section: Discussionmentioning
confidence: 67%
“…Recent work has shown that ultra low-coverage sequencing, including off-target regions from targeted sequencing, can be used to accurately impute common germline polymorphisms by leveraging linkage disequilibrium (LD) information within the low-coverage data[1622]. Here, we demonstrate that similar techniques can be used to infer common germline variation from targeted sequencing of tumors.…”
Section: Introductionmentioning
confidence: 84%
“…Previously, genotype concordances between low-coverage (~ 0.5 ×) and genotype array, and deep sequencing data (~ 30 ×) were highly correlated, and several studies have continuously demonstrated potentials of LPS for precision medicine [ 4 , 6 ]. Additionally, LPS under 1.0 × has shown strong advantages over genotype array in terms of cost and imputation accuracy [ 24 , 30 ]. Typically, LPS around 1.0 × is expected to be half of the cost of genotype array with less than 1 million variants.…”
Section: Discussionmentioning
confidence: 99%
“…As a result, disease risks can be mis-inferred and not yield accurate estimations depending on populations, highlighting the need of taking into account ancestry of study participants. Moreover, cost-effective alternatives to genotyping arrays, such as low-coverage sequencing (≥4X), have been shown to capture variants at all frequencies more precisely and to identify novel variation in underrepresented populations, as Africans ( Martin et al, 2021 ). The rest of the studies we have included (9) are based either on a single or on more omics technologies.…”
Section: Large Data Collections For Aging: a Survey Of Available Databases And Datasetsmentioning
confidence: 99%