2019
DOI: 10.1002/humu.23717
|View full text |Cite
|
Sign up to set email alerts
|

High prevalence of cancer‐associated TP53 variants in the gnomAD database: A word of caution concerning the use of variant filtering

Abstract: The 1,000 genome project, the Exome Aggregation Consortium (ExAC) or the Genome Aggregation database (gnomAD) datasets, were developed to provide largescale reference data of genetic variations for various populations to filter out common benign variants and identify rare variants of clinical importance based on their frequency in the human population. Using a TP53 repository of 80,000 cancer variants, as well as TP53 variants from multiple cancer genome projects, we have defined a set of certified oncogenic T… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
37
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
7
1

Relationship

4
4

Authors

Journals

citations
Cited by 21 publications
(37 citation statements)
references
References 38 publications
0
37
0
Order By: Relevance
“…A full analysis of the 2018 release of the database was recently published in collaboration with the TCGA and will not be reiterated here. The new release of the database (100K_UMD TP53 database, October 2019) includes more than 100,000 TP53 mutations and a novel feature developed for the analysis of TP53 variants, the cancer shared dataset (CSD), which comprises only variants highly likely to be pathogenic 15 (see " Methods "). The CSD includes 258 TP53 SNS common to four different large independent sequencing datasets (see " Methods ").…”
Section: Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…A full analysis of the 2018 release of the database was recently published in collaboration with the TCGA and will not be reiterated here. The new release of the database (100K_UMD TP53 database, October 2019) includes more than 100,000 TP53 mutations and a novel feature developed for the analysis of TP53 variants, the cancer shared dataset (CSD), which comprises only variants highly likely to be pathogenic 15 (see " Methods "). The CSD includes 258 TP53 SNS common to four different large independent sequencing datasets (see " Methods ").…”
Section: Resultsmentioning
confidence: 99%
“…For the present analysis, we developed upon the CSD, a specific TP53 benchmark dataset that includes only highly likely pathogenic variants 15 . As variant recurrence is one of the strongest indicators of pathogenicity, using four independent sources of TP53 variants, each deploying a different analysis methodology, should negate methodological issues for the specific selection of pathogenic variants.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…Among the many types of annotations that can be assigned to each sequenced NGS variant, of particular interest are those predicting if a variant has a benign or deleterious phenotypic effect. However, causative relationships between genes and phenotypes are very complex, depending on many connections and influences, and there are major challenges in obtaining the associations [2]. In the following paragraphs, we briefly overview the state-of-the-art approaches in annotating with phenotypic effects, focusing mostly on an array of predicted pathogenicity scores, and on population allele frequencies, as it is also very popular for estimation of pathogenicity.…”
Section: Introductionmentioning
confidence: 99%
“…With the growing availability of NGS technology in medical diagnostics, quantitatively exemplified by 68,000 genetic tests currently offered in clinics according to Genetests [1], the amount of experimentally sequenced data on human genetic variation in both healthy and patient populations is rapidly increasing. Accurate and exhaustive variant annotation is important for every application of NGS technology, including development of therapies, selection of effective individualized therapy, and comparing multiple samples in clinical studies, to mention a few; however, currently there are major challenges associated with sequencing data analysis and interpretation [2]. While most of the prevalent or common variants (about one million of currently sequenced variants) have been annotated as benign (neutral) based on their high frequency of occurrence in the healthy population [3], the remaining majority are ultra-rare Variants of Unknown Significance (VUS).…”
Section: Introductionmentioning
confidence: 99%