Abstract:Purpose
Data sharing through ClinVar offers a unique opportunity to identify interpretation differences between laboratories. As part of a ClinGen initiative, four clinical laboratories (Ambry, GeneDx, Partners Healthcare Laboratory for Molecular Medicine, and University of Chicago Genetic Services Laboratory) collaborated to identify the basis of interpretation differences and to investigate if data sharing and reassessment resolves interpretation differences by analyzing a subset of variants.
Methods
ClinV… Show more
“…Answering this question is a central challenge because it is clear that clinical laboratories are currently having trouble with using functional data to agree on variant interpretations. 80 The large scale of MAVE data provides several key advantages that could help alleviate the problematic nature of using functional data for interpretation. First, all variants within a MAVE are tested simultaneously so that measurements for different variants are readily comparable to each other, whereas one-at-atime assays are performed by different personnel in different research labs at different times.…”
Section: Limitations Of Maves and How To Overcome Themmentioning
Classical genetic approaches for interpreting variants, such as case-control or co-segregation studies, require finding many individuals with each variant. Because the overwhelming majority of variants are present in only a few living humans, this strategy has clear limits. Fully realizing the clinical potential of genetics requires that we accurately infer pathogenicity even for rare or private variation. Many computational approaches to predicting variant effects have been developed, but they can identify only a small fraction of pathogenic variants with the high confidence that is required in the clinic. Experimentally measuring a variant's functional consequences can provide clearer guidance, but individual assays performed only after the discovery of the variant are both time and resource intensive. Here, we discuss how multiplex assays of variant effect (MAVEs) can be used to measure the functional consequences of all possible variants in disease-relevant loci for a variety of molecular and cellular phenotypes. The resulting large-scale functional data can be combined with machine learning and clinical knowledge for the development of ''lookup tables'' of accurate pathogenicity predictions. A coordinated effort to produce, analyze, and disseminate large-scale functional data generated by multiplex assays could be essential to addressing the variant-interpretation crisis.
“…Answering this question is a central challenge because it is clear that clinical laboratories are currently having trouble with using functional data to agree on variant interpretations. 80 The large scale of MAVE data provides several key advantages that could help alleviate the problematic nature of using functional data for interpretation. First, all variants within a MAVE are tested simultaneously so that measurements for different variants are readily comparable to each other, whereas one-at-atime assays are performed by different personnel in different research labs at different times.…”
Section: Limitations Of Maves and How To Overcome Themmentioning
Classical genetic approaches for interpreting variants, such as case-control or co-segregation studies, require finding many individuals with each variant. Because the overwhelming majority of variants are present in only a few living humans, this strategy has clear limits. Fully realizing the clinical potential of genetics requires that we accurately infer pathogenicity even for rare or private variation. Many computational approaches to predicting variant effects have been developed, but they can identify only a small fraction of pathogenic variants with the high confidence that is required in the clinic. Experimentally measuring a variant's functional consequences can provide clearer guidance, but individual assays performed only after the discovery of the variant are both time and resource intensive. Here, we discuss how multiplex assays of variant effect (MAVEs) can be used to measure the functional consequences of all possible variants in disease-relevant loci for a variety of molecular and cellular phenotypes. The resulting large-scale functional data can be combined with machine learning and clinical knowledge for the development of ''lookup tables'' of accurate pathogenicity predictions. A coordinated effort to produce, analyze, and disseminate large-scale functional data generated by multiplex assays could be essential to addressing the variant-interpretation crisis.
“…However, NGS also yields increasing numbers of variants that predominantly are of unknown significance and compounds the challenge of variant interpretation (Good, Ainscough, McMichael, Su, & Griffith, 2014; Kamps et al., 2017). As clinical analysis of large volumes of patient variant data becomes increasingly difficult, inconsistencies increase both in variant interpretation and reporting between laboratories (Harrison et al., 2017). This issue is compounded by propagation of these inconsistencies to widely accessed knowledgebases (Hoskinson, Dubuc, & Mason‐Suares, 2017; Yorczyk, Robinson, & Ross, 2015).…”
Harmonization of cancer variant representation, efficient communication, and free distribution of clinical variant‐associated knowledge are central problems that arise with increased usage of clinical next‐generation sequencing. The Clinical Genome Resource (ClinGen) Somatic Working Group (WG) developed a minimal variant level data (MVLD) representation of cancer variants, and has an ongoing collaboration with Clinical Interpretations of Variants in Cancer (CIViC), an open‐source platform supporting crowdsourced and expert‐moderated cancer variant curation. Harmonization between MVLD and CIViC variant formats was assessed by formal field‐by‐field analysis. Adjustments to the CIViC format were made to harmonize with MVLD and support ClinGen Somatic WG curation activities, including four new features in CIViC: (1) introduction of an assertions feature for clinical variant assessment following the Association of Molecular Pathologists (AMP) guidelines, (2) group‐level curation tracking for organizations, enabling member transparency, and curation effort summaries, (3) introduction of ClinGen Allele Registry IDs to CIViC, and (4) mapping of CIViC assertions into ClinVar submission with automated submissions. A generalizable workflow utilizing MVLD and new CIViC features is outlined for use by ClinGen Somatic WG task teams for curation and submission to ClinVar, and provides a model for promoting harmonization of cancer variant representation and efficient distribution of
this information.
“…In many of the instances of discrepancies in assertion, consensus has been achieved simply by sharing evidence previously unavailable to one party
7–
9 . Harrison
et al found that 87.2% of discordant variants were resolved by reassessment and data sharing
8 . Several initiatives
5,
10–
14 have had success in encouraging the public sharing of datasets and new studies, but for the foreseeable future, private data will continue to be a challenge to achieving consensus.…”
Section: Introductionmentioning
confidence: 99%
“…Recently, in the field of cardiovascular disease, there have been several high-profile instances of cardiovascular variants deemed to be highly pathogenic, yet not segregating with disease
9,
16,
17 . This unfortunate outcome is inevitable owing to the aforementioned reasons and illustrates a key issue: the continual need to share and reconcile new information with old data and reclassify clinical assertions as appropriate on a regular basis
6,
8 .…”
While ClinVar has become an indispensable resource for clinical variant interpretation, its sophisticated structure provides it with a daunting learning curve. Often the sheer depth of types of information provided can make it difficult to analyze variant information with high throughput. Clinotator is a fast and lightweight tool to extract important aspects of criteria-based clinical assertions; it uses that information to generate several metrics to assess the strength and consistency of the evidence supporting the variant clinical significance. Clinical assertions are weighted by significance type, age of submission and submitter expertise category to filter outdated or incomplete assertions that otherwise confound interpretation. This can be accomplished in batches: either lists of Variation IDs or dbSNP rsIDs, or with vcf files that are additionally annotated. Using sample sets ranging from 15,000–50,000 variants, we slice out problem variants in minutes without extensive computational effort (using only a personal computer) and corroborate recently reported trends of discordance hiding amongst the curated masses. With the rapidly growing body of variant evidence, most submitters and researchers have limited resources to devote to variant curation. Clinotator provides efficient, systematic prioritization of discordant variants in need of reclassification. The hope is that this tool can inform ClinVar curation and encourage submitters to keep their clinical assertions current by focusing their efforts. Additionally, researchers can utilize new metrics to analyze variants of interest in pursuit of new insights into pathogenicity.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.