For the first time in Europe hundreds of rare disease (RD) experts team up to actively share and jointly analyse existing patient’s data. Solve-RD is a Horizon 2020-supported EU flagship project bringing together >300 clinicians, scientists, and patient representatives of 51 sites from 15 countries. Solve-RD is built upon a core group of four European Reference Networks (ERNs; ERN-ITHACA, ERN-RND, ERN-Euro NMD, ERN-GENTURIS) which annually see more than 270,000 RD patients with respective pathologies. The main ambition is to solve unsolved rare diseases for which a molecular cause is not yet known. This is achieved through an innovative clinical research environment that introduces novel ways to organise expertise and data. Two major approaches are being pursued (i) massive data re-analysis of >19,000 unsolved rare disease patients and (ii) novel combined -omics approaches. The minimum requirement to be eligible for the analysis activities is an inconclusive exome that can be shared with controlled access. The first preliminary data re-analysis has already diagnosed 255 cases form 8393 exomes/genome datasets. This unprecedented degree of collaboration focused on sharing of data and expertise shall identify many new disease genes and enable diagnosis of many so far undiagnosed patients from all over Europe.
Reanalysis of inconclusive exome/genome sequencing data increases the diagnosis yield of patients with rare diseases. However, the cost and efforts required for reanalysis prevent its routine implementation in research and clinical environments. The Solve-RD project aims to reveal the molecular causes underlying undiagnosed rare diseases. One of the goals is to implement innovative approaches to reanalyse the exomes and genomes from thousands of well-studied undiagnosed cases. The raw genomic data is submitted to Solve-RD through the RD-Connect Genome-Phenome Analysis Platform (GPAP) together with standardised phenotypic and pedigree data. We have developed a programmatic workflow to reanalyse genome-phenome data. It uses the RD-Connect GPAP’s Application Programming Interface (API) and relies on the big-data technologies upon which the system is built. We have applied the workflow to prioritise rare known pathogenic variants from 4411 undiagnosed cases. The queries returned an average of 1.45 variants per case, which first were evaluated in bulk by a panel of disease experts and afterwards specifically by the submitter of each case. A total of 120 index cases (21.2% of prioritised cases, 2.7% of all exome/genome-negative samples) have already been solved, with others being under investigation. The implementation of solutions as the one described here provide the technical framework to enable periodic case-level data re-evaluation in clinical settings, as recommended by the American College of Medical Genetics.
Hereditary diffuse gastric cancer (HDGC) caused by CDH1 variants predisposes to early-onset diffuse gastric (DGC) and lobular breast cancer (LBC). In Northern Portugal, the unusually high number of HDGC cases in unrelated families carrying the c.1901C>T variant (formerly known as p.A634V) suggested this as a CDH1-founder variant. We aimed to demonstrate that c.1901C>T is a bona fide truncating variant inducing cryptic splicing, to calculate the timing of a potential founder effect, and to characterize tumour spectrum and age of onset in carrying families. The impact in splicing was proven by using carriers’ RNA for PCR-cloning sequencing and allelic expression imbalance analysis with SNaPshot. Carriers and noncarriers were haplotyped for 12 polymorphic markers, and the decay of haplotype sharing (DHS) method was used to estimate the time to the most common ancestor of c.1901C>T. Clinical information from 58 carriers was collected and analysed. We validated the cryptic splice site within CDH1-exon 12, which was preferred over the canonical one in 100% of sequenced clones. Cryptic splicing induced an out-of-frame 37bp deletion in exon 12, premature truncation (p.Ala634ProfsTer7), and consequently RNA mediated decay. The haplotypes carrying the c.1901C>T variant were found to share a common ancestral estimated at 490 years (95% Confidence Interval 445–10,900). Among 58 carriers (27 males (M)–31 females (F); 13–83 years), DGC occurred in 11 (18.9%; 4M–7F; average age 33 ± 12) and LBC in 6 females (19.4%; average age 50 ± 8). Herein, we demonstrated that the c.1901C>T variant is a loss-of-function splice-site variant that underlies the first CDH1-founder effect in Portugal. Knowledge on this founder effect will drive genetic testing of this specific variant in HDGC families in this geographical region and allow intrafamilial penetrance analysis and better estimation of variant-associated tumour risks, disease age of onset, and spectrum.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.