It has long been known that canonical 5′ splice site (5′SS) GT>GC variants may be compatible with normal splicing. However, to date, the actual scale of canonical 5′SSs capable of generating wild‐type transcripts in the case of GT>GC substitutions remains unknown. Herein, combining data derived from a meta‐analysis of 45 human disease‐causing 5′SS GT>GC variants and a cell culture‐based full‐length gene splicing assay of 103 5′SS GT>GC substitutions, we estimate that ~15–18% of canonical GT 5′SSs retain their capacity to generate between 1% and 84% normal transcripts when GT is substituted by GC. We further demonstrate that the canonical 5′SSs in which substitution of GT by GC‐generated normal transcripts exhibit stronger complementarity to the 5′ end of U1 snRNA than those sites whose substitutions of GT by GC did not lead to the generation of normal transcripts. We also observed a correlation between the generation of wild‐type transcripts and a milder than expected clinical phenotype but found that none of the available splicing prediction tools were capable of reliably distinguishing 5′SS GT>GC variants that generated wild‐type transcripts from those that did not. Our findings imply that 5′SS GT>GC variants in human disease genes may not invariably be pathogenic.
BACKGROUND
We previously showed that several variations in the RHD gene, including synonymous changes, can be classified as splice site variants and may play a direct role in D variant phenotype expression. We sought to extend our study to additional candidates, notably in the first and last exons of the gene, by engineering a novel universal splice reporting vector, i.e., minigene.
STUDY DESIGN AND METHODS
Our previous plasmid construct was modified to allow subcloning of any exon(s) of interest for assessing effect of variations on splicing. Seventeen novel and/or uncharacterized variations of the RHD gene were selected for the study and tested in our novel model.
RESULTS
We engineered and validated a novel universal minigene for assessing virtually any variations of interest for splicing defect. Of the 17 variants tested in the novel model, 11 were shown to alter splicing either totally or partially, including the silent c.1065C>T variation, which induces major skipping of exon 7, and may therefore be responsible for reducing D antigen expression. We also showed that while all three missense variations c.1154G>C, c.1154G>T, and c.1154G>A in exon 9 are splice site variants, splicing is differentially altered and D‐negative phenotype observed in the presence of the latter substitution is likely due to a defect in RhD protein folding.
CONCLUSION
Overall, we hypothesize that splicing alteration is likely to be a common mechanism of D phenotype variation that has been underestimated so far. Further large‐scale studies are necessary to demonstrate this statement definitely.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.