CAG repeats occur predominantly in the coding regions of human genes, which suggests their functional importance. In some genes, these sequences can undergo pathogenic expansions leading to neurodegenerative polyglutamine (poly-Q) diseases. The mutant transcripts containing expanded CAG repeats possibly contribute to pathogenesis in addition to the well-known pathogenic effects of mutant proteins. We have analysed two crystal forms of RNA duplexes containing CAG repeats: (GGCAGCAGCC)2. One of the structures has been determined at atomic resolution (0.95 Å) and the other at 1.9 Å. The duplexes include non-canonical A–A pairs that fit remarkably well within a regular A-helix. All the adenosines are in the anti-conformation and the only interaction within each A–A pair is a single C2-H2···N1 hydrogen bond. Both adenosines in each A–A pair are shifted towards the major groove, although to different extents; the A which is the H-bond donor stands out more (the ‘thumbs-up’ conformation). The main effect on the helix conformation is a local unwinding. The CAG repeats and the previously examined CUG structures share a similar pattern of electrostatic charge distribution in the minor groove, which could explain their affinity for the pathogenesis-related MBNL1 protein.
Tracks containing CUG repeats are abundant in human gene transcripts. Their biological role includes modulation of pre-mRNA splicing, mRNA transport and regulation of translation. Expanded forms of CUG runs are associated with pathogenesis of several neurodegenerative diseases, including myotonic dystrophy type 1. We have analysed two crystal structures of RNA duplexes containing the CUG repeats: G(CUG)2C and (CUG)6. The first of the structures, analysed at 1.23 Å resolution, is of an oligomer designed by us. The second model was obtained after ‘detwinning’ the 1.58 Å X-ray data previously deposited in the PDB. The RNA duplexes are in the A-form in which all the C–G pairs form Watson–Crick interactions while all the uridine pairs can be described as U•U cis wobble having only one hydrogen bond between the bases. The residue, which accepts the H-bond, is inclined towards the minor groove. This previously unreported base pairing can be described as ‘stretched U–U wobble’. The regular hydrogen-bonding pattern of interactions with the solvent, the electrostatic charge distribution and surface features indicate the ligand binding potential of the CUG tracks.
The CGG repeats are present in the 5′-untranslated region (5′-UTR) of the fragile X mental retardation gene FMR1 and are associated with two diseases: fragile X-associated tremor ataxia syndrome (FXTAS) and fragile X syndrome (FXS). FXTAS occurs when the number of repeats is 55–200 and FXS develops when the number exceeds 200. FXTAS is an RNA-mediated disease in which the expanded CGG tracts form stable structures and sequester important RNA binding proteins. We obtained and analysed three crystal structures of double-helical CGG repeats involving unmodified and 8-Br modified guanosine residues. Despite the presence of the non-canonical base pairs, the helices retain an A-form. In the G–G pairs one guanosine is always in the syn conformation, the other is anti. There are two hydrogen bonds between the Watson–Crick edge of G(anti) and the Hoogsteen edge of G(syn): O6·N1H and N7·N2H. The G(syn)-G(anti) pair shows affinity for binding ions in the major groove. G(syn) causes local unwinding of the helix, compensated elsewhere along the duplex. CGG helical structures appear relatively stable compared with CAG and CUG tracts. This could be an important factor in the RNA’s ligand binding affinity and specificity.
PNA is a promising molecule for antisense therapy of trinucleotide repeat disorders. We present the first crystal structures of RNA–PNA duplexes. They contain CUG repeats, relevant to myotonic dystrophy type I, and CAG repeats associated with poly-glutamine diseases. We also report the first PNA–PNA duplex containing mismatches. A comparison of the PNA homoduplex and the PNA–RNA heteroduplexes reveals PNA's intrinsic structural properties, shedding light on its reported sequence selectivity or intolerance of mismatches when it interacts with nucleic acids. PNA has a much lower helical twist than RNA and the resulting duplex has an intermediate conformation. PNA retains its overall conformation while locally there is much disorder, especially peptide bond flipping. In addition to the Watson–Crick pairing, the structures contain interesting interactions between the RNA's phosphate groups and the Π electrons of the peptide bonds in PNA.
CCG repeats are highly over-represented in exons of the human genome. Usually they are located in the 5′ UTR but are also abundant in translated sequences. The CCG repeats are associated with three tri-nucleotide repeat disorders: Huntington’s disease, myotonic dystrophy type 1 and chromosome X-linked mental retardation (FRAXE). In this study, we present two crystal structures containing double-stranded CCG repeats: one of an RNA in the native form, and one containing LNA nucleotides. Both duplexes form A-helices but with strands slipped in the 5′ (native structure) or the 3′ direction (LNA-containing structure). As a result, one of two expected C-C pairs is eliminated from the duplex. Each of the three observed C-C pairs interacts differently, forming either one weak H-bond or none. LNA nucleotides have no apparent effect on the helical parameters but the base stacking is increased compared to the native duplex and the distribution of electrostatic potential in the major groove is changed. The CCG crystal structures explain the thermodynamic fragility of CCG runs and throw light on the observation that the MBNL1 protein recognises CCG runs, as well as CUG and CAG, but not the relatively stable CGG repeats.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.