Inordinate expansion and hypermethylation of the fragile X DNA triplet repeat, (GGC).-(GCC)., are correlated with the ability of the individual G-and C-rich single strands to form hairpin structures. Two-dimensional NMR and gel electrophoresis studies show that both the Gand C-rich single strands form hairpins under physiological conditions. This propensity of hairpin formation is more pronounced for the C-rich strand than for the G-rich strand. This observation suggests that the C-rich strand is more likely to form hairpin or "slippage" structure and show asymmetric strand expansion during replication. NMR data also show that the hairpins formed by the C-rich strands fold in such a way that the cytosine at the CpG step of the stem is C-C paired.The presence ofa C C mismatch at the CpG site generates local flexibility, thereby providing analogs of the transition to the methyltransferase. In other words, the hairpins of the C-rich strand act as better substrates for the human methyltransferase than the Watson-Crick duplex or the G-rich strand. Therefore, hairpin formation could account for the specific methylation of the CpG island in the fragile X repeat that occurs during inactivation of the FMR1 gene during the onset of the disease.Simple tandemly repeated DNA sequences are interspersed in both transcribed and nontranscribed regions of chromosomes (1-3). The hypothesis (4) that the unusual DNA structures adopted by these repeats principally determine their specific functions is gaining strength. We have previously described the unusual hairpin structures (5, 6) adopted by a variety of repetitive DNA sequences. Here, we show by NMR and gel electrophoresis that the individual strands from the fragile X triplet repeats, (GGC)n-(GCC)n, form intramolecular hairpins under physiological conditions. In these hairpins, the number of Watson-Crick G-C pairs is maximized in the stem through the formation of G-G or C C mispairs flanked by G-C pairs (Fig. 1). As shown below, these hairpins provide structural basis for three major phenomena associated with the fragile X syndrome (3, 4): (i) the site-specific fragility, (ii) the amplification of the repeat (especially the preferential expansion of the C-rich strand), and (iii) the hypermethylation of the CpG island adjacent to the fragile X gene, FMRI. MATERIALS AND METHODSGel Electrophoresis. Oligonucleotides were fully denatured by heating at 95°C for 2 min in 5 mM Tris/1 mM EDTA buffer, pH 7.5, containing 5 mM or 200 mM NaCl, followed by incubation at 55°C for 10 min and gradual cooling to roomThe publication costs of this article were defrayed in part by page charge payment. This article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. §1734 solely to indicate this fact. 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 Note that in the hairpins of the G-and C-rich strands, the central mismatched G-G or C-C pair in the stem is surrounded by twoWatson-Crick G-C pairs. IITo whom reprint requests should be addressed. 5199
Studies of the feasibility of a subunit vaccine to protect against human immunodeficiency virus (HIV) infection have principally focused on the third variable (V3) loop. The principal neutralizing determinant (PND) of HIV-1 is located inside the V3 loop of the surface envelope glycoprotein, gp120. However, progress toward a PND-based vaccine has been impeded by the amino acid sequence variability in the V3 loops of different HIV isolates. Theoretical studies revealed that the variability in sequence and structure of the V3 loop is confined to the N- and C-terminal sides of the conserved GPG crest. This leaves three regions of the V3 loop conserved both in sequence and secondary structure. We present the results of NMR studies that test the validity of our theoretical predictions. Structural studies are reported for the HIV-V3 loop (HIV-MN) in the linear and cyclic (S-S-bridged) forms. For the V3 loop sequence of the HIV-MN isolate, the three conserved secondary structural elements are as underlined below: turns turn helix CTRPNYNKRKRIHIGPGRAFYTTKNIIGTIROAHC Finally, the conformational requirement of the PND in the V3 loop-antibody interaction is tested by monitoring the monoclonal antibody binding to the HIV-MN V3 loop in the linear and cyclic forms by enzyme-linked immunosorbent assay. The binding data reveal that the cyclic V3 loop is a better ligand for the monoclonal antibodies than the linear form although the latter has the same sequence. This means that the monoclonal antibodies recognize the PNDs as conformational epitopes.
Human mucins are T or S glycosylated tandem repeat proteins. In breast cancer, mucins become under or unglycosylated. Two-dimensional nuclear magnetic resonance experiments are performed on chemically synthesized mucin tandem repeat polypeptides, (PDTRPAPGST-APPAHGVTSA)n the unglycosylated form for n=1,3 where (APDTR) constitutes the antigenic sites for the antibodies isolated form the tumors in the breast cancer patients. These studies demonstrate how the tandem repeats assemble in space giving rise to the overall tertiary structure, and the local structure and presentation of the antigenic site(APDTR) at the junction of two neighboring repeats. The NMR data reveal repeating knob-like structures connected by extended spacers. The knobs protrude away from the long-axis of Muc-1 and the predominant antigenic site (APDTR) forms the accessible tip of the knob. Multiple tandem repeats enhance the rigidity and presentation of the knob-like structures.
Repetitive DNA sequences, interspersed throughout the human genome, are capable of forming a wide variety of unusual DNA structures with simple and complex loopfolding patterns. The hairpin formed by the fragile X repeat, (CCG)n, and the bipartite triplex formed by the Friedreich's ataxia repeat, (GAA)n/(TTC)n, show simple loopfolding. On the other hand, the doubly folded hairpin formed by the human centromeric repeat, (AATGG)n, the hairpin G-quartet formed by (TTAGGG)n at the 3' telomere overhang, and the hairpin G-quartet, and hairpin C+.C paired i-motif formed by the insulin minisatellite, [formula: see text] show multiple and complex loopfolding. We have performed high resolution nuclear magnetic resonance (NMR) spectroscopy and in vitro replication to show that unique base-pairing and loopfolding render stability to these unusual structures under physiological conditions. The formation of such stable structures offers a mechanism of unwinding which is advantageous during transcription. For example, the formation of the hairpin G-quartet, and hairpin C+.C paired i-motif upstream of the insulin gene may facilitate transcription. These unusual DNA structures also provide unique 'protein recognition motifs' quite different from a Watson-Crick double helix. For example, the hairpin G-quartet formed by (TTAGGG)n at the 3' telomere overhang is specifically recognized and stabilized by the human repair protein, Ku70/Ku80 hetero-dimer, which may be important in the stability of the telomere. However, the formation of the same unusual DNA structures during replication is likely to cause instability in the lengths of the DNA repeats. If the altered (generally expanded) length enhances the probability of the unusual structure during the next cycle of replication, it further increases the instability of the repeat causing a 'dynamic mutation'. In fact, NMR and in vitro replication studies show that the longer the repeat length the higher is the probability of hairpin formation by the fragile X repeat, (CCG)n. In addition, the hairpin of the fragile X repeat, upstream of the FMR-1 gene, is more susceptible to CpG methylation than its duplex thereby leading to methyl-directed suppression of transcription. Thus, the selective advantage of the unusual structures formed by the DNA repeats in the regulation of gene expression may be offset by the genomic instability caused by the same structures during replication. The repeat number is a critical parameter that helps maintain a balance between the advantage gained from an unusual structure during gene expression and the disadvantage posed by the same structure during replication.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.