DNA methylation is a covalent modification of the nucleotide cytosine that is stably inherited at the dinucleotide CpG by somatic cells, and 70% of CpG dinucleotides in the genome are methylated. The exception to this pattern of methylation are CpG islands, CpG-rich sequences that are protected from methylation, and generally are thought to be methylated only on the inactive X-chromosome and in tumors, as well as differentially methylated regions (DMRs) in the vicinity of imprinted genes. To identify chromosomal regions that might harbor imprinted genes, we devised a strategy for isolating a library of normally methylated CpG islands. Most of the methylated CpG islands represented high copy number dispersed repeats. However, 62 unique clones in the library were characterized, all of which were methylated and GC-rich, with a GC content >50%. Of these, 43 clones also showed a CpG obs /CpG exp >0.6, of which 30 were studied in detail. These unique methylated CpG islands mapped to 23 chromosomal regions, and 12 were differentially methylated regions in uniparental tissues of germline origin, i.e., hydatidiform moles (paternal origin) and complete ovarian teratomas (maternal origin), even though many apparently were methylated in somatic tissues. We term these sequences gDMRs, for germline differentially methylated regions. At least two gDMRs mapped near imprinted genes, HYMA1 and a novel homolog of Elongin A and Elongin A2, which we term Elongin A3. Surprisingly, 18 of the methylated CpG islands were methylated in germline tissues of both parental origins, representing a previously uncharacterized class of normally methylated CpG islands in the genome, and which we term similarly methylated regions (SMRs). These SMRs, in contrast to the gDMRs, were significantly associated with telomeric band locations (P = .0008), suggesting a potential role for SMRs in chromosome organization. At least 10 of the methylated CpG islands were on average 85% conserved between mouse and human. These sequences will provide a valuable resource in the search for novel imprinted genes, for defining the molecular substrates of the normal methylome, and for identifying novel targets for mammalian chromatin formation.
Background: Gene annotation is a pivotal component in computational genomics, encompassing prediction of gene function, expression analysis, and sequence scrutiny. Hence, quantitative measures of the annotation landscape constitute a pertinent bioinformatics tool. GeneCards ® is a gene-centric compendium of rich annotative information for over 50,000 human gene entries, building upon 68 data sources, including Gene Ontology (GO), pathways, interactions, phenotypes, publications and many more.
Retroposed copies (RPCs) of genes are functional (intronless paralogs) or nonfunctional (processed pseudogenes) copies derived from mRNA through a process of retrotransposition. Previous studies found that gene families involved in mRNA translation or nuclear function were more likely to have large numbers of RPCs. Here we characterize RPCs of the few families coding for the abundant high-mobility-group (HMG) proteins in humans. Using an algorithm we developed, we identified and studied 219 HMG RPCs. For slightly more than 10% of these RPCs, we found evidence indicating expression. Furthermore, eight of these are potentially new members of the HMG families of proteins. For three RPCs, the evidence indicated expression as part of other transcripts; in all of these, we found the presence of alternative splicing or multiple polyadenylation signals. RPC distribution among the HMGs was not even, with 33-65 each for HMGB1, HMGB3, HMGN1, and HMGN2, and 0-6 each for HMGA1, HMGA2, HMGB2, and HMGN3. Analysis of the sequences flanking the RPCs revealed that the junction between the target site duplications and the 5Ј-flanking sequences exhibited the same TT/AAAA consensus found for the L1 endonuclease, supporting an L1-mediated retrotransposition mechanism. Finally, because our algorithm included aligning RPC flanking sequences with the corresponding HMG genomic sequence, we were able to identify transcribed regions of HMG genes that were not part of the published mRNA sequences.
The pathophysiologic role of the Philadelphia chromosome translocation in chronic myelogenous leukemia (CML) has been known for nearly 20 years. However, the most significant morbidity and mortality in CML are caused by progression to blast crisis, about which comparatively little is known at the molecular level. Genomic imprinting is a chromosomal modification leading to parental-origin–specific gene expression in somatic cells. Recently, we and others have described loss of imprinting (LOI) of the insulin-like growth factor-II gene (IGF2), leading to biallelic rather than monoallelic expression in a wide variety of solid tumors. We have now examined the imprinting status of IGF2 in samples from CML patients in stable phase, accelerated phase, and blast crisis. Five of six stable-phase patients showed normal imprinting, but LOI was found in all six cases of advanced disease (three accelerated phase, three blast crisis), which was statistically highly significant (P < .01). Thus, LOI represents a novel type of genetic alteration in CML that appears to be specifically associated with disease progression.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.