Estimation of gene number in mammals is difficult due to the high proportion of noncoding DNA within the nucleus. In this study, we provide a direct measurement of the number of genes in human and mouse. We have taken advantage of the fact that many mammalian genes are associated with CpG islands whose distinctive properties allow their physical separation from bulk DNA. Our results suggest that there are =45,000 CpG islands per haploid genome in humans and 37,000 in the mouse. Sequence comparison confirms that about 20% of the human CpG islands are absent from the homologous mouse genes. Analysis of a selection of genes suggests that both human and mouse are losing CpG islands over evolutionary time due to de novo methylation in the germ line followed by CpG loss through mutation. This process appears to be more rapid in rodents. Combining the number of CpG islands with the proportion of island-associated genes, we estimate that the total number of genes per haploid genome is 40,000 in both organisms. (see refs. 4-6 for reviews). CpG islands constitute a distinctive fraction of the genome because, unlike bulk DNA, they are nonmethylated and contain the dinucleotide CpG at its expected frequency. Another notable property of CpG islands is that their G+C content is significantly higher than that of non-island DNA. This facilitates their identification even in cloned DNA, where the native methylation pattern has been erased (7).In this study, we have exploited these properties to separate CpG islands from the rest of the genome and determineThe publication costs of this article were defrayed in part by page charge payment. This article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. §1734 solely to indicate this fact. their absolute number in human and mouse. A previous approach to the number of CpG islands was carried out in mouse by quantitation of end-labeled restriction fragments generated upon digestion of total genomic DNA with the methyl-sensitive restriction endonuclease Hpa II (8). An approximate figure of30,000 CpG islands per haploid genome was suggested. Our results show significant differences between mouse and human that are relevant to our understanding of the origin and maintenance of CpG islands.Because not all genes have CpG islands, the total number of genes cannot be deduced directly from their number. We have taken the study further by establishing the proportion of genes that are CpG island-associated. Combining the number of CpG islands per genome and the percentage of CpG island-associated genes, we obtain a direct estimate of the total number of genes in human and mouse. MATERIALS AND METHODSCell Culture Conditions. The human lymphoblastoid PES cell line (9) was grown in RPMI 1640 medium containing 10% tryptose phosphate broth and supplemented with 10% fetal bovine serum. The mouse embryo stem-cell line EFC-1 (10) was grown in Glasgow minimal essential medium supplemented with 10%o fetal bovine serum, 0.1 mM 2-mercaptoethanol, and 1 mM nonessential am...
The INK4/ARF locus encodes three tumour suppressors (p15(INK4b), ARF and p16(INK4a)) and is among the most frequently inactivated loci in human cancer. However, little is known about the mechanisms that govern the expression of this locus. Here we have identified a putative DNA replication origin at the INK4/ARF locus that assembles a multiprotein complex containing Cdc6, Orc2 and MCMs, and that coincides with a conserved noncoding DNA element (regulatory domain RD(INK4/ARF)). Targeted and localized RNA-interference-induced heterochromatinization of RD(INK4/ARF) results in transcriptional repression of the locus, revealing that RD(INK4/ARF) is a relevant transcriptional regulatory element. Cdc6 is overexpressed in human cancers, where it might have roles in addition to DNA replication. We have found that high levels of Cdc6 result in RD(INK4/ARF)-dependent transcriptional repression, recruitment of histone deacetylases and heterochromatinization of the INK4/ARF locus, and a concomitant decrease in the expression of the three tumour suppressors encoded by this locus. This mechanism is reminiscent of the silencing of the mating-type HM loci in yeast by replication factors. Consistent with its ability to repress the INK4/ARF locus, Cdc6 has cellular immortalization activity and neoplastic transformation capacity in cooperation with oncogenic Ras. Furthermore, human lung carcinomas with high levels of Cdc6 are associated with low levels of p16(INK4a). We conclude that aberrant expression of Cdc6 is oncogenic by directly repressing the INK4/ARF locus through the RD(INK4/ARF) element.
Heterochromatin comprises tightly compacted repetitive regions of eukaryotic chromosomes. The inheritance of heterochromatin through mitosis requires RNA interference (RNAi), which guides histone modification 1 during the DNA replication phase of the cell cycle2. Here, we show that the alternating arrangement of origins of replication and non-coding RNA in pericentromeric heterochromatin results in competition between transcription and replication. Co-transcriptional RNAi releases RNA polymerase II (PolII), allowing completion of DNA replication by the leading strand DNA polymerase, and associated histone modifying enzymes3 which spread heterochromatin with the replication fork. In the absence of RNAi, stalled forks are repaired by homologous recombination without histone modification.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.