Despite intense investigation, human replication origins and termini remain elusive. Existing data have shown strong discrepancies. Here we sequenced highly purified Okazaki fragments from two cell types and, for the first time, quantitated replication fork directionality and delineated initiation and termination zones genome-wide. Replication initiates stochastically, primarily within non-transcribed, broad (up to 150 kb) zones that often abut transcribed genes, and terminates dispersively between them. Replication fork progression is significantly co-oriented with the transcription. Initiation and termination zones are frequently contiguous, sometimes separated by regions of unidirectional replication. Initiation zones are enriched in open chromatin and enhancer marks, even when not flanked by genes, and often border ‘topologically associating domains' (TADs). Initiation zones are enriched in origin recognition complex (ORC)-binding sites and better align to origins previously mapped using bubble-trap than λ-exonuclease. This novel panorama of replication reveals how chromatin and transcription modulate the initiation process to create cell-type-specific replication programs.
In early Xenopus embryos, in which ribosomal RNA genes (rDNA) are not transcribed, rDNA replication initiates and terminates at 9- to 12-kilobase pair intervals, with no detectable dependence on specific DNA sequences. Resumption of ribosomal RNA (rRNA) synthesis at late blastula and early gastrula is accompanied by a specific repression of replication initiation within transcription units; the frequency of initiation within intergenic spacers remains as high as in early blastula. These results demonstrate that for rRNA genes, circumscribed zones of replication initiation emerge in intergenic DNA during the time in metazoan development when the chromatin is remodeled to allow gene transcription.
Neutral nucleotide substitutions occur at varying rates along genomes, and it remains a major issue to unravel the mechanisms that cause these variations and to analyze their evolutionary consequences. Here, we study the role of replication in the neutral substitution pattern. We obtained a high-resolution replication timing profile of the whole human genome by massively parallel sequencing of nascent BrdU-labeled replicating DNA. These data were compared to the neutral substitution rates along the human genome, obtained by aligning human and chimpanzee genomes using macaque and orangutan as outgroups. All substitution rates increase monotonously with replication timing even after controlling for local or regional nucleotide composition, crossover rate, distance to telomeres, and chromatin compaction. The increase in non-CpG substitution rates might result from several mechanisms including the increase in mutationprone activities or the decrease in efficiency of DNA repair during the S phase. In contrast, the rate of C / T transitions in CpG dinucleotides increases in later-replicating regions due to increasing DNA methylation level that reflects a negative correlation between timing and gene expression. Similar results are observed in the mouse, which indicates that replication timing is a main factor affecting nucleotide substitution dynamics at non-CpG sites and constitutes a major neutral process driving mammalian genome evolution.
Replication origins, fragile sites, and rDNA have been implicated as sources of chromosomal instability. However, the defining genomic features of replication origins and fragile sites are among the least understood elements of eukaryote genomes. Here, we map sites of replication initiation and breakage in primary cells at high resolution. We find that replication initiates between transcribed genes within nucleosome-depleted structures established by long asymmetrical poly(dA:dT) tracts flanking the initiation site. Paradoxically, long (>20 bp) (dA:dT) tracts are also preferential sites of polar replication fork stalling and collapse within early-replicating fragile sites (ERFSs) and late-replicating common fragile sites (CFSs) and at the rDNA replication fork barrier. Poly(dA:dT) sequences are fragile because long single-strand poly(dA) stretches at the replication fork are unprotected by the replication protein A (RPA). We propose that the evolutionary expansion of poly(dA:dT) tracts in eukaryotic genomes promotes replication initiation, but at the cost of chromosome fragility.
In higher eukaryotes, replication program specification in different cell types remains to be fully understood. We show for seven human cell lines that about half of the genome is divided in domains that display a characteristic U-shaped replication timing profile with early initiation zones at borders and late replication at centers. Significant overlap is observed between U-domains of different cell lines and also with germline replication domains exhibiting a N-shaped nucleotide compositional skew. From the demonstration that the average fork polarity is directly reflected by both the compositional skew and the derivative of the replication timing profile, we argue that the fact that this derivative displays a N-shape in U-domains sustains the existence of large-scale gradients of replication fork polarity in somatic and germline cells. Analysis of chromatin interaction (Hi-C) and chromatin marker data reveals that U-domains correspond to high-order chromatin structural units. We discuss possible models for replication origin activation within U/N-domains. The compartmentalization of the genome into replication U/N-domains provides new insights on the organization of the replication program in the human genome.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.