We present the first comprehensive analysis of a diploid human genome that combines single-molecule sequencing with single-molecule genome maps. Our hybrid assembly markedly improves upon the contiguity observed from traditional shotgun sequencing approaches, with scaffold N50 values approaching 30 Mb, and we identified complex structural variants (SVs) missed by other high-throughput approaches. Furthermore, by combining Illumina short-read data with long reads, we phased both single-nucleotide variants and SVs, generating haplotypes with over 99% consistency with previous trio-based studies. Our work shows that it is now possible to integrate single-molecule and high-throughput sequence data to generate de novo assembled genomes that approach reference quality.
A major clinical problem in the use of cisplatin to treat cancers is tumor resistance. DNA polymerase η (Polη) is a key polymerase that allows cancer cells to cope with cisplatin–DNA adducts formed during chemotherapy. We present here a structure of human Polη inserting dCTP opposite a cisplatin intrastrand cross-link (PtGpG). We show that specificity of human Polη for PtGpG derives from an active site that is open to permit Watson-Crick geometry of the nascent PtGpG•dCTP base pair and to accommodate the lesion without steric hindrance. The specificity is augmented by residues Gln38 and Ser62 that interact with PtGpG, and Arg61 that interacts with incoming dCTP. Collectively, the structure provides a basis for understanding how Polη in human cells can tolerate DNA damage caused by cisplatin chemotherapy and offers a framework for the design of inhibitors in cancer therapy.
Here we present PacmonSTR, a reference-based probabilistic approach, to identify the TR region and estimate the number of these TR elements in long DNA reads. We present a multistep approach that requires as input, a reference region and the reference TR element. Initially, the TR region is identified from the long DNA reads via a 3-stage modified Smith-Waterman approach and then, expected number of TR elements is calculated using a pair-Hidden Markov Models-based method. Finally, TR-based genotype selection (or clustering: homozygous/heterozygous) is performed with Gaussian mixture models, using the Akaike information criteria, and coverage expectations.
Pre-steady state kinetic studies on Y-family DNA polymerase η (Polη) have suggested that the polymerase undergoes a rate-limiting conformational change step before the phosphoryl transfer of the incoming nucleotide to the primer terminus. However, the nature of this rate-limiting conformational change step has been unclear, due in part to the lack of structural information on the Polη binary complex. We present here for the first time a crystal structure of human Polη (hPolη) in binary complex with its DNA substrate. We show that the hPolη domains move only slightly on dNTP binding and that the polymerase by and large is pre-aligned for dNTP binding and catalysis. We also show that there is no major reorientation of the DNA from a nonproductive to a productive configuration and that the active site is devoid of metals in the absence of dNTP. Together, these observations lead us to suggest that the rate-limiting conformational change step in the Polη replication cycle likely corresponds to a rate-limiting entry of catalytic metals in the active site.
Restriction enzymes share little or no sequence homology with the exception of isoschizomers, or enzymes that recognize and cleave the same DNA sequence. We present here the structure of a BamHI isoschizomer, OkrAI, bound to the same DNA sequence (TATGGATCCATA) as that cocrystallized with BamHI. We show that OkrAI is a more minimal version of BamHI, lacking not only the N- and C-terminal helices but also an internal 310 helix and containing β-strands that are shorter than those in BamHI. Despite these structural differences, OkrAI recognizes the DNA in a remarkably similar manner to BamHI, including asymmetric contacts via C-terminal ‘arms’ that appear to ‘compete’ for the minor groove. However, the arms are shorter than in BamHI. We observe similar DNA-binding affinities between OkrAI and BamHI but OkrAI has higher star activity (at 37°C) compared to BamHI. Together, the OkrAI and BamHI structures offer a rare opportunity to compare two restriction enzymes that work on exactly the same DNA substrate.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.