Whole-genome sequencing enables complete characterization of genetic variation, but geographic clustering of rare alleles demands many diverse populations be studied. Here we describe the Genome of the Netherlands (GoNL) Project, in which we sequenced the whole genomes of 250 Dutch parent-offspring families and constructed a haplotype map of 20.4 million single-nucleotide variants and 1.2 million insertions and deletions. The intermediate coverage (∼13×) and trio design enabled extensive characterization of structural variation, including midsize events (30-500 bp) previously poorly catalogued and de novo mutations. We demonstrate that the quality of the haplotypes boosts imputation accuracy in independent samples, especially for lower frequency alleles. Population genetic analyses demonstrate fine-scale structure across the country and support multiple ancient migrations, consistent with historical changes in sea level and flooding. The GoNL Project illustrates how single-population whole-genome sequencing can provide detailed characterization of genetic variation and may guide the design of future population studies.
We propose two new double-hybrid functionals, denoted B2K-PLYP and mPW2K-PLYP, which yield thermochemical performance comparable to existing double-hybrid functionals but offer superior performance for barrier heights of various kinds. We show that the new functionals yield excellent performance for all of the following: (a) main-group thermochemistry; (b) main-group thermochemical kinetics; (c) late transition metal reactions. In addition, B2K-PLYP performs well for weak interactions.
Intrinsically disordered regions, terminal tails, and flexible linkers are abundant in DNA-binding proteins and play a crucial role by increasing the affinity and specificity of DNA binding. Disordered tails often undergo a disorder-to-order transition during interactions with DNA and improve both the kinetics and thermodynamics of specific DNA binding. The DNA search by proteins that interact nonspecifically with DNA can be supported by disordered tails as well. The disordered tail may increase the overall protein-DNA interface and thus increase the affinity of the protein to the DNA and its sliding propensity while slowing linear diffusion. The exact effect of the disordered tails on the sliding rate depends on the degree of positive charge clustering, as has been shown for homeodomains and p53 transcription factors. The disordered tails, which may be viewed as DNA recognizing subdomains, can facilitate intersegment transfer events that occur via a "monkey bar" mechanism in which the domains bridge two different DNA fragments simultaneously. The "monkey bar" mechanism can be facilitated by internal disordered linkers in multidomain proteins that mediate the cross-talks between the constituent domains and especially their brachiation dynamics and thus their overall capability to search DNA efficiently. The residue sequence of the disordered tails has unique characteristics that were evolutionarily selected to achieve the optimized function that is unique to each protein. Perturbation of the electrostatic characteristics of the disordered tails by post-translational modifications, such as acetylation and phosphorylation, may affect protein affinity to DNA and therefore can serve to regulate DNA recognition. Modifying the disordered protein tails or the flexibility of the inter-domain linkers of multidomain proteins may affect the cross-talk between the constituent domains so as to facilitate the search kinetics of non-specific DNA sequences and increase affinity to the specific sequences.
Egr-1 is an inducible transcription factor that recognizes 9-bp target DNA sites via three zinc finger domains and activates genes in response to cellular stimuli such as synaptic signals and vascular stresses. Using spectroscopic and computational approaches, we have studied structural, dynamic, and kinetic aspects of the DNAscanning process in which Egr-1 is nonspecifically bound to DNA and perpetually changes its location on DNA. Our NMR data indicate that Egr-1 undergoes highly dynamic domain motions when scanning DNA. In particular, the zinc finger 1 (ZF1) of Egr-1 in the nonspecific complex is mainly dissociated from DNA and undergoes collective motions on a nanosecond timescale, whereas zinc fingers 2 and 3 (ZF2 and ZF3, respectively) are bound to DNA. This was totally unexpected because the previous crystallographic studies of the specific complex indicated that all of Egr-1's three zinc fingers are equally involved in binding to a target DNA site. Mutations that are expected to enhance ZF1's interactions with DNA and with ZF2 were found to reduce ZF1's domain motions in the nonspecific complex suggesting that these interactions dictate the dynamic behavior of ZF1. By experiment and computation, we have also investigated kinetics of Egr-1's translocation between two nonspecific DNA duplexes. Our data on the wild type and mutant proteins suggest that the domain dynamics facilitate Egr-1's intersegment transfer that involves transient bridging of two DNA sites. These results shed light on asymmetrical roles of the zinc finger domains for Egr-1 to scan DNA efficiently in the nucleus.NMR spectroscopy | target search process | interdomain dynamics | protein-DNA interactions | simulation I n cellular responses to various stimuli such as signals and stresses, gene regulation by transcription factors is of fundamental importance. Egr-1 (also known as Zif268) is an inducible transcription factor with crucial roles particularly in the brain and cardiovascular systems in mammals. In the brain, Egr-1 is induced by synaptic signals in an activity-dependent manner and activates genes for long-term memory formation and consolidation (1, 2). In the cardiovascular system, Egr-1 is a stress-inducible transcription factor that activates the genes for initiating defense responses against vascular stress and injury (3, 4). Given the short lifetime of induced Egr-1 (typically ∼2 h) (3), rapid gene activation by Egr-1 is important in these biological processes that require an immediate response to the stimuli.The induced Egr-1 protein has to initiate its role by searching for its target DNA sites among billions of DNA base pairs in the nucleus. In the DNA scanning process, transcription factors need to discriminate their target sites from nonspecific sites based on relatively minor differences in DNA structure and sequence. Crystallographic studies demonstrated that Egr-1 recognizes its 9-bp target sequence, GCGTGGGCG, as a monomer via zinc finger domains 1, 2, and 3 (hereafter referred to as ZF1, ZF2, and ZF3) that contact 3 ...
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.