Rapid recognition of DNA target sites involves facilitated diffusion through which alternative sites are searched on genomic DNA. A key mechanism facilitating the localization of the target by a DNA-binding protein (DBP) is one-dimensional diffusion (sliding) in which electrostatic forces attract the protein to the DNA. As the protein reaches its target DNA site, it switches from purely electrostatic binding to a specific set of interactions with the DNA bases that also involves hydrogen bonding and van der Waals forces. High overlap between the DBP patches used for nonspecific and specific interactions with DNA may enable an immediate transition between the two binding modes following target site localization. By contrast, an imperfect overlap may result in greater frustration between the two potentially competing binding modes and consequently slower switching between them. A structural analysis of 125 DBPs indicates frustration between the two binding modes that results in a large difference between the orientations of the protein to the DNA when it slides compared to when it specifically interacts with DNA. Coarse-grained molecular dynamics simulations of in silico designed peptides comprising the full range of frustrations between the two interfaces show slower transition from nonspecific to specific DNA binding as the overlap between the patches involved in the two binding modes decreases. The complex search kinetics may regulate the search by eliminating trapping of the protein in semispecific sites while sliding.
DNA recognition by DNA-binding proteins (DBPs), which is a pivotal event in most gene regulatory processes, is often preceded by an extensive search for the correct site. A facilitated diffusion process in which a DBP combines three-dimensional diffusion in solution with one-dimensional sliding along DNA has been suggested to explain how proteins can locate their target sites on DNA much faster than predicted by three-dimensional diffusion alone. Although experimental and theoretical studies have recently advanced understanding of the biophysical principles underlying the search mechanism, the process under in vivo cellular conditions is poorly understood. In this study, we used various computational approaches to explore how the presence of obstacle proteins on the DNA influences search efficiency. At a low obstacle occupancy (i.e., when few obstacles occupy sites on the DNA), sliding by the searching DBP may be confined, which may impair search efficiency. The obstacles, however, can be bypassed during hopping events, and the number of bypasses is larger for higher obstacle occupancies. Dynamism on the part of the obstacles may even further facilitate search kinetics. Our study shows that the nature and efficiency of the search process may be governed not only by the intrinsic properties of the DBP and the salt concentration of the medium, but also by the in vivo association of DNA with other macromolecular obstacles, their location, and occupancy.
A fundamental step in gene-regulatory activities, such as repression, transcription, and recombination, is the binding of regulatory DNA-binding proteins (DBPs) to specific targets in the genome. To rapidly localize their regulatory genomic sites, DBPs reduce the dimensionality of the search space by combining three-dimensional (3D) diffusion in solution with one-dimensional (1D) sliding along DNA. However, the requirement to form a thermodynamically stable protein-DNA complex at the cognate genomic target sequence imposes a challenge on the protein because, as it navigates one-dimensionally along the genome, it may come in close contact with sites that share partial or even complete sequence similarity with the functional DNA sequence. This puzzling issue creates a conflict between two basic requirements: finding the cognate site quickly and stably binding it. Here, we structurally assessed the interface adopted by a variety of DBPs to bind DNA specifically and nonspecifically, and found that many DBPs utilize one interface to specifically recognize a DNA sequence and another to assist in propagating along the DNA through nonspecific associations. While these two interfaces overlap each other in some proteins, they present partial overlap in others and frustrate the protein-DNA interface. Using coarse-grained molecular dynamics simulations, we demonstrate that the existence of frustration in DBPs is a compromise between rapid 1D diffusion along other regions in the genome (high frustration smoothens the landscape for sliding) and rapid formation of a stable and essentially active protein-DNA complex (low frustration reduces the free energy barrier for switching between the two binding modes).
Evolutionary changes in cis-regulatory elements are thought to play a key role in morphological and physiological diversity across animals. Many conserved noncoding elements (CNEs) function as cis-regulatory elements, controlling gene expression levels in different biological contexts. However, determining specific associations between CNEs and related phenotypes is a challenging task. Here, we present a computational "reverse genomics" approach that predicts the phenotypic functions of human CNEs. We identify thousands of human CNEs that were lost in at least two independent mammalian lineages (IL-CNEs), and match their evolutionary profiles against a diverse set of phenotypes recently annotated across multiple mammalian species. We identify 2,759 compelling associations between human CNEs and a diverse set of mammalian phenotypes. We discuss multiple CNEs, including a predicted ear element near BMP7, a pelvic CNE in FBN1, a brain morphology element in UBE4B, and an aquatic adaptation forelimb CNE near EGR2, and provide a full list of our predictions. As more genomes are sequenced and more traits are annotated across species, we expect our method to facilitate the interpretation of noncoding mutations in human disease and expedite the discovery of individual CNEs that play key roles in human evolution and development.
We focus on dimeric DNA-binding proteins from two well-studied families: orthodox type II restriction endonucleases (REs) and transcription factors (TFs). Interactions of the protein's recognition sites with the DNA and, particularly, the contribution of each of the monomers to one-dimensional (1D) sliding along nonspecific DNA were studied using computational tools. Coarse-grained molecular dynamics simulations of DNA scanning by various TFs and REs provide insights into how the symmetry of a homodimer can be broken while they nonspecifically interact with DNA. The characteristics of protein sliding along DNA, such as the average sliding length, partitioning between 1D and 3D search, and the one-dimensional diffusion coefficient D1, strongly depend on the salt concentration, which in turn affects the probability of the two monomers adopting a cooperative symmetric sliding mechanism. Indeed, we demonstrate that maximal DNA search efficiency is achieved when the protein adopts an asymmetric search mode in which one monomer slides while its partner hops. We find that proteins classified as TFs have a higher affinity for the DNA, longer sliding lengths, and an increased probability of symmetric sliding in comparison with REs. Moreover, TFs can perform their biological function over a much wider range of salt concentrations than REs. Our results demonstrate that the different biological functions of DNA-binding proteins are related to the different nonspecific DNA search mechanisms they adopt.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.