Complex transcriptional behaviors are encoded in the DNA sequences of gene regulatory regions. Advances in our understanding of these behaviors have been gained recently by quantitative models that describe how molecules such as transcription factors and nucleosomes interact with the genomic sequence. An emerging view is that every regulatory sequence is associated with a unique binding affinity landscape for each molecule and, consequently, with a unique set of molecule binding configurations and transcriptional outputs. We present a quantitative framework based on existing methods that unifies these ideas. This framework explains many experimental observations regarding the binding patterns of factors and nucleosomes, and the dynamics of transcriptional activation. It can also be used to model more complex phenomena such as transcriptional noise and the evolution of transcriptional regulation.Many cellular and organismal processes depend on the establishment of complex patterns of gene expression at precise times and spatial locations, with inaccuracies in carrying out such transcriptional programs often being deleterious and leading to disease. The information for directing such expression patterns is encoded in regulatory DNA sequences -for example, reporter genes attached directly to such regulatory sequences adopt the expression pattern of the endogenous gene 1,2,3 , and DNA binding and gene expression patterns of an entire human chromosome are essentially unchanged in mice that carry this human chromosome 4 .Given the centrality of transcriptional programs to many biological processes, a predictive and quantitative understanding of the transcriptional behaviors encoded by DNA sequences is highly desirable. Such an understanding would allow us to go beyond merely identifying the transcription factors and regulatory DNA elements that are involved, and replace the existing qualitative and phenomenological descriptions by a mechanistic view of the process that integrates the involved components into physically realistic mechanistic models. Indeed, our ability to quantitatively predict the behavior of a regulatory system is a useful objective measure of the extent to which we truly understand how the system works. At a more practical level, the ability to accurately predict transcriptional behaviors from DNA sequences should allow us to predict the effect that sequence variation among individuals in the population has on gene expression and thus on more complex phenotypes and disease; and it would allow for improved rational design of transgenes for biotechnology and gene therapy.
NIH Public Access
Author ManuscriptNat Rev Genet. Author manuscript; available in PMC 2009 August 3.
Published in final edited form as:Nat Rev Genet. 2009 July ; 10(7): 443-456. doi:10.1038/nrg2591.
NIH-PA Author ManuscriptNIH-PA Author Manuscript
NIH-PA Author ManuscriptRecent work has significantly advanced our understanding of how genomic sequences are translated into transcriptional outputs. Progress has been made possib...