Proteins can exist in a trinity of structures: the ordered state, the molten globule and the random coil. Five examples follow which suggest that native protein structure can correspond to any of the three states (not just the ordered state) and that protein function can arise from any of the three states and their transitions. 1. In a process that likely mimics infection, fd phage converts from the ordered into the disordered molten globular state. 2. Nucleosome hyperacetylation is crucial to DNA replication and transcription; this chemical modification greatly increases the net negative charge of the nucleosome core particle. We propose that the increased charge imbalance promotes its conversion to a much less rigid form. 3. Clusterin contains an ordered domain and also a native molten globular region. The molten globular domain likely functions as a proteinaceous detergent for cell remodeling and removal of apoptotic debris. 4. In a critical signaling event, a helix in calcineurin becomes bound and surrounded by calmodulin, thereby turning on calcineurin's serine/threonine phosphatase activity.Locating the calcineurin helix within a region of disorder is essential for enabling calmodulin to surround its target upon binding. 5. Calsequestrin regulates calcium levels in the sarcoplasmic reticulum by binding about 50 ions/molecule. Disordered polyanion tails at the carboxy terminus bind many of these calcium ions, perhaps without adopting a unique structure. In addition to these examples, 16 more proteins with native disorder will be discussed. These disordered regions include molecular recognition domains, protein folding inhibitors, flexible linkers, entropic springs, entropic clocks and entropic bristles.Motivated by such examples of intrinsic disorder, we are studying the relationships between amino acid sequence and order/disorder, and from this information we are predicting intrinsic order/disorder from amino acid sequence.The sequence/structure relationships indicate that disorder is an encoded property, and the predictions strongly suggest that proteins in nature are much richer in intrinsic disorder than are those in the Protein Data Bank. Recent predictions on 29 genomes indicate that proteins from eucaryotes apparently have more intrinsic disorder than those from either bacteria or archaea, with typically > 30 % of eucaryotic proteins having disordered regions of length = 50 consecutive residues.
A method for rapid sequencing of intact proteins simultaneously from the N and C termini (1-2 s) with online chromatography is described and applied to the characterization of histone H3.1 posttranslational modifications and the identification of an additional member of the H2A gene family. Proteins are converted to gas-phase multiply charged positive ions by electrospray ionization and then allowed to react with fluoranthene radical anions. Electron transfer to the multiply charged protein promotes random dissociation of the NOC␣ bonds of the protein backbone. Multiply charged fragment ions are then deprotonated in a second ion͞ion reaction with the carboxylate anion of benzoic acid. The m͞z values for the resulting singly and doubly charged ions are used to read a sequence of 15-40 aa at both the N and C termini of the protein. This information, with the measured mass of the intact protein, is used to search protein or nucleotide databases for possible matches, detect posttranslational modifications, and determine possible splice variants.electron transfer dissociation ͉ fragmentation ͉ ion trap ͉ ion͞ion reactions ͉ top down P erhaps one of the most influential concepts in protein mass spectrometry has been the notion of enzymatic protein digestion to render a collection of peptides of suitable size for conventional tandem mass spectrometry, i.e., bottom-up proteomics. Doubtless this methodology has enabled significant progress for global protein identification (1, 2); however, many investigators now realize this approach has significant limitations (3). First, protein posttranslational modifications (PTMs) on multidomain proteins often work in concert; to determine their biological relevance, these patterns must be detected within the context of one another (across the whole protein). And second, mRNA editing (alternative splicing) has recently gained credit for significantly increasing the protein repertoire of complex organisms, up to three-fourths of all human genes have at least one variant (4-6). Thus, the use of short peptides as proxy markers for genes is inadequate and often misleading (7).To detect these biological events, several laboratories are now pursuing mass spectrometry-based methods to analyze whole proteins. Intact proteins are directly dissociated, either by electron capture (ECD) or collision activation (CAD), and the products are measured with the high resolving power of a Fourier transform ion cyclotron resonance mass spectrometer (FT-ICR-MS) (3,(8)(9)(10)(11)(12). McLuckey and coworkers (13-15) have used ion trap-based instrumentation to collisionally activate intact protein ions followed by spectral simplification with ion͞ion charge reduction reactions. In either case, the intact protein mass is measured while the product ions are used for sequencing and locating sites of modification. Unfortunately, mainstream implementation of this ''top-down''-type analysis is not routine because CAD typically generates only a handful of fragments (cleavage is directed at a few weak linkages, mak...
SUMMARY The mammalian sperm genome is thought to lack substantial information to regulate future expression after fertilization. Here we show that most promoters in mouse sperm are flanked by well-positioned nucleosomes marked by active histone modifications. Analysis of these modifications suggests that many enhancers and super-enhancers functional in embryonic and adult tissues are already specified in sperm. The sperm genome is bound by CTCF and cohesin at sites that are also present in round spermatids and embryonic stem cells (ESCs). These sites mediate interactions that organize the sperm genome into domains and compartments that overlap extensively with those found in mESCs. These results suggest that sperm carry a rich source of regulatory information, encoded in part by its three-dimensional folding specified by CTCF and cohesin. This information may contribute to future expression during embryonic and adult life, suggesting mechanisms by which environmental effects on the paternal germline are transmitted trans-generationally.
Histone variants are non-allelic protein isoforms that play key roles in diversifying chromatin structure. The known number of such variants has greatly increased in recent years, but the lack of naming conventions for them has led to a variety of naming styles, multiple synonyms and misleading homographs that obscure variant relationships and complicate database searches. We propose here a unified nomenclature for variants of all five classes of histones that uses consistent but flexible naming conventions to produce names that are informative and readily searchable. The nomenclature builds on historical usage and incorporates phylogenetic relationships, which are strong predictors of structure and function. A key feature is the consistent use of punctuation to represent phylogenetic divergence, making explicit the relationships among variant subtypes that have previously been implicit or unclear. We recommend that by default new histone variants be named with organism-specific paralog-number suffixes that lack phylogenetic implication, while letter suffixes be reserved for structurally distinct clades of variants. For clarity and searchability, we encourage the use of descriptors that are separate from the phylogeny-based variant name to indicate developmental and other properties of variants that may be independent of structure.
A homogeneous oligonucleosome complex was prepared by reconstitution of highly hyperacetylated histone octamers onto a linear DNA template consisting of 12 tandemly arranged 208-base pair fragments of the 5 S rRNA gene from the sea urchin Lytechinus variegatus. The ionic strength-dependent folding of this oligonucleosome assembly was monitored by sedimentation velocity and electron microscopy. Both types of analysis indicate that under ionic conditions resembling those found in the physiological range and in the absence of histone H1, the acetylated oligonucleosome complexes remain in an extended conformation in contrast to their nonacetylated counterparts. The implications of this finding in the context of a multistate model of chromatin folding (Hansen, J. C., and Ausio, J. (1992) TIBS 197, 187-191) as well as its biological relevance are discussed.
Using the method of salt dialysis, we have reconstituted histone octamers onto DNA templates consisting of 12 tandem repeats, each containing a fragment of the sea urchin 5S rRNA gene [Simpson, R.T., Thoma, F., & Brubaker, J.M. (1985) Cell 42, 799-808]. In these templates, each sea urchin repeat contains a sequence for preferred nucleosome positioning. Sedimentation velocity and sedimentation equilibrium studies in the analytical ultracentrifuge indicate that at molar histone/DNA ratios of 1.0-1.1 extremely homogeneous preparations of fully loaded oligonucleosomes (12 nucleosomes/template) can be regularly obtained. Digestion of the oligonucleosomes with micrococcal nuclease, followed by restriction mapping of purified nucleosome-bound DNA sequences, yields a complicated but consistent pattern of nucleosome positioning. Roughly 50% of the nucleosomes appear to be phased at positions 1-146 of each repeat, while the remainder of the nucleosomes occupy a number of other minor discrete positions along the template that differ by multiples of 10 bp. From sedimentation velocity studies of the oligonucleosomes in 0-0.2 M NaCl, we observe a reversible increase in mean sedimentation coefficient by almost 30%, accompanied by development of heterogeneity in sedimentation. These results, in combination with theoretical predictions, indicate that linear stretches of chromatin in the absence of lysine-rich histones exist in solution in a salt-dependent equilibrium between an extended (low salt) conformation and one or more folded (high salt) structures. In addition, by 100 mM NaCl, salt-dependent dissociation of histone octamers from these linear oligonucleosomes is observed.
In which taxa did H1 linker histones appear in the course of evolution? Detailed comparative analysis of the histone H1 and histone H1-related sequences available to date suggests that the origin of histone H1 can be traced to bacteria. The data also reveal that the sequence corresponding to the 'winged helix' motif of the globular structural domain, a domain characteristic of all metazoan histone H1 molecules, is evolutionarily conserved and appears separately in several divergent lines of protists. Some protists, however, appear to have only a lysine-rich basic protein, which has compositional similarity to some of the histone H1-like proteins from eubacteria and to the carboxy-terminal domain of the H1 linker histones from animals and plants. No lysine-rich basic proteins have been described in archaebacteria. The data presented in this review provide the surprising conclusion that whereas DNA-condensing H1-related histones may have arisen early in evolution in eubacteria, the appearance of the sequence motif corresponding to the globular domain of metazoan H1s occurred much later in the protists, after and independently of the appearance of the chromosomal core histones in archaebacteria.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.