The Database of Protein Disorder (DisProt, URL: www.disprot.org) has been significantly updated and upgraded since its last major renewal in 2007. The current release holds information on more than 800 entries of IDPs/IDRs, i.e. intrinsically disordered proteins or regions that exist and function without a well-defined three-dimensional structure. We have re-curated previous entries to purge DisProt from conflicting cases, and also upgraded the functional classification scheme to reflect continuous advance in the field in the past 10 years or so. We define IDPs as proteins that are disordered along their entire sequence, i.e. entirely lack structural elements, and IDRs as regions that are at least five consecutive residues without well-defined structure. We base our assessment of disorder strictly on experimental evidence, such as X-ray crystallography and nuclear magnetic resonance (primary techniques) and a broad range of other experimental approaches (secondary techniques). Confident and ambiguous annotations are highlighted separately. DisProt 7.0 presents classified knowledge regarding the experimental characterization and functional annotations of IDPs/IDRs, and is intended to provide an invaluable resource for the research community for a better understanding structural disorder and for developing better computational tools for studying disordered proteins.
The Database of Intrinsically Disordered Proteins (DisProt, URL: https://disprot.org) is the major repository of manually curated annotations of intrinsically disordered proteins and regions from the literature. We report here recent updates of DisProt version 9, including a restyled web interface, refactored Intrinsically Disordered Proteins Ontology (IDPO), improvements in the curation process and significant content growth of around 30%. Higher quality and consistency of annotations is provided by a newly implemented reviewing process and training of curators. The increased curation capacity is fostered by the integration of DisProt with APICURON, a dedicated resource for the proper attribution and recognition of biocuration efforts. Better interoperability is provided through the adoption of the Minimum Information About Disorder (MIADE) standard, an active collaboration with the Gene Ontology (GO) and Evidence and Conclusion Ontology (ECO) consortia and the support of the ELIXIR infrastructure.
Androgen receptor (AR) is a nuclear hormone receptor that regulates the transcription of genes involved in the development of testis, prostate and the nervous system. Misregulation of AR is a major driver of prostate cancer (PC). The primary agonist of full‐length AR is testosterone, whereas its splice variants, for example, AR‐v7 implicated in cancer may lack a ligand‐binding domain and are thus devoid of proper hormonal control. Recently, it was demonstrated that full‐length AR, but not AR‐v7, can undergo liquid–liquid phase separation (LLPS) in a cellular model of PC. In a detailed bioinformatics and deletion analysis, we have analyzed which AR region is responsible for LLPS. We found that its DNA‐binding domain (DBD) can bind RNA and can undergo RNA‐dependent LLPS. RNA regulates its LLPS in a reentrant manner, that is, it has an inhibitory effect at higher concentrations. As RNA binds DBD more weakly than DNA, while both RNA and DNA localizes into AR droplets, its LLPS depends on the relative concentration of the two nucleic acids. The region immediately preceding DBD has no effect on the LLPS propensity of AR, whereas the functional part of its long N‐terminal disordered transactivation domain termed activation function 1 (AF1) inhibits AR‐v7 phase separation. We suggest that the resulting diminished LLPS tendency of AR‐v7 may contribute to the misregulation of the transcription function of AR in prostate cancer.
Wide‐line 1H NMR measurements were extended and all results were interpreted in a thermodynamics‐based new approach on aqueous solutions of thymosin‐β4 (Tβ4), stabilin cytoplasmic domain (CTD), and their 1 : 1 complex. Energy distributions of potential barriers controlling the motion of protein‐bound water molecules were determined. Heterogeneous and homogeneous regions were found in the protein‐water interface. The measure of heterogeneity of this interface gives quantitative value for the portion of disordered parts in the protein. Ordered structural elements were found extending up to ∼20 % of the individual whole proteins. About 40 % of the binding sites of free Tβ4 get involved in bonds holding the complex together. The complex has the most heterogeneous solvent accessible surface (SAS) in terms of protein‐water interactions. The complex is more disordered than Tβ4 or stabilin CTD. The greater SAS area of the complex is interpreted as a clear sign of its open structure.
Histone lysine methyltransferases (HKMTs), catalyze mono-, di- and trimethylation of lysine residues, resulting in a regulatory pattern that controls gene expression. Their involvement in many different cellular processes and diseases makes HKMTs an intensively studied protein group, but scientific interest so far has been concentrated mostly on their catalytic domains. In this work we set out to analyze the structural heterogeneity of human HKMTs and found that many contain long intrinsically disordered regions (IDRs) that are conserved through vertebrate species. Our predictions show that these IDRs contain several linear motifs and conserved putative binding sites that harbor cancer-related SNPs. Although there are only limited data available in the literature, some of the predicted binding regions overlap with interacting segments identified experimentally. The importance of a disordered binding site is illustrated through the example of the ternary complex between MLL1, menin and LEDGF/p75. Our suggestion is that intrinsic protein disorder plays an as yet unrecognized role in epigenetic regulation, which needs to be further elucidated through structural and functional studies aimed specifically at the disordered regions of HKMTs.Reviewers: This article was reviewed by Arne Elofsson and Piotr Zielenkiewicz.Electronic supplementary materialThe online version of this article (doi:10.1186/s13062-016-0129-2) contains supplementary material, which is available to authorized users.
Heterogeneous nuclear ribonucleoproteins (hnRNP) function in RNA processing, have RNA-recognition motifs (RRMs) and intrinsically disordered, low-complexity domains (LCDs). While RRMs are drivers of RNA binding, there is only limited knowledge about the RNA interaction by the LCD of some hnRNPs. Here, we show that the LCD of hnRNPA2 interacts with RNA via an embedded Tyr/Gly-rich region which is a disordered RNA-binding motif. RNA binding is maintained upon mutating tyrosine residues to phenylalanines, but abrogated by mutating to alanines, thus we term the RNA-binding region ‘F/YGG motif’. The F/YGG motif can bind a broad range of structured (e.g. tRNA) and disordered (e.g. polyA) RNAs, but not rRNA. As the F/YGG otif can also interact with DNA, we consider it a general nucleic acid-binding motif. hnRNPA2 LCD can form dense droplets, by liquid–liquid phase separation (LLPS). Their formation is inhibited by RNA binding, which is mitigated by salt and 1,6-hexanediol, suggesting that both electrostatic and hydrophobic interactions feature in the F/YGG motif. The D290V mutant also binds RNA, which interferes with both LLPS and aggregation thereof. We found homologous regions in a broad range of RNA- and DNA-binding proteins in the human proteome, suggesting that the F/YGG motif is a general nucleic acid-interaction motif.
Wide-line H NMR measurements were extended and all results were reinterpreted in a new thermodynamics-based approach to study aqueous solutions of thymosin-β (Tβ ), stabilin C-terminal domain (CTD) and their 1:1 complex. The energy distributions of the potential barriers, which control motion of protein-bound water molecules, were determined. Heterogeneous and homogeneous regions were found at the protein-water interface. The measure of heterogeneity gives a quantitative value for the portion of disordered parts in the protein. Ordered structural elements were found extending up to 20 % of the whole proteins. About 40 % of the binding sites of free Tβ become involved in bonds holding the complex together. The complex has the most heterogeneous solvent accessible surface (SAS) in terms of protein-water interactions. The complex is more disordered than Tβ or stabilin CTD. The greater SAS area of the complex is interpreted as a clear sign of its open structure.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.