Peptide-protein interactions are very prevalent, mediating key processes such as signal transduction and protein trafficking. How can peptides overcome the entropic cost involved in switching from an unstructured, flexible peptide to a rigid, well-defined bound structure? A structure-based analysis of peptide-protein interactions unravels that most peptides do not induce conformational changes on their partner upon binding, thus minimizing the entropic cost of binding. Furthermore, peptides display interfaces that are better packed than protein-protein interfaces and contain significantly more hydrogen bonds, mainly those involving the peptide backbone. Additionally, "hot spot" residues contribute most of the binding energy. Finally, peptides tend to bind in the largest pockets available on the protein surface. Our study is based on peptiDB, a new and comprehensive data set of 103 high-resolution peptide-protein complex structures. In addition to improved understanding of peptide-protein interactions, our findings have direct implications for the structural modeling, design, and manipulation of these interactions.
Understanding emotion expressed in language has a wide range of applications, from building empathetic chatbots to detecting harmful online behavior. Advancement in this area can be improved using large-scale datasets with a fine-grained typology, adaptable to multiple downstream tasks. We introduce GoEmotions, the largest manually annotated dataset of 58k English Reddit comments, labeled for 27 emotion categories or Neutral. We demonstrate the high quality of the annotations via Principal Preserved Component Analysis. We conduct transfer learning experiments with existing emotion benchmarks to show that our dataset generalizes well to other domains and different emotion taxonomies. Our BERTbased model achieves an average F1-score of .46 across our proposed taxonomy, leaving much room for improvement. 1 * Work done while at Google Research.
In this study, we assess on a large scale the possibility of deriving self-inhibitory peptides from protein domains with globular architectures. Such inhibitory peptides would inhibit interactions of their origin domain by mimicking its mode of binding to cognate partners, and could serve as promising leads for rational design of inhibitory drugs. For our large-scale analysis, we analyzed short linear segments that were cut out of protein interfaces in-silico in complex structures of protein-protein docking Benchmark 3.0 and CAPRI targets from rounds 1-19. Our results suggest that more than 50% of these globular interactions are dominated by one short linear segment at the domain interface, which provides more than half of the original interaction energy. Importantly, in many cases the derived peptides show strong energetic preference for their original binding mode independently of the context of their original domain, as we demonstrate by extensive computational peptide docking experiments. As an in-depth case study, we computationally design a candidate peptide to inhibit the EphB4-EphrinB2 interaction based on a short peptide derived from the G-H loop in EphrinB2. Altogether, we provide an elaborate framework for the in-silico selection of candidate inhibitory molecules for protein-protein interactions. Such candidate molecules can be readily subjected to wet-lab experiments and provide highly promising starting points for subsequent drug design.
Peptide-mediated interactions, in which a short linear motif binds to a globular domain, play major roles in cellular regulation. An accurate structural model of this type of interaction is an excellent starting point for the characterization of the binding specificity of a given peptide-binding domain. A number of different protocols have recently been proposed for the accurate modeling of peptide-protein complex structures, given the structure of the protein receptor and the binding site on its surface. When no information about the peptide binding site(s) is a priori available, there is a need for new approaches to locate peptide-binding sites on the protein surface. While several approaches have been proposed for the general identification of ligand binding sites, peptides show very specific binding characteristics, and therefore, there is a need for robust and accurate approaches that are optimized for the prediction of peptide-binding sites. Here we present PeptiMap, a protocol for the accurate mapping of peptide binding sites on protein structures. Our method is based on experimental evidence that peptide-binding sites also bind small organic molecules of various shapes and polarity. Using an adaptation of ab initio ligand binding site prediction based on fragment mapping (FTmap), we optimize a protocol that specifically takes into account peptide binding site characteristics. In a high-quality curated set of peptide-protein complex structures PeptiMap identifies for most the accurate site of peptide binding among the top ranked predictions. We anticipate that this protocol will significantly increase the number of accurate structural models of peptide-mediated interactions.
Question answering (Q&A) communities have been gaining popularity in the past few years. The success of such sites depends mainly on the contribution of a small number of expert users who provide a significant portion of the helpful answers, and so identifying users that have the potential of becoming strong contributers is an important task for owners of such communities.We present a study of the popular Q&A website StackOverflow (SO), in which users ask and answer questions about software development, algorithms, math and other technical topics. The dataset includes information on 3.5 million questions and 6.9 million answers created by 1.3 million users in the years 2008-2012. Participation in activities on the site (such as asking and answering questions) earns users reputation, which is an indicator of the value of that user to the site.We describe an analysis of the SO reputation system, and the participation patterns of high and low reputation users. The contributions of very high reputation users to the site indicate that they are the primary source of answers, and especially of high quality answers. Interestingly, we find that while the majority of questions on the site are asked by low reputation users, on average a high reputation user asks more questions than a user with low reputation. We consider a number of graph analysis methods for detecting influential and anomalous users in the underlying user interaction network, and find they are effective in detecting extreme behaviors such as those of spam users. Lastly, we show an application of our analysis: by considering user contributions over first months of activity on the site, we predict who will become influential long-term contributors.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.