Intrinsically disordered proteins (IDPs) constitute a
significant
fraction of eukaryotic proteomes. High-resolution characterization
of IDP conformational ensembles can help elucidate their roles in
a wide range of biological processes but remains challenging both
experimentally and computationally. Here, we present a generic algorithm
to improve the accuracy of coarse-grained IDP models using a diverse
set of experimental measurements. It combines maximum entropy optimization
and least-squares regression to systematically adjust model parameters
and improve the agreement between simulation and experiment. We successfully
applied the algorithm to derive a transferable force field, which
we term the maximum entropy optimized force field (MOFF), for de novo
prediction of IDP structures. Statistical analysis of force field
parameters reveals features of amino acid interactions not captured
by potentials designed to work well for folded proteins. We anticipate
its combination of efficiency and accuracy will make MOFF useful for
studying the phase separation of IDPs, which drives the formation
of various biological compartments.
Many proteins have been shown to function via liquid−liquid phase separation. Computational modeling could offer much needed structural details of protein condensates and reveal the set of molecular interactions that dictate their stability. However, the presence of both ordered and disordered domains in these proteins places a high demand on the model accuracy. Here, we present an algorithm to derive a coarse-grained force field, MOFF, which can model both ordered and disordered proteins with consistent accuracy. It combines maximum entropy biasing, least-squares fitting, and basic principles of energy landscape theory to ensure that MOFF recreates experimental radii of gyration while predicting the folded structures for globular proteins with lower energy. The theta temperature determined from MOFF separates ordered and disordered proteins at 300 K and exhibits a strikingly linear relationship with amino acid sequence composition. We further applied MOFF to study the phase behavior of HP1, an essential protein for posttranslational modification and spatial organization of chromatin. The force field successfully resolved the structural difference of two HP1 homologues despite their high sequence similarity. We carried out large-scale simulations with hundreds of proteins to determine the critical temperature of phase separation and uncover multivalent interactions that stabilize higher-order assemblies. In all, our work makes significant methodological strides to connect theories of ordered and disordered proteins and provides a powerful tool for studying liquid−liquid phase separation with near-atomistic details.
Small-angle X-ray scattering (SAXS) experiments provide valuable structural data for biomolecules in solution. We develop a highly efficient maximum entropy approach to fit SAXS data by introducing minimal biases to a coarse-grained protein force field, the associative memory, water mediated, structure and energy model (AWSEM). We demonstrate that the resulting force field, AWSEM-SAXS, succeeds in reproducing scattering profiles, and models protein structures with shapes that are in much better agreement with experimental results.Quantitative metrics further reveal a modest, but consistent improvement in the accuracy of modeled structures when SAXS data are incorporated into the force field. Additionally, when applied to a multi-conformational protein, we find that AWSEM-SAXS is able to recover the population of different protein conformations from SAXS data alone. We, therefore, conclude that the maximum entropy approach is effective in fine-tuning the force field to better characterize both protein structure and conformational fluctuation.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.