A correct alignment is an essential requirement in homology modeling. Yet in order to bridge the structural gap between template and target, which may not only involve loop rearrangements, but also shifts of secondary structure elements and repacking of core residues, high‐resolution refinement methods with full atomic details are needed. Here, we describe four approaches that address this “last mile of the protein folding problem” and have performed well during CASP8, yielding physically realistic models: YASARA, which runs molecular dynamics simulations of models in explicit solvent, using a new partly knowledge‐based all atom force field derived from Amber, whose parameters have been optimized to minimize the damage done to protein crystal structures. The LEE‐SERVER, which makes extensive use of conformational space annealing to create alignments, to help Modeller build physically realistic models while satisfying input restraints from templates and CHARMM stereochemistry, and to remodel the side‐chains. ROSETTA, whose high resolution refinement protocol combines a physically realistic all atom force field with Monte Carlo minimization to allow the large conformational space to be sampled quickly. And finally UNDERTAKER, which creates a pool of candidate models from various templates and then optimizes them with an adaptive genetic algorithm, using a primarily empirical cost function that does not include bond angle, bond length, or other physics‐like terms. Proteins 2009. © 2009 Wiley‐Liss, Inc.
Condensins are key mediators of chromosome condensation across organisms. Like other condensins, the bacterial MukBEF condensin complex consists of an SMC family protein dimer containing two ATPase head domains, MukB, and two interacting subunits, MukE and MukF. We report complete structural views of the intersubunit interactions of this condensin along with ensuing studies that reveal a role for the ATPase activity of MukB. MukE and MukF together form an elongated dimeric frame, and MukF's C-terminal winged-helix domains (C-WHDs) bind MukB heads to constitute closed ring-like structures. Surprisingly, one of the two bound C-WHDs is forced to detach upon ATP-mediated engagement of MukB heads. This detachment reaction depends on the linker segment preceding the C-WHD, and mutations on the linker restrict cell growth. Thus ATP-dependent transient disruption of the MukB-MukF interaction, which creates openings in condensin ring structures, is likely to be a critical feature of the functional mechanism of condensins.
We present the results for CAPRI Round 30, the first joint CASP-CAPRI experiment, which brought together experts from the protein structure prediction and protein-protein docking communities. The Round comprised 25 targets from amongst those submitted for the CASP11 prediction experiment of 2014. The targets included mostly homodimers, a few homotetramers, and two heterodimers, and comprised protein chains that could readily be modeled using templates from the Protein Data Bank. On average 24 CAPRI groups and 7 CASP groups submitted docking predictions for each target, and 12 CAPRI groups per target participated in the CAPRI scoring experiment. In total more than 9500 models were assessed against the 3D structures of the corresponding target complexes. Results show that the prediction of homodimer assemblies by homology modeling techniques and docking calculations is quite successful for targets featuring large enough subunit interfaces to represent stable associations. Targets with ambiguous or inaccurate oligomeric state assignments, often featuring crystal contact-sized interfaces, represented a confounding factor. For those, a much poorer prediction performance was achieved, while nonetheless often providing helpful clues on the correct oligomeric state of the protein. The prediction performance was very poor for genuine tetrameric targets, where the inaccuracy of the homology-built subunit models and the smaller pair-wise interfaces severely limited the ability to derive the correct assembly mode. Our analysis also shows that docking procedures tend to perform better than standard homology modeling techniques and that highly accurate models of the protein components are not always required to identify their association modes with acceptable accuracy.
Many bacteria, including Legionella pneumophila, rely on the type IV secretion system to translocate a repertoire of effector proteins into the hosts for their survival and growth. Type IV coupling protein (T4CP) is a hexameric ATPase that links translocating substrates to the transenvelope secretion conduit. Yet, how a large number of effector proteins are selectively recruited and processed by T4CPs remains enigmatic. DotL, the T4CP of L. pneumophila, contains an ATPase domain and a C-terminal extension whose function is unknown. Unlike T4CPs involved in plasmid DNA translocation, DotL appeared to function by forming a multiprotein complex with four other proteins. Here, we show that the C-terminal extension of DotL interacts with DotN, IcmS, IcmW and an additionally identified subunit LvgA, and that this pentameric assembly binds Legionella effector proteins. We determined the crystal structure of this assembly and built an architecture of the T4CP holocomplex by combining a homology model of the ATPase domain of DotL. The holocomplex is a hexamer of a bipartite structure composed of a membrane-proximal ATPase domain and a membrane-distal substrate-recognition assembly. The presented information demonstrates the architecture and functional dissection of the multiprotein T4CP complexes and provides important insights into their substrate recruitment and processing.
For high‐accuracy template‐based‐modeling of CASP7 targets, we have applied a procedure based on the rigorous optimization of score functions at three stages: multiple alignment, chain building, and side‐chain modeling. We applied the conformational space annealing method to a newly developed consistency based score function for multiple alignment. For chain building, we optimized the MODELLER energy function. For side‐chain modeling, we optimized a SCWRL‐like energy function using a rotamer library constructed specifically for a given target sequence. By rigorous optimization, we have achieved significant improvement in backbone as well as side‐chain modeling for TBM and TBM/HA targets. For most TBM/HA targets (17/26), the predicted model was more accurate than the model one can construct from the best template in a posteriori fashion. It appears that the current method can extract relevant information out of multiple templates. Proteins 2007. © 2007 Wiley‐Liss, Inc.
We present a method to predict the solvent accessibility of proteins which is based on a nearest neighbor method applied to the sequence profiles. Using the method, continuous real-value prediction as well as two-state and three-state discrete predictions can be obtained. The method utilizes the z-score value of the distance measure in the feature vector space to estimate the relative contribution among the k-nearest neighbors for prediction of the discrete and continuous solvent accessibility. The Solvent accessibility database is constructed from 5717 proteins extracted from PISCES culling server with the cutoff of 25% sequence identities. Using optimal parameters, the prediction accuracies (for discrete predictions) of 78.38% (two-state prediction with the threshold of 25%), 65.1% (three-state prediction with the thresholds of 9 and 36%), and the Pearson correlation coefficient (between the predicted and true RSA's for continuous prediction) of 0.676 are achieved An independent benchmark test was performed with the CASP8 targets where we find that the proposed method outperforms existing methods. The prediction accuracies are 80.89% (for two state prediction with the threshold of 25%), 67.58% (three-state prediction), and the Pearson correlation coefficient of 0.727 (for continuous prediction) with mean absolute error of 0.148. We have also investigated the effect of increasing database sizes on the prediction accuracy, where additional improvement in the accuracy is observed as the database size increases. The SANN web server is available at http://lee.kias.re.kr/~newton/sann/.
In the template-based modeling (TBM) category of CASP10 experiment, we introduced a new protocol called protein modeling system (PMS) to generate accurate protein structures in terms of side-chains as well as backbone trace. In the new protocol, a global optimization algorithm, called conformational space annealing (CSA), is applied to the three layers of TBM procedure: multiple sequence-structure alignment, 3D chain building, and side-chain re-modeling. For 3D chain building, we developed a new energy function which includes new distance restraint terms of Lorentzian type (derived from multiple templates), and new energy terms that combine (physical) energy terms such as dynamic fragment assembly (DFA) energy, DFIRE statistical potential energy, hydrogen bonding term, etc. These physical energy terms are expected to guide the structure modeling especially for loop regions where no template structures are available. In addition, we developed a new quality assessment method based on random forest machine learning algorithm to screen templates, multiple alignments, and final models. For TBM targets of CASP10, we find that, due to the combination of three stages of CSA global optimizations and quality assessment, the modeling accuracy of PMS improves at each additional stage of the protocol. It is especially noteworthy that the side-chains of the final PMS models are far more accurate than the models in the intermediate steps.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.