Improved recognition of native-like protein structures using a family of designed sequences

Koehl, Patrice; Levitt, Michael

doi:10.1073/pnas.022408799

Cited by 15 publications

(9 citation statements)

References 59 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Given a sequence of amino acid residues, homology modeling methods essentially try to align the target sequence to suitable structure templates, stored in protein databases, and build a three-dimensional conformation by using alignment information (see, for example, [14,17,79]). Different alignment methods have been developed, such as BLAST [3], PSI-BLAST [4] and the profile-profile method [41]. The main limitation of the homology modeling methods is that they work effectively only for sequences with at least 30-40% identity.…”

Section: Computational Approaches To Protein Fold Predictionmentioning

confidence: 99%

Computational Methods for Protein Fold Prediction: an Ab-initio Topological Approach

Ceci

Mucherino

D'Apuzzo

et al. 2007

Springer Optimization and Its Applications

View full text Add to dashboard Cite

Summary. The prediction of protein native conformations is still a big challenge in science, although a strong research activity has been carried out on this topic in the last decades. In this chapter we focus on ab-initio computational methods for protein fold predictions that do not rely heavily on comparisons with known protein structures and hence appear to be the most promising methods for determining conformations not yet been observed experimentally. To identify main trends in the research concerning protein fold predictions, we briefly review several ab-initio methods, including a recent topological approach that models the protein conformation as a tube having maximum thickness without any self-contacts. This representation leads to a constrained global optimization problem. We introduce a modification in the tube model to increase the compactness of the computed conformations, and present results of computational experiments devoted to simulating α-helices and all-α proteins. A Metropolis Monte Carlo Simulated Annealing algorithm is used to search the protein conformational space.

show abstract

Section: Computational Approaches To Protein Fold Predictionmentioning

confidence: 99%

Computational Methods for Protein Fold Prediction: an Ab-initio Topological Approach

Ceci

Mucherino

D'Apuzzo

et al. 2007

Springer Optimization and Its Applications

View full text Add to dashboard Cite

show abstract

“…Most computational studies to date have produced designed sequences that tend to resemble the native sequence of the protein structure (Koehl and Levitt 1999b, 2002a; Kuhlman and Baker 2000; Raha et al 2000). This result has generally been attributed to the constraints imposed by using fixed backbones.…”

mentioning

confidence: 99%

Thoroughly sampling sequence space: Large‐scale protein design of structural ensembles

et al. 2002

View full text Add to dashboard Cite

Modeling the inherent flexibility of the protein backbone as part of computational protein design is necessary to capture the behavior of real proteins and is a prerequisite for the accurate exploration of protein sequence space. We present the results of a broad exploration of sequence space, with backbone flexibility, through a novel approach: large-scale protein design to structural ensembles. A distributed computing architecture has allowed us to generate hundreds of thousands of diverse sequences for a set of 253 naturally occurring proteins, allowing exciting insights into the nature of protein sequence space. Designing to a structural ensemble produces a much greater diversity of sequences than previous studies have reported, and homology searches using profiles derived from the designed sequences against the Protein Data Bank show that the relevance and quality of the sequences is not diminished. The designed sequences have greater overall diversity than corresponding natural sequence alignments, and no direct correlations are seen between the diversity of natural sequence alignments and the diversity of the corresponding designed sequences. For structures in the same fold, the sequence entropies of the designed sequences cluster together tightly. This tight clustering of sequence entropies within a fold and the separation of sequence entropy distributions for different folds suggest that the diversity of designed sequences is primarily determined by a structure's overall fold, and that the designability principle postulated from studies of simple models holds in real proteins. This has important implications for experimental protein design and engineering, as well as providing insight into protein evolution.Keywords: Protein design; sequence space; designability; backbone flexibility; distributed computingThe aim of protein design is to find amino acid sequences that are compatible with specific protein structures. Screening of sequences for compatibility with a protein structure was introduced in the early 1980s, with the definition of the inverse folding problem (Pabo 1983). Whereas protein folding involves finding the native three-dimensional structure for a particular amino acid sequence, the inverse folding problem seeks to define the entire set of sequences that can specifically form a stable protein with some target structure. Protein design, whether experimental, computational, or some hybrid approach, provides important clues towards a solution of the inverse protein folding problem by sampling the sequence space of known protein structures (Pande et al. 1997).An important practical use of protein design is in the stabilization of known protein folds (Dahiyat 1999). The optimization schemes used in most protein design algorithms are written to find local or globally optimized sequences, with the lowest or near-lowest free energy of folding for an existing target structure; much recent work has addressed this topic (Desjarlais and Clarke 1998; ShakhReprint requests to: Vijay S. Pande, Ch...

show abstract

“…functional residues | protein family evolution | protein function | protein-protein interfaces | specificity determining positions T he structure of protein families is shaped by the sequence divergence accumulated as a consequence of speciation, gene duplication, and deletion events, as well as by the evolutionary selective pressure exerted on each protein in accordance with the corresponding 3D structure and the specific function performed (1,2). The balance between genomic rearrangements and selective pressure to increase the functional repertoire available to organisms leads to the appearance of new subfamilies in evolutionary time (3).…”

mentioning

confidence: 99%

Protein interactions and ligand binding: From protein subfamilies to functional specificity

Rausell

Juan

Pazos

et al. 2010

Proc. Natl. Acad. Sci. U.S.A.

137

149

View full text Add to dashboard Cite

The divergence accumulated during the evolution of protein families translates into their internal organization as subfamilies, and it is directly reflected in the characteristic patterns of differentially conserved residues. These specifically conserved positions in protein subfamilies are known as "specificity determining positions" (SDPs). Previous studies have limited their analysis to the study of the relationship between these positions and ligandbinding specificity, demonstrating significant yet limited predictive capacity. We have systematically extended this observation to include the role of differential protein interactions in the segregation of protein subfamilies and explored in detail the structural distribution of SDPs at protein interfaces. Our results show the extensive influence of protein interactions in the evolution of protein families and the widespread association of SDPs with protein interfaces. The combined analysis of SDPs in interfaces and ligandbinding sites provides a more complete picture of the organization of protein families, constituting the necessary framework for a large scale analysis of the evolution of protein function.functional residues | protein family evolution | protein function | protein-protein interfaces | specificity determining positions T he structure of protein families is shaped by the sequence divergence accumulated as a consequence of speciation, gene duplication, and deletion events, as well as by the evolutionary selective pressure exerted on each protein in accordance with the corresponding 3D structure and the specific function performed (1, 2). The balance between genomic rearrangements and selective pressure to increase the functional repertoire available to organisms leads to the appearance of new subfamilies in evolutionary time (3).There are many aspects of protein function that contribute to the evolution of the family organization. These may include the global conservation of catalytic mechanisms (in the case of enzymes), specific binding to substrates and cofactors, as well as the interaction with other proteins in processes such as cell signaling, the regulation of reactions and the formation of macromolecular complexes. Interestingly, even though specific protein interactions certainly are an important part of protein function, the organization of protein families in relation to the specific interactions of different subfamilies remains a poorly studied aspect of functional specificity.Multiple sequences alignments (MSAs) provide essential information on the evolution of protein families. The positions in MSAs can be interpreted in terms of the amino acid changes allowed or disallowed during evolution, and therefore useful information at the residue level can be inferred from them (4). The most obvious example is the study of fully conserved positions that pinpoint important residues for the structure and function of the family members (5).A subtler pattern of conservation is represented by the positions differentially conserved within subfamili...

show abstract

Improved recognition of native-like protein structures using a family of designed sequences

Cited by 15 publications

References 59 publications

Computational Methods for Protein Fold Prediction: an Ab-initio Topological Approach

Computational Methods for Protein Fold Prediction: an Ab-initio Topological Approach

Thoroughly sampling sequence space: Large‐scale protein design of structural ensembles

Protein interactions and ligand binding: From protein subfamilies to functional specificity

Contact Info

Product

Resources

About