The recently developed TASSER (Threading/ASSembly/Refinement) method is applied to predict the tertiary structures of all CASP6 targets. TASSER is a hierarchical approach that consists of template identification by the threading program PROSPECTOR_3, followed by tertiary structure assembly via rearranging continuous template fragments. Assembly occurs using parallel hyperbolic Monte Carlo sampling under the guide of an optimized, reduced force field that includes knowledge-based statistical potentials and spatial restraints extracted from threading alignments. Models are automatically selected from the Monte Carlo trajectories in the low-temperature replicas using the clustering program SPICKER. For all 90 CASP targets/domains, PROSPECTOR_3 generates initial alignments with an average root-mean-square deviation (RMSD) to native of 8.4 A with 79% coverage. After TASSER reassembly, the average RMSD decreases to 5.4 A over the same aligned residues; the overall cumulative TM-score increases from 39.44 to 52.53. Despite significant improvements over the PROSPECTOR_3 template alignment observed in all target categories, the overall quality of the final models is essentially dictated by the quality of threading templates: The average TM-scores of TASSER models in the three categories are, respectively, 0.79 [comparative modeling (CM), 43 targets/domains], 0.47 [fold recognition (FR), 37 targets/domains], and 0.30 [new fold (NF), 10 targets/domains]. This highlights the need to develop novel (or improved) approaches to identify very distant targets as well as better NF algorithms.
The size and origin of the protein fold universe is of fundamental and practical importance. Analyzing randomly generated, compact sticky homopolypeptide conformations constructed in generic simplified and all-atom protein models, all have similar folds in the library of solved structures, the Protein Data Bank, and conversely, all compact, single-domain protein structures in the Protein Data Bank have structural analogues in the compact model set. Thus, both sets are highly likely complete, with the protein fold universe arising from compact conformations of hydrogen-bonded, secondary structures. Because side chains are represented by their C  atoms, these results also suggest that the observed protein folds are insensitive to the details of side-chain packing. Sequence specificity enters both in fine-tuning the structure and thermodynamically stabilizing a given fold with respect to the set of alternatives. Scanning the models against a three-dimensional active-site library, close geometric matches are frequently found. Thus, the presence of active-site-like geometries also seems to be a consequence of the packing of compact, secondary structural elements. These results have significant implications for the evolution of protein structure and function.evolution ͉ Protein Data Bank ͉ protein folding ͉ protein structure prediction P rotein structures represent very interesting systems in that they result from both physical chemical principles (1) and the evolutionary selection for protein function (2). Focusing on the tertiary structures adopted by protein domains (roughly defined as independent folding units) (3), a number of key questions must be addressed. How large is the protein fold universe (4-6)? Is it essentially infinite, or is there a limited repertoire of single-domain topologies such that at some point, the library of solved protein structures in the Protein Data Bank (PDB) (7) would be sufficiently complete that the likelihood of finding a new fold is minimal? If the number of folds is finite, how complete is the current PDB library (6,8,9)? That is, how likely is it that a given protein, whose structure is currently unknown, will have an already-solved structural analogue? The answer to these questions is not only of intrinsic interest, but has practical applications to structural genomics target selection strategies (5, 10). More generally, can the set of protein folds and its degree of completeness be understood on the basis of general physical chemical principles, or is it very dependent on the details of protein stereochemistry and evolutionary history (11)?In recent work that builds on the other studies (8, 12, 13), we suggested that the library of single-domain proteins already found in the PDB is essentially complete in the sense that single-domain PDB structures provide a set of structures from which any other single-domain protein can be modeled (9,14). By using sensitive structural alignment algorithms that assess the structural similarity of two protein structures, even when proteins be...
Ferredoxin-NADP+ (oxido)reductase (EC 1.18.1.2, FNR) is an FAD-containing enzyme that catalyzes the reversible electron transfer between NADP(H) and electron carrier proteins such as ferredoxin and flavodoxin. Isoforms of this flavoprotein are present in chloroplasts, mitochondria, and bacteria in which they participate in a wide variety of redox metabolic pathways. Although ferredoxin-NADP+ reductases have been thoroughly investigated and their properties reviewed on several occasions, considerable advances in the understanding of these flavoenzymes have occurred in the last few years, including the characterization of cDNA and genomic clones encoding FNR proteins from plants, algae, vertebrates, and bacteria, determination of the atomic structure of a plant FNR at high resolution, and the expression of functional reductases in microorganisms like Escherichia coli and Saccharomyces cerevisiae. The aim of this article is to summarize information gained through these recent developments, including the phylogenetic relationships among ferredoxin reductases and the key structural features of the plant FNR family. Other aspects such as the catalytic mechanism of FNR and the molecular events underlying biogenesis, intracellular sorting, folding, and holoenzyme assembly of this important flavoenzyme are also discussed in some detail. Ferredoxin-NADP+ reductases display several outstanding properties that make them excellent model proteins to address broad biological questions.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.