The InterPro database (https://www.ebi.ac.uk/interpro/) provides an integrative classification of protein sequences into families, and identifies functionally important domains and conserved sites. InterProScan is the underlying software that allows protein and nucleic acid sequences to be searched against InterPro's signatures. Signatures are predictive models which describe protein families, domains or sites, and are provided by multiple databases. InterPro combines signatures representing equivalent families, domains or sites, and provides additional information such as descriptions, literature references and Gene Ontology (GO) terms, to produce a comprehensive resource for protein classification. Founded in 1999, InterPro has become one of the most widely used resources for protein family annotation. Here, we report the status of InterPro (version 81.0) in its 20th year of operation, and its associated software, including updates to database content, the release of a new website and REST API, and performance improvements in InterProScan.
The InterPro database (http://www.ebi.ac.uk/interpro/) classifies protein sequences into families and predicts the presence of functionally important domains and sites. Here, we report recent developments with InterPro (version 70.0) and its associated software, including an 18% growth in the size of the database in terms of new InterPro entries, updates to content, the inclusion of an additional entry type, refined modelling of discontinuous domains, and the development of a new programmatic interface and website. These developments extend and enrich the information provided by InterPro, and provide greater flexibility in terms of data access. We also show that InterPro's sequence coverage has kept pace with the growth of UniProtKB, and discuss how our evaluation of residue coverage may help guide future curation activities.
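The residue coverage mentioned above can be illustrated with a minimal sketch, assuming coverage means the fraction of a protein's residues that fall inside at least one signature match. The function name and the match coordinates are hypothetical and are not taken from InterPro or InterProScan.

```python
from typing import Iterable, Tuple


def residue_coverage(length: int, matches: Iterable[Tuple[int, int]]) -> float:
    """Return the fraction of residues covered by at least one match.

    Assumption: matches are (start, end) pairs, 1-based and inclusive.
    Overlapping or nested matches are counted only once.
    """
    if length <= 0:
        raise ValueError("length must be positive")
    covered = 0
    last_counted = 0  # highest residue position already counted
    for start, end in sorted(matches):
        # Clip to the sequence and skip residues counted by earlier matches.
        start = max(start, 1, last_counted + 1)
        end = min(end, length)
        if end >= start:
            covered += end - start + 1
            last_counted = end
    return covered / length


# Hypothetical protein of 100 residues with three matches, two overlapping.
print(residue_coverage(100, [(1, 10), (5, 20), (50, 60)]))
```

In this example residues 1-20 and 50-60 are covered, so 31 of the 100 residues count towards coverage. Sorting the matches first lets the overlap be handled in a single pass.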
The InterPro database (https://www.ebi.ac.uk/interpro/) provides an integrative classification of protein sequences into families, and identifies functionally important domains and conserved sites. Here, we report recent developments with InterPro (version 90.0) and its associated software, including updates to data content and to the website. These developments extend and enrich the information provided by InterPro, and provide more user-friendly access to the data. Additionally, we have worked on adding Pfam website features to the InterPro website, as the Pfam website will be retired in late 2022. We also show that InterPro's sequence coverage has kept pace with the growth of UniProtKB. Moreover, we report the development of a card game as a method of engaging the non-scientific community. Finally, we discuss the benefits and challenges brought by the use of artificial intelligence for protein structure prediction.
Here, we report a webserver for the improved SDM, used for predicting the effects of mutations on protein stability. As a pioneering knowledge-based approach, SDM has been highlighted as the most appropriate method to use in combination with many other approaches. We have updated the environment-specific amino-acid substitution tables based on the current expanded PDB (a 5-fold increase in information), and introduced new residue-conformation and interaction parameters, including packing density and residue depth. The updated server has been extensively tested using a benchmark containing 2690 point mutations from 132 different protein structures. The revised method correlates well against the hypothetical reverse mutations, better than comparable methods built using machine-learning approaches, highlighting the strength of our knowledge-based approach for identifying stabilising mutations. Given a PDB file (a Protein Data Bank file format containing the 3D coordinates of the protein atoms), and a point mutation, the server calculates the stability difference score between the wildtype and mutant protein. The server is available at http://structure.bioc.cam.ac.uk/sdm2
Membrane attack complex/perforin/cholesterol-dependent cytolysin (MACPF/CDC) proteins constitute a major superfamily of pore-forming proteins that act as bacterial virulence factors and effectors in immune defence. Upon binding to the membrane, they convert from the soluble monomeric form to oligomeric, membrane-inserted pores. Using real-time atomic force microscopy (AFM), electron microscopy (EM), and atomic structure fitting, we have mapped the structure and assembly pathways of a bacterial CDC in unprecedented detail and accuracy, focussing on suilysin from Streptococcus suis. We show that suilysin assembly is a noncooperative process that is terminated before the protein inserts into the membrane. The resulting ring-shaped pores and kinetically trapped arc-shaped assemblies are all seen to perforate the membrane, as also visible by the ejection of its lipids. Membrane insertion requires a concerted conformational change of the monomeric subunits, with a marked expansion in pore diameter due to large changes in subunit structure and packing. DOI: http://dx.doi.org/10.7554/eLife.04247.001
Many viruses are enveloped by a lipid bilayer acquired during assembly, which is typically studded with one or two types of glycoproteins. These viral surface proteins act as the primary interface between the virus and the host. Entry of enveloped viruses relies on specialized fusogen proteins to help merge the virus membrane with the host membrane. In the multicomponent herpesvirus fusion machinery, glycoprotein B (gB) acts as this fusogen. Although the structure of the gB ectodomain postfusion conformation has been determined, any other conformations (e.g., prefusion, intermediate conformations) have so far remained elusive, thus restricting efforts to develop antiviral treatments and prophylactic vaccines. Here, we have characterized the full-length herpes simplex virus 1 gB in a native membrane by displaying it on cell-derived vesicles and using electron cryotomography. Alongside the known postfusion conformation, a novel one was identified. Its structure, in the context of the membrane, was determined by subvolume averaging and found to be trimeric like the postfusion conformation, but appeared more condensed. Hierarchical constrained density-fitting of domains unexpectedly revealed the fusion loops in this conformation to be apart and pointing away from the anchoring membrane. This vital observation is a substantial step forward in understanding the complex herpesvirus fusion mechanism, and opens up new opportunities for more targeted intervention of herpesvirus entry.
Keywords: membrane fusion | class III viral fusion protein | prefusion conformation | electron cryotomography | subvolume averaging
Three-dimensional electron microscopy is currently one of the most promising techniques used to study macromolecular assemblies. Rigid and flexible fitting of atomic models into density maps is often essential to gain further insights into the assemblies they represent. Currently, tools that facilitate the assessment of fitted atomic models and maps are needed. TEMPy (template and electron microscopy comparison using Python) is a toolkit designed for this purpose. The library includes a set of methods to assess density fits in intermediate-to-low resolution maps, both globally and locally. It also provides procedures for single-fit assessment, ensemble generation of fits, clustering, and multiple and consensus scoring, as well as plots and output files for visualization purposes to help the user in analysing rigid and flexible fits. The modular nature of TEMPy helps the integration of scoring and assessment of fits into large pipelines, making it a tool suitable for both novice and expert structural biologists.
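As an illustration of what a global density-fit score can look like, the sketch below computes a mean-subtracted cross-correlation coefficient between two maps sampled on the same grid. This is a generic formula written in plain Python, not TEMPy's API; the function name and the flattened voxel lists are assumptions made for the example.

```python
import math
from typing import Sequence


def cross_correlation(map_a: Sequence[float], map_b: Sequence[float]) -> float:
    """Mean-subtracted cross-correlation coefficient of two density maps.

    Assumption: both maps are flattened voxel lists on the same grid,
    e.g. an experimental map and a map simulated from a fitted model.
    """
    if len(map_a) != len(map_b) or len(map_a) == 0:
        raise ValueError("maps must be non-empty and of equal size")
    mean_a = sum(map_a) / len(map_a)
    mean_b = sum(map_b) / len(map_b)
    numerator = sum((a - mean_a) * (b - mean_b) for a, b in zip(map_a, map_b))
    var_a = sum((a - mean_a) ** 2 for a in map_a)
    var_b = sum((b - mean_b) ** 2 for b in map_b)
    if var_a == 0 or var_b == 0:
        raise ValueError("a constant map has no defined correlation")
    return numerator / math.sqrt(var_a * var_b)


# Hypothetical voxel values: the second map is a scaled, shifted copy of the first.
experimental = [0.0, 0.2, 0.9, 0.4, 0.1]
simulated = [2.0 * v + 1.0 for v in experimental]
print(cross_correlation(experimental, simulated))
```

Because the score subtracts the means and normalises by the variances, it is insensitive to linear rescaling of the density values, which is why the scaled copy scores close to 1. A local variant would apply the same formula to voxels within a mask or window.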
Many essential biological processes including cell regulation and signalling are mediated through the assembly of protein complexes. Changes to protein-protein interaction (PPI) interfaces can affect the formation of multiprotein complexes, and consequently lead to disruptions in interconnected networks of PPIs within and between cells, further leading to phenotypic changes as functional interactions are created or disrupted. Mutations altering PPIs have been linked to the development of genetic diseases including cancer and rare Mendelian diseases, and to the development of drug resistance. The importance of these protein mutations has led to the development of many resources for understanding and predicting their effects. We propose that a better understanding of how these mutations affect the structure, function, and formation of multiprotein complexes provides novel opportunities for tackling them, including the development of small-molecule drugs targeted specifically to mutated PPIs.