Many proteins function as homooligomers and are regulated via their oligomeric state. For some proteins, the stoichiometry of homooligomeric states under various conditions has been studied using gel filtration or analytical ultracentrifugation experiments. The interfaces involved in these assemblies may be identified using crosslinking and mass spectrometry, solution-state NMR, and other experiments. But for most proteins, the actual interfaces that are involved in oligomerization are inferred from X-ray crystallographic structures using assumptions about interface surface areas and physical properties. Examination of interfaces across different PDB entries in a protein family reveals several important features. First, similarity of space group, asymmetric unit size, and cell dimensions and angles (within 1%) does not guarantee that two crystals are actually the same crystal form, that is containing similar relative orientations and interactions within the crystal. Conversely, two crystals in different space groups may be quite similar in terms of all of the interfaces within each crystal. Second, NMR structures and an existing benchmark of PDB crystallographic entries consisting of 126 dimers and larger structures and 132 monomers was used to determine whether the existence or lack of existence of common interfaces across multiple crystal forms can be used to predict whether a protein is an oligomer or not. Monomeric proteins tend to have common interfaces across only a minority of crystal forms, while higher order structures exhibit common interfaces across a majority of available crystal forms. The data can be used to estimate the probability that an interface is biological if two or more crystal forms are available. Finally, the PISA database available from the EBI is more consistent in identifying interfaces observed in many crystal forms than is the PDB or EBI's Protein Quaternary Server (PQS). The PDB in particular is missing highly likely biological interfaces in its biological unit files for about 10% of PDB entries.
Classification of the structures of the complementarity determining regions (CDRs) of antibodies is critically important for antibody structure prediction and computational design. We have previously performed a clustering of antibody CDR conformations and defined a systematic nomenclature consisting of the CDR, length and an integer starting from the largest to the smallest cluster in the data set (e.g. L1-11-1). We present PyIgClassify (for Python-based immunoglobulin classification; available at http://dunbrack2.fccc.edu/pyigclassify/), a database and web server that provides access to assignments of all CDR structures in the PDB to our classification system. The database includes assignments to the IMGT germline V regions for heavy and light chains for several species. For humanized antibodies, the assignment of the frameworks is to human germlines and the CDRs to the germlines of mice or other species sources. The database can be searched by PDB entry, cluster identifier and IMGT germline group (e.g. human IGHV1). The entire database is downloadable so that users may filter the data as needed for antibody structure analysis, prediction and design.
The protein common interface database (ProtCID) is a database that contains clusters of similar homodimeric and heterodimeric interfaces observed in multiple crystal forms (CFs). Such interfaces, especially of homologous but non-identical proteins, have been associated with biologically relevant interactions. In ProtCID, protein chains in the protein data bank (PDB) are grouped based on their PFAM domain architectures. For a single PFAM architecture, all the dimers present in each CF are constructed and compared with those in other CFs that contain the same domain architecture. Interfaces occurring in two or more CFs comprise an interface cluster in the database. The same process is used to compare heterodimers of chains with different domain architectures. By examining interfaces that are shared by many homologous proteins in different CFs, we find that the PDB and the Protein Interfaces, Surfaces, and Assemblies (PISA) are not always consistent in their annotations of biological assemblies in a homologous family. Our data therefore provide an independent check on publicly available annotations of the structures of biological interactions for PDB entries. Common interfaces may also be useful in studies of protein evolution. Coordinates for all interfaces in a cluster are downloadable for further analysis. ProtCiD is available at http://dunbrack2.fccc.edu/protcid.
The Pfam assignment data in PDBfam are available at http://dunbrack2.fccc.edu/ProtCid/PDBfam, which can be searched by PDB codes and Pfam identifiers. They will be updated regularly.
Structural information on the interactions of proteins with other molecules is plentiful, and for some proteins and protein families, there may be 100s of available structures. It can be very difficult for a scientist who is not trained in structural bioinformatics to access this information comprehensively. Previously, we developed the Protein Common Interface Database (ProtCID), which provided clusters of the interfaces of full-length protein chains as a means of identifying biological assemblies. Because proteins consist of domains that act as modular functional units, we have extended the analysis in ProtCID to the individual domain level. This has greatly increased the number of large protein-protein clusters in ProtCID, enabling the generation of hypotheses on the structures of biological assemblies of many systems. The analysis of domain families allows us to extend ProtCID to the interactions of domains with peptides, nucleic acids, and ligands. ProtCID provides complete annotations and coordinate sets for every cluster.
Protein kinase autophosphorylation is a common regulatory mechanism in cell signaling pathways. Crystal structures of several homomeric protein kinase complexes have a serine, threonine, or tyrosine autophosphorylation site of one kinase monomer located in the active site of another monomer, a structural complex that we call an “autophosphorylation complex.” We developed and applied a structural bioinformatics method to identify all such autophosphorylation kinase complexes in X-ray crystallographic structures in the Protein Data Bank (PDB). We identified 15 autophosphorylation complexes in the PDB, of which 5 complexes had not previously been described in the publications describing the crystal structures. These 5 consist of tyrosine residues in the N-terminal juxtamembrane regions of colony stimulating factor 1 receptor (CSF1R, Tyr561) and EPH receptor A2 (EPHA2, Tyr594), tyrosine residues in the activation loops of the SRC kinase family member LCK (Tyr394) and insulin-like growth factor 1 receptor (IGF1R, Tyr1166), and a serine in a nuclear localization signal region of CDC-like kinase 2 (CLK2, Ser142). Mutations in the complex interface may alter autophosphorylation activity and contribute to disease; therefore we mutated residues in the autophosphorylation complex interface of LCK and found that two mutations impaired autophosphorylation (T445V and N446A) and mutation of Pro447 to Ala, Gly, or Leu increased autophosphorylation. The identified autophosphorylation sites are conserved in many kinases, suggesting that, by homology, these complexes may provide insight into autophosphorylation complex interfaces of kinases that are relevant drug targets.
Protein phosphorylation is a reversible post-translation modification essential in cell signaling. This study addresses a long-standing question as to how the most abundant serine/threonine Protein Phosphatase 2 (PP2A) holoenzyme, PP2A/B55α, specifically recognizes substrates and presents them to the enzyme active site. Here, we show how the PP2A regulatory subunit B55α recruits p107, a pRB-related tumor suppressor and B55α substrate. Using molecular and cellular approaches, we identified a conserved region 1 (R1, residues 615-626) encompassing the strongest p107 binding site. This enabled us to identify an 'HxRVxxV619-625' short linear motif (SLiM) in p107 as necessary for B55α binding and dephosphorylation of the proximal pSer-615 in vitro and in cells. Numerous B55α/PP2A substrates, including TAU, contain a related SLiM C-terminal from a proximal phosphosite, 'p[ST]-P-x(4,10)-[RK]-V-x-x-[VI]-R'. Mutation of conserved SLiM residues in TAU dramatically inhibits dephosphorylation by PP2A/B55α, validating its generality. A data-guided computational model details the interaction of residues from the conserved p107 SLiM, the B55α groove, and phosphosite presentation. Altogether these data provide key insights into PP2A/B55α mechanisms of substrate recruitment and active site engagement, and also facilitate identification and validation of new substrates, a key step towards understanding PP2A/B55α role in multiple cellular processes.
Native agarose gel electrophoresis-based particle gel assay has been commonly used for examination of hepatitis B virus (HBV) capsid assembly and pregenomic RNA encapsidation in HBV replicating cells. Interestingly, treatment of cells with several chemotypes of HBV core protein allosteric modulators (CpAMs) induced the assembly of both empty and DNA-containing capsids with faster electrophoresis mobility. In an effort to determine the physical basis of CpAM-induced capsid mobility shift, we found that the surface charge, but not the size, of capsids is the primary determinant of electrophoresis mobility. Specifically, through alanine scanning mutagenesis analysis of twenty-seven charged amino acids in core protein assembly domain and hinge region, we showed that except for K7 and E8, substitution of glutamine acid (E) or aspartic acid (D) on the surface of capsids reduced their mobility, but substitution of lysine (K) or arginine (R) on the surface of capsids increased their mobility in variable degrees. However, alanine substitution of the charged amino acids that are not exposed on the surface of capsid did not apparently alter capsid mobility. Hence, CpAM-induced electrophoresis mobility shift of capsids may reflect the global alteration of capsid structure that changes the exposure and/or ionization of charged amino acid side chains of core protein. Our findings imply that CpAM inhibition of pgRNA encapsidation is possibly due to the assembly of structurally altered nucleocapsids. Practically, capsid electrophoresis mobility shift is a diagnostic marker of compounds that target core protein assembly and predicts sensitivity of HBV strains to specific CpAMs.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.