Comparative modeling predicts the three-dimensional structure of a given protein sequence (target) based primarily on its alignment to one or more proteins of known structure (templates). The prediction process consists of fold assignment, target-template alignment, model building, and model evaluation. The number of protein sequences that can be modeled and the accuracy of the predictions are increasing steadily because of the growth in the number of known protein structures and because of the improvements in the modeling software. Further advances are necessary in recognizing weak sequence-structure similarities, aligning sequences with structures, modeling of rigid body shifts, distortions, loops and side chains, as well as detecting errors in a model. Despite these problems, it is currently possible to model with useful accuracy significant parts of approximately one third of all known protein sequences. The use of individual comparative models in biology is already rewarding and increasingly widespread. A major new challenge for comparative modeling is the integration of it with the torrents of data from genome sequencing projects as well as from functional and structural genomics. In particular, there is a need to develop an automated, rapid, robust, sensitive, and accurate comparative modeling pipeline applicable to whole genomes. Such large-scale modeling is likely to encourage new kinds of applications for the many resulting models, based on their large number and completeness at the level of the family, organism, or functional network.
The following resources for comparative protein structure modeling and analysis are described (http://salilab.org): MODELLER, a program for comparative modeling by satisfaction of spatial restraints; MODWEB, a web server for automated comparative modeling that relies on PSI-BLAST, IMPALA and MODELLER; MODLOOP, a web server for automated loop modeling that relies on MODELLER; MOULDER, a CPU intensive protocol of MODWEB for building comparative models based on distant known structures; MODBASE, a comprehensive database of annotated comparative models for all sequences detectably related to a known structure; MODVIEW, a Netscape plugin for Linux that integrates viewing of multiple sequences and structures; and SNPWEB, a web server for structure-based prediction of the functional impact of a single amino acid substitution.
E-selectin is the inducible adhesion protein on the surface of endothelial cells which has a crucial role in the initial stages of recruitment of leucocytes to sites of inflammation. In addition, it is almost certainly involved in tumor cell adhesion and metastasis. This report is concerned with identification of a new class of oligosaccharide ligand--sulfate-containing--for the human E-selectin molecule from among oligosaccharides on an ovarian cystadenoma glycoprotein. This has been achieved by application of the neoglycolipid technology to oligosaccharides released from the glycoprotein by mild alkaline beta-elimination. Oligosaccharides were conjugated to lipid, resolved by thin-layer chromatography, and tested for binding by Chinese hamster ovary cells which had been transfected to express the full-length E-selectin molecule. Several components with strong E-selectin binding activity were revealed among acidic oligosaccharides. The smallest among these was identified by liquid secondary ion mass spectrometric analysis of the neoglycolipid, in conjunction with methylation analysis of the purified oligosaccharide preparation as an equimolar mixture of the Le(a)- and Le(x)/SSEA-1-type fucotetrasaccharides sulfated at position 3 of outer galactose: [formula: see text] To our knowledge this is the first report of a sulfofucooligosaccharide ligand for E-selectin. The binding activity is substantially greater than those of lipid-linked Le(a) and Le(x)/SSEA-1 sequences and is at least equal to that of the 3'-sialyl-Le(x)/SSEA-1 glycolipid analogue.
The nun gene product of prophage HK022 excludes phage A infection by blocking the expression of genes downstream from the A nut sequence. Gene expression in phage A and other lambdoid phages is temporally regulated by a mechanism of transcription termination and antitermination (1-5). The expression of the early A operons requires the suppression of transcription termination signals by a sequence-specific RNA-binding proiein, the 107-aa product of the A N gene. The N protein stabilizes transcription elongation by interacting with A nut RNA and RNA polymerase (RNAP) in association with host Nus factors (2-9). The A nut site is divisible into two parts, boxA and boxB. The boxA sequence is 10 nt long, and boxB is a 15-nt stem-loop structure. The boxA sequence lies a few nucleotides upstream of boxB. NusB and NusE (S10) specifically interact with boxA (10). The N protein specifically interacts with the loop sequence and one face of the stem sequence of boxB (11).The nun product of coliphage HK022 is a 109-aa protein that excludes superinfecting A by provoking transcription termination at or just distal to the A nut sites (12-15). Nun termination mimics N antitermination in its requirements for cis-acting sites and host factors, both reactions requiring the A nut site and the host Nus proteins for optimal efficiency (16). Mutations in nut inhibit both N and Nun (12,17). Certain nusA, nusB, and nusE mutations inhibit both functions (12); however, some nusA mutations and mutations in nusG and in rpoC (encoding the 13' subunit of RNAP) are
A database comprising all ligand-binding sites of known structure aligned with all related protein sequences and structures is described. Currently, the database contains approximately 50000 ligand-binding sites for small molecules found in the Protein Data Bank (PDB). The structure-structure alignments are obtained by the Combinatorial Extension (CE) program (Shindyalov and Bourne, Protein Eng., 11, 739-747, 1998) and sequence-structure alignments are extracted from the ModBase database of comparative protein structure models for all known protein sequences (Sanchez et al., Nucleic Acids Res., 28, 250-253, 2000). It is possible to search for binding sites in LigBase by a variety of criteria. LigBase reports summarize ligand data including relevant structural information from the PDB file, such as ligand type and size, and contain links to all related protein sequences in the TrEMBL database. Residues in the binding sites are graphically depicted for comparison with other structurally defined family members. LigBase provides a resource for the analysis of families of related binding sites.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.