Chemical Entities of Biological Interest (ChEBI) is a freely available dictionary of molecular entities focused on ‘small’ chemical compounds. The molecular entities in question are either natural products or synthetic products used to intervene in the processes of living organisms. Genome-encoded macromolecules (nucleic acids, proteins and peptides derived from proteins by cleavage) are not as a rule included in ChEBI. In addition to molecular entities, ChEBI contains groups (parts of molecular entities) and classes of entities. ChEBI includes an ontological classification, whereby the relationships between molecular entities or classes of entities and their parents and/or children are specified. ChEBI is available online at http://www.ebi.ac.uk/chebi/
The additional bibliography and list of Internet resources on bioinorganic chemistry are available at http://www.ebi.ac.uk/ approximately kirill/biometal/
This paper reviews the classification of the P450 superfamily which is mainly based on sequence homology. The widely accepted classification by Nebert et al. [(1991) DNA Cell Biol. 10, l-141 as well as the results of a 'two-step' multiple sequence alignment technique show that the molecular evolution of P45Os, in contrast to that of many protein families, does not follow phylogeny. The data suggest that during the evolution of P45Os, gene duplications and gene fusions, horizontal gene transfer and intron loss events have occurred. 'Weak' and 'strong' hierarchies in the clustering of P450 sequences were revealed. A novel evolutionary tree of the P450 superfamily has been constructed using a multiple alignment of consensus sequences. The simple classification of known WSO-containing monooxygenase systems into three-, two-and one-component systems is further discussed. Particularly, the multidomain enzyme, nitric oxide synthase (NOS), should be classified as an example of a eukaryotic one-component P450 system since its N-terminal (haem) domain exhibits similarity with microsomal P45Os.
All known P450-containing monooxygenase systems share common structural and functional domain architecture. Apart from P450 itself, these systems can comprise several fundamentally different protein components or domains, all of which are shared by other multicomponent/multidomain enzyme systems with various functions: FAD flavoprotein or domain, FMN domain, Fe2S2 ferredoxin, Fe3S4 ferredoxin, and cytochrome b5. Either FMN domain, ferredoxins or cytochrome b5 serve as the electron transport intermediate between the FAD domain and P450. The molecular evolution of both P450-containing systems and of each particular component does not follow phylogeny in general. Gene fusion and horizontal gene transfer events can lead to the appearance of novel redox chains in the same manner that artificial chimeric proteins can be constructed by humans. Recent studies using genetic and protein engineering techniques to investigate the separate domains and their interaction are described.
PRINTS is a compendium of protein motif fingerprints derived from the OWL composite sequence database. Fingerprints are groups of motifs within sequence alignments whose conserved nature allows them to be used as signatures of family membership. Fingerprints inherently offer improved diagnostic reliability over single motif methods by virtue of the mutual context provided by motif neighbors. To date, 650 fingerprints have been constructed and stored in PRINTS, the size of which has doubled in the last 2 years. The current version, 14.0, encodes 3500 motifs, covering a range of globular and membrane proteins, modular polypeptides, and so on. The database is now accessible via the UCL Bioinformatics Server on http:@ www.biochem.ucl.ac.uk/bsm/dbbrowser/. We describe here progress with the database, its compilation and interrogation software, and its Web interface.
The PRINTS database of protein family 'fingerprints' is a diagnostic resource that complements the PROSITE dictionary of sites and patterns. Unlike regular expressions, fingerprints exploit groups of conserved motifs within sequence alignments to build characteristic signatures of family membership. Thus fingerprints inherently offer improved diagnostic reliability by virtue of the mutual context provided by motif neighbours. To date, 600 fingerprints have been constructed and stored in PRINTS, representing a 50% increase in the size of the database in the last year. The current version, 13.0, encodes approximately 3000 motifs, covering a range of globular and membrane proteins, modular polypeptides, and so on. The database is accessible via UCL's Bioinformatics World Wide Web (WWW) server at http://www.biochem.ucl.ac.uk/bsm/dbbrowser / . We describe here progress with the database, its Web interface, and a recent exciting development: the integration of a novel colour alignment editor (http://www.biochem.ucl.ac.uk/bsm/dbbrowser++ +/CINEMA ), which allows visualisation and interactive manipulation of PRINTS alignments over the Internet.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.