The release of the 1000th complete microbial genome will occur in the next two to three years. In anticipation of this milestone, the Fellowship for Interpretation of Genomes (FIG) launched the Project to Annotate 1000 Genomes. The project is built around the principle that the key to improved accuracy in high-throughput annotation technology is to have experts annotate single subsystems over the complete collection of genomes, rather than having an annotation expert attempt to annotate all of the genes in a single genome. Using the subsystems approach, all of the genes implementing the subsystem are analyzed by an expert in that subsystem. An annotation environment was created where populated subsystems are curated and projected to new genomes. A portable notion of a populated subsystem was defined, and tools developed for exchanging and curating these objects. Tools were also developed to resolve conflicts between populated subsystems. The SEED is the first annotation environment that supports this model of annotation. Here, we describe the subsystem approach, and offer the first release of our growing library of populated subsystems. The initial release of data includes 180 177 distinct proteins with 2133 distinct functional roles. This data comes from 173 subsystems and 383 different organisms.
A previous bioinformatics-based search for riboswitches yielded several candidate motifs in eubacteria. One of these motifs commonly resides in the 5' untranslated regions of genes involved in the biosynthesis of queuosine (Q), a hypermodified nucleoside occupying the anticodon wobble position of certain transfer RNAs. Here we show that this structured RNA is part of a riboswitch selective for 7-aminomethyl-7-deazaguanine (preQ(1)), an intermediate in queuosine biosynthesis. Compared with other natural metabolite-binding RNAs, the preQ(1) aptamer appears to have a simple structure, consisting of a single stem-loop and a short tail sequence that together are formed from as few as 34 nucleotides. Despite its small size, this aptamer is highly selective for its cognate ligand in vitro and has an affinity for preQ(1) in the low nanomolar range. Relatively compact RNA structures can therefore serve effectively as metabolite receptors to regulate gene expression.
The YgjD/Kae1 family (COG0533) has been on the top-10 list of universally conserved proteins of unknown function for over 5 years. It has been linked to DNA maintenance in bacteria and mitochondria and transcription regulation and telomere homeostasis in eukaryotes, but its actual function has never been found. Based on a comparative genomic and structural analysis, we predicted this family was involved in the biosynthesis of N 6 -threonylcarbamoyl adenosine, a universal modification found at position 37 of tRNAs decoding ANN codons. This was confirmed as a yeast mutant lacking Kae1 is devoid of t 6 A. t 6 A À strains were also used to reveal that t 6 A has a critical role in initiation codon restriction to AUG and in restricting frameshifting at tandem ANN codons. We also showed that YaeZ, a YgjD paralog, is required for YgjD function in vivo in bacteria. This work lays the foundation for understanding the pleiotropic role of this universal protein family.
The enzyme YkvM from Bacillus subtilis was identified previously along with three other enzymes (YkvJKL) in a bioinformatics search for enzymes involved in the biosynthesis of queuosine, a 7-deazaguanine modified nucleoside found in tRNA GUN of Bacteria and Eukarya. Genetic analysis of ykvJKLM mutants in Acinetobacter confirmed that each was essential for queuosine biosynthesis, and the genes were renamed queCDEF. QueF exhibits significant homology to the type I GTP cyclohydrolases characterized by FolE. Given that GTP is the precursor to queuosine and that a cyclohydrolase-like reaction was postulated as the initial step in queuosine biosynthesis, QueF was proposed to be the putative cyclohydrolase-like enzyme responsible for this reaction. We have cloned the queF genes from B. subtilis and Escherichia coli and characterized the recombinant enzymes. Contrary to the predictions based on sequence analysis, we discovered that the enzymes, in fact, catalyze a mechanistically unrelated reaction, the NADPHdependentreductionof7-cyano-7-deazaguanineto7-aminomethyl-7-deazaguanine, a late step in the biosynthesis of queuosine. We report here in vitro and in vivo studies that demonstrate this catalytic activity, as well as preliminary biochemical and bioinformatics analysis that provide insight into the structure of this family of enzymes.tRNA ͉ modified base T he avalanche of new protein structures that have been reported over the last decade (see summary at www.rcsb. org͞pdb͞holdings.html) has made it clear that the number of scaffolds that are used to produce all of the proteins in a cell is surprisingly limited, with Ϸ80% of the proteins using one of the 400 structural folds identified to date (1, 2). Specific functions evolve by duplication, recombination, and divergence of this core repertoire (3). Analysis of the functions of the different members of a protein structural family reveal that, in general, catalytic mechanisms and chemistries are conserved in a given family whereas substrate specificity changes (4). Much rarer are the cases in which the reactions catalyzed differ among members of the family (3); the best characterized examples being the TIM barrel superfamilies (5) and the enolase superfamily (6). Understanding the molecular paths that lead to the evolution of one function from another in a given superfamily is one of the next challenges of structural biology, impacting not only our understanding of how proteins evolve, but also the task of correctly annotating the genes identified by whole-genome sequencing (7,8).We recently used comparative genomic techniques (9) to discover four previously uncharacterized bacterial genes families (queCDEF) involved in the biosynthesis of the modified nucleoside queuosine (10). Three of these families (queCDE) have homologs in Archaea and are therefore implicated in the biosynthesis of the related modified nucleoside archaeosine (Fig. 1). Both nucleosides share an unusual 7-deazaguanosine core structure but diverge in their phylogenetic distribution, location in the ...
The enzyme clavaminate synthase (CS) catalyzes the formation of the first bicyclic intermediate in the biosynthetic pathway to the potent beta-lactamase inhibitor clavulanic acid. Our previous work has led to the proposal that the cyclization/desaturation of the substrate proclavaminate proceeds in two oxidative steps, each coupled to a decarboxylation of alpha-ketoglutarate and a reduction of dioxygen to water [Salowe, S. P., Marsh, E. N., & Townsend, C. A. (1990) Biochemistry 29, 6499-6508]. We have now employed kinetic isotope effect studies to determine the order of oxidations for CS purified from Streptomyces clavuligerus. By using (4'RS)-[4'-3H,1-14C]-rac-proclavaminate, a primary T(V/K) = 8.3 +/- 0.2 was measured from [3H]water release data, while an alpha-secondary T(V/K) = 1.06 +/- 0.01 was determined from the changing 3H/14C ratio of the product clavaminate. Values for the primary and alpha-secondary effects of 11.9 +/- 1.7 and 1.12 +/- 0.07, respectively, were obtained from the changing 3H/14C ratio of the residual proclavaminate by using new equations derived for a racemic substrate bearing isotopic label at both primary and alpha-secondary positions. Since only the first step of consecutive irreversible reactions will exhibit a V/K isotope effect, we conclude that C-4' is the initial site of oxidation in proclavaminate. As expected, no significant changes in the 3H/14C ratio of residual substrate were observed with [3-3H,1-14C]-rac-proclavaminate. However, two new tritiated compounds were produced in this incubation, apparently the result of isotope-induced branching brought about by the presence of tritium at the site of the second oxidation. One of these compounds was identified by comparison to authentic material as dihydroclavaminate, a stable intermediate that normally remains enzyme-bound. On the basis of the body of information available and the similarities to alpha-ketoglutarate-dependent dioxygenases, a comprehensive mechanistic scheme for CS is proposed to account for this unusual enzymatic transformation.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.