The aim of the UniProt Knowledgebase is to provide users with a comprehensive, high-quality and freely accessible set of protein sequences annotated with functional information. In this article, we describe significant updates that we have made over the last two years to the resource. The number of sequences in UniProtKB has risen to approximately 190 million, despite continued work to reduce sequence redundancy at the proteome level. We have adopted new methods of assessing proteome completeness and quality. We continue to extract detailed annotations from the literature to add to reviewed entries and supplement these in unreviewed entries with annotations provided by automated systems such as the newly implemented Association-Rule-Based Annotator (ARBA). We have developed a credit-based publication submission interface to allow the community to contribute publications and annotations to UniProt entries. We describe how UniProtKB responded to the COVID-19 pandemic through expert curation of relevant entries that were rapidly made available to the research community through a dedicated portal. UniProt resources are available under a CC-BY (4.0) license via the web at https://www.uniprot.org/.
Pirellula sp. strain 1 (''Rhodopirellula baltica'') is a marine representative of the globally distributed and environmentally important bacterial order Planctomycetales. Here we report the complete genome sequence of a member of this independent phylum. With 7.145 megabases, Pirellula sp. strain 1 has the largest circular bacterial genome sequenced so far. The presence of all genes required for heterolactic acid fermentation, key genes for the interconversion of C1 compounds, and 110 sulfatases were unexpected for this aerobic heterotrophic isolate. Although Pirellula sp. strain 1 has a proteinaceous cell wall, remnants of genes for peptidoglycan synthesis were found. Genes for lipid A biosynthesis and homologues to the flagellar L-and P-ring protein indicate a former Gram-negative type of cell wall. Phylogenetic analysis of all relevant markers clearly affiliates the Planctomycetales to the domain Bacteria as a distinct phylum, but a deepest branching is not supported by our analyses. P irellula sp. strain 1, which is in the process of being validly described as ''Rhodopirellula baltica,'' is a marine, aerobic, heterotrophic representative of the globally distributed and environmentally important bacterial order Planctomycetales. Molecular microbial ecology studies repeatedly provided evidence that planctomycetes are abundant in terrestrial and marine habitats (1-5). For example, they inhabit phytodetrital macroaggregates in marine environments (6) and include one of the organisms known to derive energy from the anaerobic oxidation of ammonia (7). They catalyze important transformations in global carbon and nitrogen cycles. By their mineralization of marine snow particles planctomycetes have a profound impact on global biogeochemistry and climate by affecting exchange processes between the geosphere and atmosphere (8). From a phylogenetic perspective the order Planctomycetales forms an independent, monophyletic phylum of the domain Bacteria (9). It has recently been suggested to be the deepest branching bacterial phylum (10). Planctomycetes are unique in many other respects. Their cell walls do not contain peptidoglycan, the main structural polymer of most members of the domain Bacteria. They show a unique cell compartmentalization in which a single membrane separates a peripheral ribosome-free paryphoplasm from the inner riboplasm (pirellulosome). Within the riboplasm, all planctomycetes contain a condensed fibrillar nucleoid, which in Gemmata spp. is surrounded by a additional double membrane (11). These structures, together with an unusual fatty acid composition of the phospholipids, resemble eukaryotes rather than a representative of the bacterial domain (12).Characteristic for Planctomycetales are the polar cell organization and a life cycle with a polar, yeast-like cell division. Cells attach to surfaces at their vegetative poles by means of an excreted holdfast substance or stalks (13). Further unusual features are the crateriform structures on the cell surface of all planctomycetes (14). They appear as...
Members of the Bacteroidetes, formerly known as the Cytophaga-Flavobacteria-Bacteroides (CFB) phylum, are among the major taxa of marine heterotrophic bacterioplankton frequently found on macroscopic organic matter particles (marine snow). In addition, they have been shown to also represent a significant part of free-living microbial assemblages in nutrient-rich microenvironments. Their abundance and distribution pattern in combination with enzymatic activity studies has led to the notion that organisms of this group are specialists for degradation of high molecular weight compounds in both the dissolved and particulate fraction of the marine organic matter pool, implying a major role of Bacteroidetes in the marine carbon cycle. Despite their ecological importance, comprehensive molecular data on organisms of this group have been scarce so far. Here we report on the first whole genome analysis of a marine Bacteroidetes representative, 'Gramella forsetii' KT0803. Functional analysis of the predicted proteome disclosed several traits which in joint consideration suggest a clear adaptation of this marine Bacteroidetes representative to the degradation of high molecular weight organic matter, such as a substantial suite of genes encoding hydrolytic enzymes, a predicted preference for polymeric carbon sources and a distinct capability for surface adhesion.
Magnetotactic bacteria (MTB) are a heterogeneous group of aquatic prokaryotes with a unique intracellular organelle, the magnetosome, which orients the cell along magnetic field lines. Magnetotaxis is a complex phenotype, which depends on the coordinate synthesis of magnetosomes and the ability to swim and orient along the direction caused by the interaction with the Earth's magnetic field. Although a number of putative magnetotaxis genes were recently identified within a conserved genomic magnetosome island (MAI) of several MTB, their functions have remained mostly unknown, and it was speculated that additional genes located outside the MAI might be involved in magnetosome formation and magnetotaxis. In order to identify genes specifically associated with the magnetotactic phenotype, we conducted comparisons between four sequenced magnetotactic Alphaproteobacteria including the nearly complete genome of Magnetospirillum gryphiswaldense strain MSR-1, the complete genome of Magnetospirillum magneticum strain AMB-1, the complete genome of the magnetic coccus MC-1, and the comparative-ready preliminary genome assembly of Magnetospirillum magnetotacticum strain MS-1 against an in-house database comprising 426 complete bacterial and archaeal genome sequences. A magnetobacterial core genome of about 891 genes was found shared by all four MTB. In addition to a set of approximately 152 genus-specific genes shared by the three Magnetospirillum strains, we identified 28 genes as group specific, i.e., which occur in all four analyzed MTB but exhibit no (MTB-specific genes) or only remote (MTB-related genes) similarity to any genes from nonmagnetotactic organisms and which besides various novel genes include nearly all mam and mms genes previously shown to control magnetosome formation. The MTB-specific and MTB-related genes to a large extent display synteny, partially encode previously unrecognized magnetosome membrane proteins, and are either located within (18 genes) or outside (10 genes) the MAI of M. gryphiswaldense. These genes, which represent less than 1% of the 4,268 open reading frames of the MSR-1 genome, as yet are mostly of unknown functions but are likely to be specifically involved in magnetotaxis and, thus, represent prime targets for future experimental analysis.Magnetotactic bacteria (MTB) represent a diverse group of prokaryotes with respect to morphology and physiology (5) that are capable of magnetic orientation. All known MTB belong to different subgroups of the Proteobacteria and the Nitrospirae phylum, with most known cultivated and uncultivated representatives within the Alphaproteobacteria (2). Magnetotaxis is a complex phenotype, which depends on the synthesis of specific intracellular organelles, the magnetosomes. The synthesis of magnetosomes is a process with genetic control over the species-specific morphology and size of magnetosome crystals and the intracellular assembly of chain-like structures. In addition, magnetotaxis requires active migration along the geomagnetic field, which apparentl...
Background: In the emerging field of environmental genomics, direct cloning and sequencing of genomic fragments from complex microbial communities has proven to be a valuable source of new enzymes, expanding the knowledge of basic biological processes. The central problem of this so called metagenome-approach is that the cloned fragments often lack suitable phylogenetic marker genes, rendering the identification of clones that are likely to originate from the same genome difficult or impossible. In such cases, the analysis of intrinsic DNA-signatures like tetranucleotide frequencies can provide valuable hints on fragment affiliation. With this application in mind, the TETRA web-service and the TETRA stand-alone program have been developed, both of which automate the task of comparative tetranucleotide frequency analysis.
Desulfotalea psychrophila is a marine sulfate-reducing delta-proteobacterium that is able to grow at in situ temperatures below 0 degrees C. As abundant members of the microbial community in permanently cold marine sediments, D. psychrophila-like bacteria contribute to the global cycles of carbon and sulfur. Here, we describe the genome sequence of D. psychrophila strain LSv54, which consists of a 3 523 383 bp circular chromosome with 3118 predicted genes and two plasmids of 121 586 bp and 14 663 bp. Analysis of the genome gave insight into the metabolic properties of the organism, e.g. the presence of TRAP-T systems as a major route for the uptake of C(4)-dicarboxylates, the unexpected presence of genes from the TCA cycle, a TAT secretion system, the lack of a beta-oxidation complex and typical Desulfovibrio cytochromes, such as c(553), c(3) and ncc. D. psychrophila encodes more than 30 two-component regulatory systems, including a new Ntr subcluster of hybrid kinases, nine putative cold shock proteins and nine potentially cold shock-inducible proteins. A comparison of D. psychrophila's genome features with those of the only other published genome from a sulfate reducer, the hyperthermophilic archaeon Archaeoglobus fulgidus, revealed many striking differences, but only a few shared features.
The anaerobic oxidation of methane is a globally significant process which is mediated by consortia of yet uncultivated methanotrophic archaea (ANME) and sulfate-reducing bacteria. In order to gain deeper insights into genome characteristics of the different ANME groups, large-insert genomic libraries were constructed using DNA extracted from a methanotrophic microbial mat growing in the anoxic part of the Black Sea, and from sediments above gas hydrates at the Hydrate Ridge off the coast of Oregon. Analysis of these fosmid libraries with respect to archaeal 16S rRNA gene diversity revealed a single ANME-1b ribotype for the Black Sea libraries, whereas the sequences derived from the Hydrate Ridge library phylogenetically affiliated with the ANME-2a, ANME-2c and ANME-3 group. Genome walking for ANME-1b resulted in a contiguous 155 kb composite genome fragment. The comparison of a set of four genomic fragments belonging to the different ANME groups revealed differences in the rRNA operon structure and the average G+C content, with the ANME-2c contig showing the highest divergence within the set. A detailed analysis of the ANME contigs with respect to genes putatively involved in the anaerobic oxidation of methane led to the identification of: (i) a putative N5,N10-methenyltetrahydromethanopterin cyclohydrolase gene, (ii) a gene cluster supposedly encoding a novel type of heterodisulfide reductase/dehydrogenase complex and (iii) a gene cluster putatively encoding a new type of CO dehydrogenase/acetyl-CoA synthase enzyme complex.
Motivation To provide high quality computationally tractable enzyme annotation in UniProtKB using Rhea, a comprehensive expert-curated knowledgebase of biochemical reactions which describes reaction participants using the ChEBI (Chemical Entities of Biological Interest) ontology. Results We replaced existing textual descriptions of biochemical reactions in UniProtKB with their equivalents from Rhea, which is now the standard for annotation of enzymatic reactions in UniProtKB. We developed improved search and query facilities for the UniProt website, REST API, and SPARQL endpoint that leverage the chemical structure data, nomenclature, and classification that Rhea and ChEBI provide. Availability and Implementation UniProtKB at https://www.uniprot.org; UniProt REST API at https://www.uniprot.org/help/api; UniProt SPARQL endpoint at https://sparql.uniprot.org/; Rhea at https://www.rhea-db.org. Supplementary information None.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.