Summary
KofamKOALA is a web server to assign KEGG Orthologs (KOs) to protein sequences by homology search against a database of profile hidden Markov models (KOfam) with pre-computed adaptive score thresholds. KofamKOALA is faster than existing KO assignment tools with its accuracy being comparable to the best performing tools. Function annotation by KofamKOALA helps linking genes to KEGG resources such as the KEGG pathway maps and facilitates molecular network reconstruction.
Availability and implementation
KofamKOALA, KofamScan and KOfam are freely available from GenomeNet (https://www.genome.jp/tools/kofamkoala/).
Supplementary information
Supplementary data are available at Bioinformatics online.
SummaryThe ocean is home to myriad small planktonic organisms that underpin the functioning of marine ecosystems. However, their spatial patterns of diversity and the underlying drivers remain poorly known, precluding projections of their responses to global changes. Here we investigate the latitudinal gradients and global predictors of plankton diversity across archaea, bacteria, eukaryotes, and major virus clades using both molecular and imaging data from Tara Oceans. We show a decline of diversity for most planktonic groups toward the poles, mainly driven by decreasing ocean temperatures. Projections into the future suggest that severe warming of the surface ocean by the end of the 21st century could lead to tropicalization of the diversity of most planktonic groups in temperate and polar regions. These changes may have multiple consequences for marine ecosystem functioning and services and are expected to be particularly significant in key areas for carbon sequestration, fisheries, and marine conservation.Video Abstract
KofamKOALA is a web server to assign KEGG Orthologs (KOs) to protein sequences by homology search against a database of profile hidden Markov models (KOfam) with pre-computed adaptive score thresholds. KofamKOALA is faster than existing KO assignment tools with its accuracy being comparable to the best performing tools. Function annotation by KofamKOALA helps linking genes to KEGG resources such as the KEGG pathway maps and facilitates molecular network reconstruction.
Nucleocytoplasmic large DNA viruses (NCLDVs) are ubiquitous in marine environments and infect diverse eukaryotes. However, little is known about their biogeography and ecology in the ocean. By leveraging the Tara Oceans pole-to-pole metagenomic data set, we investigated the distribution of NCLDVs across size fractions, depths and biomes, as well as their associations with eukaryotic communities. Our analyses revealed a heterogeneous distribution of NCLDVs across oceans, with an elevated uniqueness in polar biomes. The community structures of NCLDV families were correlated with specific eukaryotic lineages including many photosynthetic groups.NCDLV communities were generally distinct between surface and mesopelagic zones, but at some locations, they exhibited a high similarity between the two depths. This vertical similarity was correlated to surface phytoplankton biomass but not to physical mixing processes, suggesting the potential role of vertical export in structuring mesopelagic NCLDV communities. These results underscore the importance of the coupling between NCLDVs and eukaryotes in biogeochemical processes in the ocean.
Diatoms and haptophytes are two major phytoplankton groups, playing pivotal roles in global biogeochemical cycles and marine ecosystems. In general, diatoms have higher growth rates than haptophytes, whereas haptophytes tend to have higher nutrient uptake affinity. However, precise linkages between their ecological traits and geographical distributions remain poorly understood. Herein, we examined the basin-scale variability of the abundance and taxonomic composition of these two phytoplankton groups across 35 sites in the Pacific Ocean using DNA metabarcoding. The diatom community was generally dominated by a few genera at each sample site, whereas the haptophyte community consisted of a large number of genera in most of the sites. The coexistence of various haptophyte genera might be achieved by diversification of their ecophysiological traits such as mixotrophy. On the other hand, the diatom community might experience greater inter-genus competition due to the rapid uptake of nutrients. Our data further supports the notion that their distinct ecological strategies underlie the emergence of contrasting diversity patterns of these phytoplankton groups in the central Pacific at a basin scale.
“Megaviridae” is a proposed family of giant viruses infecting unicellular eukaryotes. These viruses are ubiquitous in the sea and have impact on marine microbial community structure and dynamics through their lytic infection cycle. However, their diversity and biogeography have been poorly characterized due to the scarce detection of Megaviridae sequences in metagenomes, as well as the limitation of reference sequences used to design specific primers for this viral group. Here, we propose a set of 82 degenerated primers (referred to as MEGAPRIMER), targeting DNA polymerase genes (polBs) of Megaviridae. MEGAPRIMER was designed based on 921 Megaviridae polBs from sequenced genomes and metagenomes. By applying this primer set to environmental DNA meta-barcoding of a coastal seawater sample, we report 5595 non-singleton operational taxonomic units (OTUs) of Megaviridae at 97% nucleotide sequence identity. The majority of the OTUs were found to form diverse clades, which were phylogenetically distantly related to known viruses such as Mimivirus. The Megaviridae OTUs detected in this study outnumber the giant virus OTUs identified in previous individual studies by more than an order of magnitude. Hence, MEGAPRIMER represents a useful tool to study the diversity of Megaviridae at the population level in natural environments.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.