The potential of the diverse chemistries present in natural products (NP) for biotechnology and medicine remains untapped because NP databases are not searchable with raw data and the NP community has no way to share data other than in published papers. Although mass spectrometry techniques are well-suited to high-throughput characterization of natural products, there is a pressing need for an infrastructure to enable sharing and curation of data. We present Global Natural Products Social molecular networking (GNPS, http://gnps.ucsd.edu), an open-access knowledge base for community wide organization and sharing of raw, processed or identified tandem mass (MS/MS) spectrometry data. In GNPS crowdsourced curation of freely available community-wide reference MS libraries will underpin improved annotations. Data-driven social-networking should facilitate identification of spectra and foster collaborations. We also introduce the concept of ‘living data’ through continuous reanalysis of deposited data.
The ability to correlate the production of specialized metabolites to the genetic capacity of the organism that produces such molecules has become an invaluable tool in aiding the discovery of biotechnologically applicable molecules. Here, we accomplish this task by matching molecular families with gene cluster families, making these correlations to 60 microbes at one time instead of connecting one molecule to one organism at a time, such as how it is traditionally done. We can correlate these families through the use of nanospray desorption electrospray ionization MS/MS, an ambient pressure MS technique, in conjunction with MS/MS networking and peptidogenomics. We matched the molecular families of peptide natural products produced by 42 bacilli and 18 pseudomonads through the generation of amino acid sequence tags from MS/MS data of specific clusters found in the MS/MS network. These sequence tags were then linked to biosynthetic gene clusters in publicly accessible genomes, providing us with the ability to link particular molecules with the genes that produced them. As an example of its use, this approach was applied to two unsequenced Pseudoalteromonas species, leading to the discovery of the gene cluster for a molecular family, the bromoalterochromides, in the previously sequenced strain P. piscicida JCM 20779 T . The approach itself is not limited to 60 related strains, because spectral networking can be readily adopted to look at molecular family-gene cluster families of hundreds or more diverse organisms in one single MS/MS network.MS/MS molecular networking | mass spectrometry | microbial ecology T ens of thousands of sequenced microbial genomes or rough drafts of genomes are available at this time, and this number is predicted to grow into the millions over the next decades. This wealth of sequence data has the potential to be used for the discovery of small bioactive molecules through genome mining (1-6). Genome mining is a process in which small molecules are discovered by predicting what compound will be genetically encoded based on the sequences of biosynthetic gene clusters. However, the process of mining genetically encoded small molecules is not keeping pace with the rate by which genome sequences are being obtained. In general, genome mining is still done one gene cluster at a time and requires many person-years of effort to annotate a single molecule. The time and significant expertise that current genome mining requires also make genome mining very expensive. In light of this extensive effort and cost, alternative approaches to genome mining and annotating specialized metabolites must be developed that not only take advantage of the sequenced resources available and make it efficient to perform genome mining on a more global scale but also enable the molecular analysis of unsequenced organisms. Such methods will then significantly reduce the cost of genome mining by increasing the speed with which molecules are connected to candidate genes and using resources already available. Here, we put fo...
These findings identify the El Niño episodes as a reliable vehicle for the introduction and propagation of Vibrio pathogens in South America. The movement of oceanic waters seems to be one of the driving forces of the spread of Vibrio diseases.
Coral reefs are intricate ecosystems that harbor diverse organisms, including 25% of all marine fish. Healthy corals exhibit a complex symbiosis between coral polyps, endosymbiotic alga, and an array of microorganisms, called the coral holobiont. Secretion of specialized metabolites by coral microbiota is thought to contribute to the defense of this sessile organism against harmful biotic and abiotic factors. While few causative agents of coral diseases have been unequivocally identified, fungi have been implicated in the massive destruction of some soft corals worldwide. Because corals are nocturnal feeders, they may be more vulnerable to fungal infection at night, and we hypothesized that the coral microbiota would have the capability to enhance their defenses against fungi in the dark. A Pseudoalteromonas sp. isolated from a healthy octocoral displayed light-dependent antifungal properties when grown adjacent to Penicilliumcitrinum (P. citrinum) isolated from a diseased Gorgonian octocoral. Microbial MALDI-imaging mass spectrometry (IMS) coupled with molecular network analyses revealed that Pseudoalteromonas produced higher levels of antifungal polyketide alteramides in the dark than in the light. The alteramides were inactivated by light through a photoinduced intramolecular cyclization. Further NMR studies led to a revision of the stereochemical structure of the alteramides. Alteramide A exhibited antifungal properties and elicited changes in fungal metabolite distributions of mycotoxin citrinin and citrinadins. These data support the hypothesis that coral microbiota use abiotic factors such as light to regulate the production of metabolites with specialized functions to combat opportunistic pathogens at night.
Fungal infections are increasing worldwide, including in the aquatic environment. Microbiota that coexist with marine life can provide protection against fungal infections by secretion of metabolites with antifungal properties. Our laboratory has developed mass spectrometric methodologies with the goal of improving our functional understanding of microbial metabolites and guiding the discovery process of anti-infective agents from natural sources. GA40, a Bacillus amyloliquefaciens strain isolated from an octocoral in Panama, displayed antifungal activity against various terrestrial and marine fungal strains. Using matrix-assisted laser desorption/ionization-imaging mass spectrometry (MALDI-IMS), the molecular species produced by this microbe were visualized in a side-by-side interaction with two representative fungal strains, Aspergillus fumigatus and Aspergillus niger. The visualization was performed directly on the agar without the need for extraction. By comparison of spatial distributions, relative intensities and m/z values of GA40 secreted metabolites in the fungal interactions versus singly grown control colonies, we obtained insight into the antifungal activity of secreted metabolites. Annotation of GA40 metabolites observed in MALDI-IMS was facilitated by MS/MS networking analysis, a mass spectrometric technique that clusters metabolites with similar MS/MS fragmentation patterns. This analysis established that the predominant GA40 metabolites belong to the iturin family. In a fungal inhibition assay of A. fumigatus, the GA40 iturin metabolites were found to be responsible for the antifungal properties of this Bacillus strain.
Vibrio parahaemolyticus is a foodborne pathogen that has become a public health concern at the global scale. The epidemiological significance of V. parahaemolyticus infections in Latin America received little attention until the winter of 1997 when cases related to the pandemic clone were detected in the region, changing the epidemic dynamics of this pathogen in Peru. With the aim to assess the impact of the arrival of the pandemic clone on local populations of pathogenic V. parahaemolyticus in Peru, we investigated the population genetics and genomic variation in a complete collection of non-pandemic strains recovered from clinical sources in Peru during the pre- and post-emergence periods of the pandemic clone. A total of 56 clinical strains isolated in Peru during the period 1994 to 2007, 13 strains from Chile and 20 strains from Asia were characterized by Multilocus Sequence Typing (MLST) and checked for the presence of Variable Genomic Regions (VGRs). The emergence of O3:K6 cases in Peru implied a drastic disruption of the seasonal dynamics of infections and a shift in the serotype dominance of pathogenic V. parahaemolyticus. After the arrival of the pandemic clone, a great diversity of serovars not previously reported was detected in the country, which supports the introduction of additional populations cohabitating with the pandemic group. Moreover, the presence of genomic regions characteristic of the pandemic clone in other non-pandemic strains may represent early evidence of genetic transfer from the introduced population to the local communities. Finally, the results of this study stress the importance of population admixture, horizontal genetic transfer and homologous recombination as major events shaping the structure and diversity of pathogenic V. parahaemolyticus.
Vibrio parahaemolyticus is an important human pathogen whose transmission is associated with the consumption of contaminated seafood. Consistent multilocus sequence typing for V. parahaemolyticus has shown difficulties in the amplification of the recA gene by PCR associated with a lack of amplification or a larger PCR product than expected. In one strain (090–96, Peru, 1996), the produced PCR product was determined to be composed of two recA fragments derived from different Vibrio species. To better understand this phenomenon, we sequenced the whole genome of this strain. The hybrid recA gene was found to be the result of a fragmentation of the original lineage-specific recA gene resulting from a DNA insertion of approximately 30 kb in length. This insert had a G+C content of 38.8%, lower than that of the average G+C content of V. parahaemolyticus (45.2%), and contained 19 ORFs, including a complete recA gene. This new acquired recA gene deviated 24% in sequence from the original recA and was distantly related to recA genes from bacteria of the Vibrionaceae family. The reconstruction of the original recA gene (recA3) identified the precursor as belonging to ST189, a sequence type reported previously only in Asian countries. The identification of this singular genetic feature in strains from Asia reveals new evidence for genetic connectivity between V. parahaemolyticus populations at both sides of the Pacific Ocean that, in addition to the previously described pandemic clone, supports the existence of a recurrent transoceanic spreading of pathogenic V. parahaemolyticus with the corresponding potential risk of pandemic expansion.
The dynamics of dissemination of the environmental human pathogen Vibrio parahaemolyticus are uncertain. The O3:K6 clone was restricted to Asia until its detection along the Peruvian coasts and in northern Chile in 1997 in phase with the arrival of El Niño waters. A subsequent emergence of O3:K6 strains was detected in austral Chile in 2004. The origin of these 1997 and 2004 population radiations has not yet been conclusively determined. Multiple loci VNTR analysis using seven polymorphic loci was carried out with a number of representative strains from Asia, Peru and Chile to determine their genetic characteristics and population structure. Asian and Chilean subpopulations were the most genetically distant groups with an intermediate subpopulation in Peru. Population structure inferred from a minimum-spanning tree and Bayesian analysis divided the populations into two genetically distinct groups, consistent with the epidemic dynamics of the O3:K6 clone in South America. One group comprised strains from the original Asiatic population and strains arriving in Peru and Chile in 1997. The second group included the remaining Peruvian Strains and Chilean strains obtained from Puerto Montt in 2004. The analysis of the arrival of the O3:K6 clone at the Pacific coasts of South America has provided novel insights linking the origin of the invasion in 1997 to Asian populations and describing the successful establishment of the O3:K6 populations, first in Peru and subsequently in the South of Chile owing to a possible radiation of Peruvian populations.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
334 Leonard St
Brooklyn, NY 11211
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.