Summary The ‘Unknown Mutation Analysis (XMAn)’ database is a compilation of Homo sapiens mutated peptides in FASTA format, that was constructed for facilitating the identification of protein sequence alterations by tandem mass spectrometry detection. The database comprises 2 539 031 non-redundant mutated entries from 17 599 proteins, of which 2 377 103 are missense and 161 928 are nonsense mutations. It can be used in conjunction with search engines that seek the identification of peptide amino acid sequences by matching experimental tandem mass spectrometry data to theoretical sequences from a database. Availability and implementation XMAn v2 can be accessed from github.com/lazarlab/XMAnv2. Supplementary information Supplementary data are available at Bioinformatics online.
Routine strain-level identification of plant pathogens directly from symptomatic tissue could significantly improve plant disease control and prevention. Here we tested the Oxford Nanopore Technologies (ONT) MinION sequencer for metagenomic sequencing of tomato plants either artificially inoculated with a known strain of the bacterial speck pathogen Pseudomonas syringae pv. tomato or collected in the field and showing bacterial spot symptoms caused by one of four Xanthomonas species. After species-level identification via ONT’s WIMP software and the third-party tools Sourmash and MetaMaps, we used Sourmash and MetaMaps with a custom database of representative genomes of bacterial tomato pathogens to attempt strain-level identification. In parallel, each metagenome was assembled and the longest contigs were used as query with the genome-based microbial identification Web service LINbase. Both the read-based and assembly-based approaches correctly identified P. syringae pv. tomato strain T1 in the artificially inoculated samples. The pathogen strain in most field samples was identified as a member of Xanthomonas perforans group 2. This result was confirmed by whole genome sequencing of colonies isolated from one of the samples. Although in our case metagenome-based pathogen identification at the strain level was achieved, caution still must be exercised in interpreting strain-level results because of the challenges inherent to assigning reads to specific strains and the error rate of nanopore sequencing.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.