Microbiology Comment provides a platform for readers of Microbiology to communicate their personal observations and opinions in a more informal way than through the submission of papers.Most of us feel, from time to time, that other authors have not acknowledged the work of our own or other groups or have omitted to interpret important aspects of their own data. Perhaps we have observations that, although not sufficient to merit a full paper, add a further dimension to one published by others. In other instances we may have a useful piece of methodology that we would like to share.The Editors hope that readers will take full advantage of this section and use it to raise matters that hitherto have been confined to a limited audience.
Christopher M. Thomas, Editor-in-chiefNotes on designing a partial genomic database : The PfSBW25 Encyclopaedia, a sequence database for Pseudomonas fluorescens SBW25Managing and analysing DNA sequence is a major problem for those research programmes that generate large amounts of short-run noncontiguous (SRNC) sequence data as is the case for many genomic-level projects. Data must be accurately collated and archived, it must be accessible and preferably in a form that enables it to be data-mined. The problem of dealing with SRNC data is one that our laboratory has had to contend with as a result of our analysis of the genome of Pseudomonas fluorescens strain SBW5. To find a way forward, we designed an SNRC database using a novel set of Perl-CGI scripts that enable storing and mining of sequence data. The use of Perl scripting means that the database is user-friendly and fully accessible from the Web, which promotes communication between collaborators. It also promotes the publication of both raw and processed data, and means that SNRC data and information derived from bioinformatic analysis can be hyperlinked with remote databases (e.g. GenBank, SWISS-PROT, PubMed etc.).Microbial DNA analysis has become highly focused in the last five years on the manipulation and analysis of whole genome (WG) DNA sequences. The first of these was the 1n8 Mbp Haemophilus influenzae Rd genome completed in 1995 and since then, 27 additional microbial genomes have been completed. The value of these WG databases is tremendous, and there is substantial support for additional whole-genome sequencing efforts. Less obvious however, is the effort being put into sequencing projects where the complete genomic sequence is neither the primary nor the long-term goal of the research effort.Many projects aim to obtain biologically relevant sequences from specific experimental strategies. One example is IVET (in vitro expression technology) screening, where genes expressed only under certain environmental circumstances are recovered (2). A second is direct fluorescence induction, where individual bacteria expressing green fluorescent protein fusions are isolated by cell sorting (4). A third approach is signature-tagged mutagenesis, in which sequences are obtained from transposons inserted into virulence genes (1). T...