Background: The pan-genome of a bacterial species consists of a core and an accessory gene pool. The accessory genome is thought to be an important source of genetic variability in bacterial populations and is gained through lateral gene transfer, allowing subpopulations of bacteria to better adapt to specific niches. Low-cost, high-throughput sequencing platforms have produced an exponential increase in genome sequence data and an opportunity to study the pan-genomes of many bacterial species. In this study, we describe a new online pan-genome sequence analysis program, Panseq.
Results: Panseq was used to identify Escherichia coli O157:H7 and E. coli K-12 genomic islands. Within a population of 60 E. coli O157:H7 strains, the existence of 65 accessory genomic regions identified by Panseq analysis was confirmed by PCR. The accessory genome and binary presence/absence data, and the core genome and single nucleotide polymorphisms (SNPs), of six Listeria monocytogenes strains were extracted with Panseq, hierarchically clustered and visualized. The nucleotide core and binary accessory data were also used to construct maximum parsimony (MP) trees, which were compared to the MP tree generated by multi-locus sequence typing (MLST). The topology of the accessory and core trees was identical but differed from the tree produced using seven MLST loci. The Loci Selector module found the most variable and discriminatory combinations of four loci within a 100-locus set among 10 strains in 1 s, compared to the 449 s required to search exhaustively through all possible combinations; it also found the most discriminatory 20 loci from a 96-locus E. coli O157:H7 SNP dataset.
Conclusion: Panseq determines the core and accessory regions among a collection of genomic sequences based on user-defined parameters. It readily extracts regions unique to a genome or group of genomes, identifies SNPs within shared core genomic regions, constructs files for use in phylogeny programs based on both the presence/absence of accessory regions and SNPs within core regions, and produces a graphical overview of the output. Panseq also includes a loci selector that calculates the most variable and discriminatory loci among sets of accessory loci or core gene SNPs.
Availability: Panseq is freely available online at http://76.70.11.198/panseq. Panseq is written in Perl.
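The loci-selection idea described in the Panseq abstract can be illustrated with a small sketch. This is not Panseq's Loci Selector (which is written in Perl and whose scoring is not given in the abstract); it is a plain greedy heuristic, written in Python, that repeatedly picks the locus separating the most strain pairs not yet separated. The function names, the toy data and the scoring rule are all assumptions made for illustration.

```python
from itertools import combinations


def distinguished_pairs(alleles):
    """Return the set of strain-index pairs that one locus tells apart."""
    return {
        (i, j)
        for i, j in combinations(range(len(alleles)), 2)
        if alleles[i] != alleles[j]
    }


def greedy_select(loci, k):
    """Greedily pick up to k loci that separate the most strain pairs.

    loci maps a locus name to a sequence of allele calls, one per strain.
    This is an illustrative heuristic, not Panseq's actual algorithm.
    """
    pairs_by_locus = {name: distinguished_pairs(a) for name, a in loci.items()}
    chosen, covered = [], set()
    for _ in range(k):
        # Score each unused locus by how many *new* pairs it separates.
        best = max(
            (name for name in pairs_by_locus if name not in chosen),
            key=lambda name: len(pairs_by_locus[name] - covered),
            default=None,
        )
        if best is None or not pairs_by_locus[best] - covered:
            break  # no remaining locus adds any discrimination
        chosen.append(best)
        covered |= pairs_by_locus[best]
    return chosen, len(covered)


# Toy binary presence/absence data for four hypothetical strains.
toy = {
    "locusA": [1, 1, 0, 0],
    "locusB": [1, 0, 1, 0],
    "locusC": [1, 1, 1, 0],
    "locusD": [0, 0, 0, 0],
}
selected, n_pairs = greedy_select(toy, 2)
print(selected, n_pairs)
```

A greedy pass like this scores each locus once per round, whereas an exhaustive search over four loci drawn from 100 has to consider 3,921,225 combinations, which gives a sense of why the abstract reports such a large gap between the two running times. A greedy choice is not guaranteed to find the best combination in general.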
Background: Adherent and invasive Escherichia coli (AIEC) are commonly found in ileal lesions of Crohn's Disease (CD) patients, where they adhere to intestinal epithelial cells and invade and survive in epithelial cells and macrophages, thereby gaining access to a typically restricted host niche. Colonization leads to strong inflammatory responses in the gut, suggesting that AIEC could play a role in CD immunopathology. Despite extensive investigation, the genetic determinants accounting for the AIEC phenotype remain poorly defined. To address this, we present the complete genome sequence of an AIEC, revealing the genetic blueprint for this disease-associated E. coli pathotype.
Results: We sequenced the complete genome of E. coli NRG857c (O83:H1), a clinical isolate of AIEC from the ileum of a Crohn's Disease patient. Our sequence data confirmed a phylogenetic linkage between AIEC and extraintestinal pathogenic E. coli causing urinary tract infections and neonatal meningitis. Comparison of the NRG857c AIEC genome with other pathogenic and commensal E. coli allowed the identification of unique genetic features of the AIEC pathotype, including 41 genomic islands, and unique genes that are found only in strains exhibiting the adherent and invasive phenotype.
Conclusions: Until now, the virulence-like features associated with AIEC have been detectable only phenotypically. AIEC genome sequence data will facilitate the identification of genetic determinants implicated in invasion and intracellular growth, as well as enable functional genomic studies of AIEC gene expression during health and disease.
(Bacterio)phage PVP-SE1, isolated from a German wastewater plant, presents a high potential value as a biocontrol agent and as a diagnostic tool, even compared to the well-studied typing phage Felix 01, due to its broad lytic spectrum against different Salmonella strains. Sequence analysis of its genome (145,964 bp) shows it to be terminally redundant and circularly permuted. Its G+C content, 45.6 mol%, is lower than that of its hosts (50 to 54 mol%). We found a total of 244 open reading frames (ORFs), representing 91.6% of the coding capacity of the genome. Approximately 46% of encoded proteins are unique to this phage, and 22.1% of the proteins could be functionally assigned. This myovirus encodes a large number of tRNAs (n = 24), reflecting its lytic capacity and evolution through different hosts. Tandem mass spectrometric analysis using electrospray ionization revealed 25 structural proteins as part of the mature phage particle. The genome sequence was found to share homology with 140 proteins of the Escherichia coli bacteriophage rV5. Both phages are unrelated to any other known virus, which suggests that an "rV5-like virus" genus should be created within the Myoviridae to contain these two phages.
Phage vB_EcoM_CBA120 (CBA120), isolated against Escherichia coli O157:H7 from a cattle feedlot, is morphologically very similar to the classic phage ViI of Salmonella enterica serovar Typhi. Until recently, little was known genetically or physiologically about the ViI-like phages, and none targeting E. coli have been described in the literature. The genome of CBA120 has been fully sequenced and is highly similar to those of both ViI and the Shigella phage AG3. The core structural and replication-related proteins of CBA120 are homologous to those from T-even phages, but are generally more closely related to those from T4-like phages of Vibrio, Aeromonas and cyanobacteria than to those of the Enterobacteriaceae. The baseplate and method of adhesion to the host are, however, very different from those of either T4 or the cyanophages. None of the outer baseplate proteins are conserved. Instead of T4's long and short tail fibers, CBA120, like ViI, encodes tail spikes related to those normally seen on podoviruses. The 158 kb genome, like that of T4, is circularly permuted and terminally redundant but, unlike T4, CBA120 does not substitute hmdCyt for cytosine in its DNA. However, in contrast to other coliphages, CBA120 and related coliphages we have isolated cannot incorporate 3H-thymidine (3H-dThd) into their DNA. Protein sequence comparisons cluster the putative "thymidylate synthase" of CBA120, ViI and AG3 much more closely with those of Delftia phage φW-14, Bacillus subtilis phage SPO1, and Pseudomonas phage YuA, all known to produce and incorporate hydroxymethyluracil (hmdUra).
Despite multiple control measures, Escherichia coli O157:H7 (STEC O157:H7) continues to be responsible for many foodborne outbreaks in North America and elsewhere. Bacteriophage therapy may prove useful for controlling this pathogen in the host, its environment and food. Bacteriophage vB_EcoS_AKFV33 (AKFV33), a T5-like phage of the Siphoviridae, lysed common phage types of STEC O157:H7 but not non-O157 E. coli. Moreover, STEC O157:H7 isolated from the same feedlot pen from which the phage was obtained were highly susceptible to AKFV33. The adsorption rate constant and burst size were estimated to be 9.31 × 10⁻⁹ ml/min and 350 PFU/infected cell, respectively. The genome of AKFV33 was 108,853 bp (38.95% G+C), containing 160 open reading frames (ORFs), 22 tRNA genes and 32 strong promoters recognized by host RNA polymerase. Of 12 ORFs without homologues in T5-like phages, 7 were predicted to encode novel proteins, while the others exhibited low identity (<60%) to proteins in the National Center for Biotechnology Information database. AKFV33 also lacked the L-shaped tail fiber protein typical of T5, but was predicted to have tail fibers composed of 2 novel proteins with low identity (37–41%) to tail fibers of E. coli phage phiEco32 of the Podoviridae, a putative side tail fiber protein of a prophage from E. coli IAI39 and a conserved domain protein of E. coli MS196-1. The receptor-binding tail protein (pb5) shared an overall identity of 29–72% with that of other T5-like phages, with no region coding for more than 6 amino acids in common. Proteomic analysis identified 4 structural proteins corresponding to the capsid, major tail, tail fiber and pore-forming tail tip (pb2). The genome of AKFV33 lacked regions coding for known virulence factors, integration-related proteins or antibiotic resistance determinants. Phage AKFV33 is a unique, highly lytic STEC O157:H7-specific T5-like phage that may have considerable potential as a pre- and post-harvest biocontrol agent.
Background: Type 2 diabetes mellitus (T2DM) has been linked to a state of pre-clinical chronic inflammation resulting from abnormalities in the innate immune pathway. Serum levels of pro-inflammatory cytokines and acute-phase proteins, collectively known as the 'inflammatory network', are elevated in the pre-, or early, stages of T2DM and increase with disease progression. Genetic variation can affect the innate immune response to certain environmental factors, and may, therefore, determine an individual's lifetime risk of disease.
Methods: We conducted a cross-sectional study in 6,720 subjects from the TwinsUK Registry to evaluate the association between 18 single nucleotide polymorphisms (SNPs) in five genes (TLR4, IL1A, IL6, TNFA, and CRP) along the innate immunity-related inflammatory pathway and biomarkers of predisposition to T2DM [fasting insulin and glucose, HDL- and LDL-cholesterol, triglycerides (TGs), amyloid-A, sensitive C-reactive protein (sCRP), vitamin D binding protein (VDBP) and body mass index (BMI)].
Results: Of the 18 SNPs examined for their association with nine metabolic phenotypes of interest, six were significantly associated with five metabolic phenotypes (Bonferroni correction, P ≤ 0.0027). Fasting insulin was associated with SNPs in IL6 and TNFA, serum HDL-C with variants of TNFA and CRP, and serum sCRP level with SNPs in CRP. Cross-correlation analysis among the different metabolic factors related to risk of T2DM showed several significant associations. For example, BMI was directly correlated with glucose (r = 0.11), insulin (r = 0.15), sCRP (r = 0.23), LDL-C (r = 0.067) and TGs (r = 0.18) but inversely with HDL-C (r = -0.14). sCRP was also positively correlated (P < 0.0001) with insulin (r = 0.17), amyloid-A (r = 0.39), TGs (r = 0.26), and VDBP (r = 0.36) but inversely with HDL-C (r = -0.12).
Conclusion: Genetic variants in the innate immunity pathway and its related inflammatory cascade are associated with some metabolic risk factors for T2DM, an observation that may provide a rationale for further studying their role as biomarkers for early prediction of disease risk.
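As a rough check on the significance cutoff quoted in this abstract, and assuming (the abstract does not say so) a family-wise error rate of 0.05 divided equally across the 18 SNPs tested, a Bonferroni threshold would work out as:

\[
\alpha_{\text{per test}} = \frac{\alpha}{m} = \frac{0.05}{18} \approx 0.0028
\]

This is close to the reported P ≤ 0.0027 and would match it if the value was truncated rather than rounded. The abstract itself does not state which values of α and m were used, so this should be read as a plausibility check only.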
Background: Lytic bacteriophages have been applied successfully to control the growth of various foodborne pathogens. Sequencing of their genomes is considered an important preliminary step to ensure their safety prior to food applications.
Results: The lytic bacteriophage, ΦSboM-AG3, targets the important foodborne pathogen, Shigella. It is morphologically similar to phage ViI of Salmonella enterica serovar Typhi and a series of phages of Acinetobacter calcoaceticus and Rhizobium meliloti. The complete genome of ΦSboM-AG3 was determined to be 158 kb and was terminally redundant and circularly permuted. Two hundred and sixteen open reading frames (ORFs) were identified and annotated, most of which displayed homology to proteins of Salmonella phage ViI. The genome also included four genes specifying tRNAs.
Conclusions: This is the first time that a Vi-specific phage for Shigella has been described. There is no evidence for the presence of virulence and lysogeny-associated genes. In conclusion, the genome analysis of ΦSboM-AG3 indicates that this phage can be safely used for biocontrol purposes.