No single genealogical reconstruction or typing method currently encompasses all levels of bacterial diversity, from domain to strain. We propose ribosomal multilocus sequence typing (rMLST), an approach which indexes variation of the 53 genes encoding the bacterial ribosome protein subunits (rps genes), as a means of integrating microbial genealogy and typing. As with multilocus sequence typing (MLST), rMLST employs curated reference sequences to identify gene variants efficiently and rapidly. The rps loci are ideal targets for a universal characterization scheme as they are: (i) present in all bacteria; (ii) distributed around the chromosome; and (iii) encode proteins which are under stabilizing selection for functional conservation. Collectively, the rps loci exhibit variation that resolves bacteria into groups at all taxonomic and most typing levels, providing significantly more resolution than 16S small subunit rRNA gene phylogenies. A web-accessible expandable database, comprising whole-genome data from more than 1900 bacterial isolates, including 28 draft genomes assembled de novo from the European Bioinformatics Institute (EBI) sequence read archive, has been assembled. The rps gene variation catalogued in this database permits rapid and computationally non-intensive identification of the phylogenetic position of any bacterial sequence at the domain, phylum, class, order, family, genus, species and strain levels. The groupings generated with rMLST data are consistent with current nomenclature schemes and independent of the clustering algorithm used. This approach is applicable to the other domains of life, potentially providing a rational and universal approach to the classification of life that is based on one of its fundamental features, the translation mechanism.
A multilocus sequence typing (MLST) scheme has been developed for the unambiguous characterization of encapsulated and noncapsulated Haemophilus influenzae isolates. The sequences of internal fragments of seven housekeeping genes were determined for 131 isolates, comprising a diverse set of 104 serotype a, b, c, d, e, and f isolates and 27 noncapsulated isolates. Many of the encapsulated isolates had previously been characterized by multilocus enzyme electrophoresis (MLEE), and the validity of the MLST scheme was established by the very similar clustering of isolates obtained by these methods. Isolates of serotypes c, d, e, and f formed monophyletic groups on a dendrogram constructed from the differences in the allelic profiles of the isolates, whereas there were highly divergent lineages of both serotype a and b isolates. Noncapsulated isolates were distinct from encapsulated isolates and, with one exception, were within two highly divergent clusters. The relationships between the major lineages of encapsulated H. influenzae inferred from MLEE data could not be discerned on a dendrogram constructed from differences in the allelic profiles, but were apparent on a tree reconstructed from the concatenated nucleotide sequences. Recombination has not therefore completely eliminated phylogenetic signal, and in support of this, for encapsulated isolates, there was significant congruence between many of the trees reconstructed from the sequences of the seven individual loci. Congruence was less apparent for noncapsulated isolates, suggesting that the impact of recombination is greater among noncapsulated than encapsulated isolates. The H. influenzae MLST scheme is available at www.mlst.net, it allows any isolate to be compared with those in the MLST database, and (for encapsulated isolates) it assigns isolates to their phylogenetic lineage, via the Internet.
Hybridization between distantly related organisms can facilitate rapid adaptation to novel environments, but is potentially constrained by epistatic fitness interactions among cell components. The zoonotic pathogens Campylobacter coli and C. jejuni differ from each other by around 15% at the nucleotide level, corresponding to an average of nearly 40 amino acids per protein-coding gene. Using whole genome sequencing, we show that a single C. coli lineage, which has successfully colonized an agricultural niche, has been progressively accumulating C. jejuni DNA. Members of this lineage belong to two groups, the ST-828 and ST-1150 clonal complexes. The ST-1150 complex is less frequently isolated and has undergone a substantially greater amount of introgression leading to replacement of up to 23% of the C. coli core genome as well as import of novel DNA. By contrast, the more commonly isolated ST-828 complex bacteria have 10–11% introgressed DNA, and C. jejuni and nonagricultural C. coli lineages each have <2%. Thus, the C. coli that colonize agriculture, and consequently cause most human disease, have hybrid origin, but this cross-species exchange has so far not had a substantial impact on the gene pools of either C. jejuni or nonagricultural C. coli. These findings also indicate remarkable interchangeability of basic cellular machinery after a prolonged period of independent evolution.
Sequence-based typing is essential for understanding the epidemiology of Campylobacter infections, a major worldwide cause of bacterial gastroenteritis. We demonstrate the practical and rapid exploitation of whole-genome sequencing to provide routine definitive characterization of Campylobacter jejuni and Campylobacter coli for clinical and public health purposes. Short-read data from 384 Campylobacter clinical isolates collected over 4 months in Oxford, United Kingdom, were assembled de novo . Contigs were deposited at the pubMLST.org /campylobacter website and automatically annotated for 1,667 loci. Typing and phylogenetic information was extracted and comparative analyses were performed for various subsets of loci, up to the level of the whole genome, using the Genome Comparator and Neighbor-net algorithms. The assembled sequences (for 379 isolates) were diverse and resembled collections from previous studies of human campylobacteriosis. Small subsets of very closely related isolates originated mainly from repeated sampling from the same patients and, in one case, likely laboratory contamination. Much of the within-patient variation occurred in phase-variable genes. Clinically and epidemiologically informative data can be extracted from whole-genome sequence data in real time with straightforward, publicly available tools. These analyses are highly scalable, are transparent, do not require closely related genome reference sequences, and provide improved resolution (i) among Campylobacter clonal complexes and (ii) between very closely related isolates. Additionally, these analyses rapidly differentiated unrelated isolates, allowing the detection of single-strain clusters. The approach is widely applicable to analyses of human bacterial pathogens in real time in clinical laboratories, with little specialist training required.
The myeloid restricted membrane glycoprotein, CD33, is a member of the recently characterized "sialic acidbinding immunoglobulin-related lectin" family. Although CD33 can mediate sialic acid-dependent cell interactions as a recombinant protein, its function in myeloid cells has yet to be determined. Since CD33 contains two potential immunoreceptor tyrosine-based inhibition motifs in its cytoplasmic tail, we investigated whether it might act as a signaling receptor in myeloid cells. Tyrosine phosphorylation of CD33 in myeloid cell lines was stimulated by cell surface cross-linking or by pervanadate, and inhibited by PP2, a specific inhibitor of Src family tyrosine kinases. Phosphorylated CD33 recruited both the protein-tyrosine phosphatases, SHP-1 and SHP-2. CD33 was dephosphorylated in vitro by the co-immunoprecipitated tyrosine phosphatases, suggesting that it might also be an in vivo substrate. The first CD33 phosphotyrosine motif is dominant in CD33-SHP-1/SHP-2 interactions, since mutating tyrosine 340 in a CD33-cytoplasmic tail fusion protein significantly reduced binding to SHP-1 and SHP-2 in THP-1 lysates, while mutation of tyrosine 358 had no effect. Furthermore, the NH 2 -terminal Src homology 2 domain of SHP-1 and SHP-2, believed to be essential for phosphatase activation, selectively bound a CD33 phosphopeptide containing tyrosine 340 but not one containing tyrosine 358. Finally, mutation of tyrosine 340 increased red blood cell binding by CD33 expressed in COS cells. Hence, CD33 signaling through selective recruitment of SHP-1/ SHP-2 may modulate its ligand(s) binding activity.Over the last few years, a novel family of sialic acid-dependent recognition molecules has emerged. This family, recently designated "siglecs" (sialic acid-binding Ig-related lectins), is a structurally related subgroup of the immunoglobulin superfamily that includes CD22 (siglec-2), sialoadhesin (siglec-1), MAG (siglec-4), CD33 (siglec-3), and the newest member of the family, siglec-5 (1). All siglecs have an NH 2 -terminal V-set Ig-like domain that contains the sialic acid binding site, followed by varying numbers of C2-set domains. In addition to common structural features that would appear to adapt these molecules for functional protein-carbohydrate cellular interactions, each member exhibits a very specific pattern of tissue distribution. While CD22 is restricted to B cells, sialoadhesin to macrophages, and myelin-associated glycoprotein (MAG) to myelinating oligodendrocytes and Schwann cells, CD33 and siglec 5 are expressed only on cells of the myelomonocytic lineage.CD22 is perhaps the best characterized member of the siglec family. In addition to being an adhesion receptor for sialic-acid bearing ligands on leukocytes and erythrocytes, CD22 has an important regulatory role as a signal transduction molecule in B cells (2, 3). The cytoplasmic tail of CD22 has six tyrosines, two of which are encompassed within sequences which conform with immunoreceptor tyrosine-based activation motifs (ITAM), 1 while the other four form po...
Genetic attribution of bacterial genotypes has become a major tool in the investigation of the epidemiology of campylobacteriosis and has implicated retail chicken meat as the major source of human infection in several countries. To investigate the robustness of this approach to the provenance of the reference data sets used, a collection of 742 Campylobacter jejuni and 261 Campylobacter coli isolates obtained from United Kingdomsourced chicken meat was established and typed by multilocus sequence typing. Comparative analyses of the data with those from other isolates sourced from a variety of host animals and countries were undertaken by genetic attribution, genealogical, and population genetic approaches. The genotypes from the United Kingdom data set were highly diverse, yet structured into sequence types, clonal complexes, and genealogical groups very similar to those seen in chicken isolates from the Netherlands, the United States, and Senegal, but more distinct from isolates obtained from ruminant, swine, and wild bird sources. Assignment analyses consistently grouped isolates from different host animal sources regardless of geographical source; these associations were more robust than geographic associations across isolates from three continents. We conclude that, notwithstanding the high diversity of these pathogens, there is a strong signal of association of multilocus genotypes with particular hosts, which is greater than the geographic signal. These findings are consistent with local and international transmission of host-associated lineages among food animal species and provide a foundation for further improvements in genetic attribution.Members of the genus Campylobacter, specifically Campylobacter jejuni and Campylobacter coli, are major causes of human morbidity worldwide and are the most common bacterial cause of gastroenteritis in industrialized countries (4). These bacteria are commonly found as apparently harmless members of the gut microbiota of many farmed and wild mammals and birds. This, together with the sporadic nature of most human disease, has contributed to the remaining uncertainty regarding the relative importance of different potential sources of human infection (34), inhibiting the implementation of effective public health interventions, which may have major economic consequences on intensive food production. Human infection with C. jejuni and C. coli has been epidemiologically linked to contact with pets and farm animals and to consumption of red meat, water, milk, and poultry (17,20,34).The advent of multilocus sequence typing (MLST) for both C. jejuni and C. coli and its application to large and diverse isolate collections have enhanced understanding of the ecology (38) and epidemiology (10) of these important pathogens. It has been shown that there is substantial genetic differentiation between farmed ruminants and chickens (29) and even greater differentiation between farmed chickens and wild birds at the same farm site (5). Furthermore, MLST supports the application of popul...
We have identified a gene for the addition of N‐acetylneuraminic acid (Neu5Ac) in an α‐2,3‐linkage to a lactosyl acceptor moiety of the lipopolysaccharide (LPS) of the human pathogen Haemophilus influenzae. The gene is one that was identified previously as a phase‐variable gene known as lic3A. Extracts of H. influenzae, as well as recombinant Escherichia coli strains producing Lic3A, demonstrate sialyltransferase activity in assays using synthetic fluorescent acceptors with a terminal galactosyl, lactosyl or N‐acetyl‐lactosaminyl moiety. In the RM118 strain of H. influenzae, Lic3A activity is modulated by the action of another phase‐variable glycosyltransferase, LgtC, which competes for the same lactosyl acceptor moiety. Structural analysis of LPS from a RM118:lgtC mutant and the non‐typeable strain 486 using mass spectrometry and nuclear magnetic resonance (NMR) spectroscopy confirmed that the major sialylated species has a sialyl‐α‐(2–3)‐lactosyl extension off the distal heptose. This sialylated glycoform was absent in strains containing a lic3A gene disruption. Low amounts of sialylated higher molecular mass glycoforms were present in RM118:lgtC lic3A, indicating the presence of a second sialyltransferase. Lic3A mutants of H. influenzae strains show reduced resistance to the killing effects of normal human serum. Lic3A, encoding an α‐2,3‐sialyltransferase activity, is the first reported phase‐variable sialyltransferase gene.
Campylobacter jejuni and Campylobacter coli are the biggest causes of bacterial gastroenteritis in the developed world, with human infections typically arising from zoonotic transmission associated with infected meat. Because Campylobacter is not thought to survive well outside the gut, host-associated populations are genetically isolated to varying degrees. Therefore, the likely origin of most strains can be determined by host-associated variation in the genome. This is instructive for characterizing the source of human infection. However, some common strains, notably isolates belonging to the ST-21, ST-45 and ST-828 clonal complexes, appear to have broad host ranges, hindering source attribution. Here whole-genome sequencing has the potential to reveal fine-scale genetic structure associated with host specificity. We found that rates of zoonotic transmission among animal host species in these clonal complexes were so high that the signal of host association is all but obliterated, estimating one zoonotic transmission event every 1.6, 1.8 and 12 years in the ST-21, ST-45 and ST828 complexes, respectively. We attributed 89% of clinical cases to a chicken source, 10% to cattle and 1% to pig. Our results reveal that common strains of C. jejuni and C. coli infectious to humans are adapted to a generalist lifestyle, permitting rapid transmission between different hosts. Furthermore, they show that the weak signal of host association within these complexes presents a challenge for pinpointing the source of clinical infections, underlining the view that whole-genome sequencing, powerful though it is, cannot substitute for intensive sampling of suspected transmission reservoirs.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.