Background. Many countries worldwide have reported increasing numbers of emm89 group A Streptococcus (GAS) infections during last decade. Pathogen genetic factors linked to this increase need assessment.Methods. We investigated epidemiological characteristics of emm89 GAS bacteremic infections, including 7-day and 30-day case-fatality rates, in Finland during 2004–2014 and linked them to whole-genome sequencing data obtained from corresponding strains. The Fisher exact test and exact logistic regression were used to compare differences between bacteremic infections due to emm89 GAS belonging to different genetic clades and subclades.Results. Out of 1928 cases of GAS bacteremic infection, 278 were caused by emm89 GAS. We identified 2 genetically distinct clades, arbitrarily designated clade 2 and clade 3. Both clades were present during 2004–2008, but clade 3 increased rapidly from 2009 onward. Six subclades (designated subclades A–F) were identified within clade 3, based on phylogenetic core genome analysis. The case-fatality rate differed significantly between subclades (P < .05), with subclade D having the highest 30-day estimated case-fatality rate (19% vs 3%–14%).Conclusions. A new emm89 clone, clade 3, emerged in 2009 and spread rapidly in Finland. Patients infected with certain subclades of clade 3 were significantly more likely to die. A specific polymerase chain reaction assay was developed to follow the spread of subclade D in 2015.
Knowledge of the genomic variation among different strains of a pathogenic microbial species can help in selecting optimal candidates for diagnostic assays and vaccine development. Pooled sequencing (Pool-seq) is a cost effective approach for population level genetic studies that require large numbers of samples such as various strains of a microbe. To test the use of Pool-seq in identifying variation, we pooled DNA of 100 Streptococcus pyogenes strains of different emm types in two pools, each containing 50 strains. We used four variant calling tools (Freebayes, UnifiedGenotyper, SNVer, and SAMtools) and one emm1 strain, SF370, as a reference genome. In total 63719 SNPs and 164 INDELs were identified in the two pools concordantly by at least two of the tools. Majority of the variants (93.4%) from six individually sequenced strains used in the pools could be identified from the two pools and 72.3% and 97.4% of the variants in the pools could be mined from the analysis of the 44 complete Str. pyogenes genomes and 3407 sequence runs deposited in the European Nucleotide Archive respectively. We conclude that DNA sequencing of pooled samples of large numbers of bacterial strains is a robust, rapid and cost-efficient way to discover sequence variation.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.