A year of genomic surveillance reveals how the SARS-CoV-2 pandemic unfolded in Africa
Background The Public Health Alliance for Genomic Epidemiology (PHA4GE) (https://pha4ge.org) is a global coalition that is actively working to establish consensus standards, document and share best practices, improve the availability of critical bioinformatics tools and resources, and advocate for greater openness, interoperability, accessibility, and reproducibility in public health microbial bioinformatics. In the face of the current pandemic, PHA4GE has identified a need for a fit-for-purpose, open-source SARS-CoV-2 contextual data standard. Results As such, we have developed a SARS-CoV-2 contextual data specification package based on harmonizable, publicly available community standards. The specification can be implemented via a collection template, as well as an array of protocols and tools to support both the harmonization and submission of sequence data and contextual information to public biorepositories. Conclusions Well-structured, rich contextual data add value, promote reuse, and enable aggregation and integration of disparate datasets. Adoption of the proposed standard and practices will better enable interoperability between datasets and systems, improve the consistency and utility of generated data, and ultimately facilitate novel insights and discoveries in SARS-CoV-2 and COVID-19. The package is now supported by the NCBI’s BioSample database.
The Public Health Alliance for Genomic Epidemiology (PHA4GE) (https://pha4ge.org) is a global coalition that is actively working to establish consensus standards, document and share best practices, improve the availability of critical bioinformatic tools and resources, and advocate for greater openness, interoperability, accessibility and reproducibility in public health microbial bioinformatics. In the face of the current pandemic, PHA4GE has identified a clear and present need for a fit-for-purpose, open source SARS-CoV-2 contextual data standard. As such, we have developed an extension to the INSDC pathogen package, providing a SARS-CoV-2 contextual data specification based on harmonisable, publicly available, community standards. The specification is implementable via a collection template, as well as an array of protocols and tools to support the harmonisation and submission of sequence data and contextual information to public repositories. Well-structured, rich contextual data adds value, promotes reuse, and enables aggregation and integration of disparate data sets. Adoption of the proposed standard and practices will better enable interoperability between datasets and systems, improve the consistency and utility of generated data, and ultimately facilitate novel insights and discoveries in SARS-CoV-2 and COVID-19.
While investigating a signal of adaptive evolution in humans at the gene LARGE, we encountered an intriguing finding by Dr. Stefan Kunz that the gene plays a critical role in Lassa virus binding and entry. This led us to pursue field work to test our hypothesis that natural selection acting on LARGE—detected in the Yoruba population of Nigeria—conferred resistance to Lassa Fever in some West African populations. As we delved further, we conjectured that the “emerging” nature of recently discovered diseases like Lassa fever is related to a newfound capacity for detection, rather than a novel viral presence, and that humans have in fact been exposed to the viruses that cause such diseases for much longer than previously suspected. Dr. Stefan Kunz’s critical efforts not only laid the groundwork for this discovery, but also inspired and catalyzed a series of events that birthed Sentinel, an ambitious and large-scale pandemic prevention effort in West Africa. Sentinel aims to detect and characterize deadly pathogens before they spread across the globe, through implementation of its three fundamental pillars: Detect, Connect, and Empower. More specifically, Sentinel is designed to detect known and novel infections rapidly, connect and share information in real time to identify emerging threats, and empower the public health community to improve pandemic preparedness and response anywhere in the world. We are proud to dedicate this work to Stefan Kunz, and eagerly invite new collaborators, experts, and others to join us in our efforts.
The rampaging effect of coronavirus disease (COVID‐19) in Africa is huge and have impacted almost every area of life. Across African states, there exist variations in the laboratory measures adopted, and these heterogeneous approaches, in turn, determines the successes or otherwise recorded. In this study, we assessed the various forms of laboratory responses to the containment, risk analyses, structures and features of COVID‐19 in high incidence African countries (Nigeria, South Africa, Egypt, Ghana, Algeria, Morocco, etc.) to aid better and efficient laboratory responses to the highly infectious diseases.
Next generation sequencing technologies are becoming more accessible and affordable over the years, with entire genome sequences of several pathogens being deciphered in few hours. However, there is the need to analyze multiple genomes within a short time, in order to provide critical information about a pathogen of interest such as drug resistance, mutations and genetic relationship of isolates in an outbreak setting. Many pipelines that currently do this are stand-alone workflows and require huge computational requirements to analyze multiple genomes. We present an automated and scalable pipeline called BAGEP for monomorphic bacteria that performs quality control on FASTQ paired end files, scan reads for contaminants using a taxonomic classifier, maps reads to a reference genome of choice for variant detection, detects antimicrobial resistant (AMR) genes, constructs a phylogenetic tree from core genome alignments and provide interactive short nucleotide polymorphism (SNP) visualization across core genomes in the data set. The objective of our research was to create an easy-to-use pipeline from existing bioinformatics tools that can be deployed on a personal computer. The pipeline was built on the Snakemake framework and utilizes existing tools for each processing step: fastp for quality trimming, snippy for variant calling, Centrifuge for taxonomic classification, Abricate for AMR gene detection, snippy-core for generating whole and core genome alignments, IQ-TREE for phylogenetic tree construction and vcfR for an interactive heatmap visualization which shows SNPs at specific locations across the genomes. BAGEP was successfully tested and validated with Mycobacterium tuberculosis (n = 20) and Salmonella enterica serovar Typhi (n = 20) genomes which are about 4.4 million and 4.8 million base pairs, respectively. Running these test data on a 8 GB RAM, 2.5 GHz quad core laptop took 122 and 61 minutes on respective data sets to complete the analysis. BAGEP is a fast, calls accurate SNPs and an easy to run pipeline that can be executed on a mid-range laptop; it is freely available on: https://github.com/idolawoye/BAGEP.
Identifying the dissemination patterns and impacts of a virus of economic or health importance during a pandemic is crucial, as it informs the public on policies for containment in order to reduce the spread of the virus. In this study, we integrated genomic and travel data to investigate the emergence and spread of the SARS-CoV-2 B.1.1.318 and B.1.525 (Eta) variants of interest in Nigeria and the wider Africa region. By integrating travel data and phylogeographic reconstructions, we find that these two variants that arose during the second wave in Nigeria emerged from within Africa, with the B.1.525 from Nigeria, and then spread to other parts of the world. Data from this study show how regional connectivity of Nigeria drove the spread of these variants of interest to surrounding countries and those connected by air-traffic. Our findings demonstrate the power of genomic analysis when combined with mobility and epidemiological data to identify the drivers of transmission, as bidirectional transmission within and between African nations are grossly underestimated as seen in our import risk index estimates.
Multi-drug (MDR) and extensively drug-resistant (XDR) tuberculosis (TB) continues to be a global public health problem especially in high TB burden countries like Nigeria. Many of these cases are undetected and go on to infect high risk individuals. Clinical samples from positive rifampicin resistant Xpert®MTB/Rif assay were subjected to direct whole genome sequencing and bioinformatics analysis to identify the full antibiotics resistance and lineage profile. We report two (2) XDR TB samples also belonging to the East-Asian/Beijing family of lineage 2 Mycobacterium tuberculosis complex from clinical samples in Nigeria. Our findings further reveal the presence of mutations that confer resistance to first-line drugs (rifampicin, isoniazid, ethambutol and pyrazanimide), second-line injectables (capreomycin, streptomycin, kanamycin and/or amikacin) and at least one of the fluoroquinolones (ofloxacin, moxifloxacin, levofloxacin and/or ciprofloxacin) in both samples. The genomic sequence data from this study not only provide the first evidence of XDR TB in Nigeria and West Africa, but also emphasize the importance of WGS in accurately detecting MDR and XDR TB, to ensure adequate and proper management treatment regimens for affected individuals. This will greatly aid in preventing the spread of drug resistance TB in high burden countries.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.