2023
DOI: 10.1371/journal.pcbi.1011648
|View full text |Cite
|
Sign up to set email alerts
|

The MAGMA pipeline for comprehensive genomic analyses of clinical Mycobacterium tuberculosis samples

Tim H. Heupink,
Lennert Verboven,
Abhinav Sharma
et al.

Abstract: Background Whole genome sequencing (WGS) holds great potential for the management and control of tuberculosis. Accurate analysis of samples with low mycobacterial burden, which are characterized by low (<20x) coverage and high (>40%) levels of contamination, is challenging. We created the MAGMA (Maximum Accessible Genome for Mtb Analysis) bioinformatics pipeline for analysis of clinical Mtb samples. Methods and results High accuracy variant calling is achieved by using a long seedlength during read map… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
7
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
4
1

Relationship

1
4

Authors

Journals

citations
Cited by 5 publications
(7 citation statements)
references
References 50 publications
(76 reference statements)
0
7
0
Order By: Relevance
“…For snpCL28, the variant in position 1028437 c > T which defines snpCL28 is located in a transposase gene (Rv0922), and for snpCL29, the variant in position 3640351 c > T would be excluded in most SNP routine calling algorithms [ 12 ]. Other researchers have also observed that reliable calling of SNPs in these generally excluded regions is possible [ 13 , 14 ]. Importantly, nine of the 30 characteristic SNPs identified occur outside annotated reading frames and thus would not be captured using most core genome multi locus sequence typing (MLST) typing systems.…”
Section: Discussionmentioning
confidence: 99%
“…For snpCL28, the variant in position 1028437 c > T which defines snpCL28 is located in a transposase gene (Rv0922), and for snpCL29, the variant in position 3640351 c > T would be excluded in most SNP routine calling algorithms [ 12 ]. Other researchers have also observed that reliable calling of SNPs in these generally excluded regions is possible [ 13 , 14 ]. Importantly, nine of the 30 characteristic SNPs identified occur outside annotated reading frames and thus would not be captured using most core genome multi locus sequence typing (MLST) typing systems.…”
Section: Discussionmentioning
confidence: 99%
“…This is supported by the increasing number of unclassified reads in dMDA samples, particularly noticeable with very low input Mtb -DNA, as evidenced by the taxonomic classification tool, Kraken 2's, inability to assign a taxonomic origin to increasing proportions of reads. There is thus a compelling need for the use of Mtb WGS analysis pipelines, like MAGMA 19 , 20 , that are capable of handling contaminants, whether originating from genuine contamination or artificially introduced MDA artefacts. While MDA has not yet been evaluated for NGS of Mtb , MDA has been used successfully to increase the amount of genomic DNA from clinical smear-negative sputum specimens before processing the DNA with standard IS 6110 -specific PCR methods 30 .…”
Section: Discussionmentioning
confidence: 99%
“…The raw Illumina WGS reads (FASTQ) from the four dMDA and positive control sample were analysed using the MAGMA bioinformatics pipeline which aligns reads to the Mtb H37Rv (NC000962.3) reference genome for variant identification 19 , 20 . Variants called through the major variants workflow of MAGMA were used for subsequent analyses.…”
Section: Methodsmentioning
confidence: 99%
“…(3) MAGMA Analysis MAGMA analysis is a gene-based analysis model that uses multiple linear principal components regression to analyze SNP function to obtain a gene and pathway analysis tool. The MAGMA analyses were performed on the software named magma ( https://ctg.cncr.nl/software/mactg.cncr.nl/software/ma ) [19] .…”
Section: Methods (1) Mendelian Randomizationmentioning
confidence: 99%