2018
DOI: 10.1038/s41467-018-06634-y
|View full text |Cite
|
Sign up to set email alerts
|

Machine learning and structural analysis of Mycobacterium tuberculosis pan-genome identifies genetic signatures of antibiotic resistance

Abstract: Mycobacterium tuberculosis is a serious human pathogen threat exhibiting complex evolution of antimicrobial resistance (AMR). Accordingly, the many publicly available datasets describing its AMR characteristics demand disparate data-type analyses. Here, we develop a reference strain-agnostic computational platform that uses machine learning approaches, complemented by both genetic interaction analysis and 3D structural mutation-mapping, to identify signatures of AMR evolution to 13 antibiotics. This platform i… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

4
121
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
3
2
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 132 publications
(125 citation statements)
references
References 62 publications
(50 reference statements)
4
121
0
Order By: Relevance
“…These mutations occur in non-AMR genes in order to accommodate the potentially reduced fitness cost of maintaining the primary AMR conferring genes or SNPs [67][68][69][70] . The results of this study, and previous AI studies focusing on AMR 25,27 , suggest that these changes could be widespread throughout the genome. Indeed, although each model had a few genes with very high total feature importance values, most genes were contributing k-mers to the decision trees for each model.…”
Section: Discussionsupporting
confidence: 68%
See 2 more Smart Citations
“…These mutations occur in non-AMR genes in order to accommodate the potentially reduced fitness cost of maintaining the primary AMR conferring genes or SNPs [67][68][69][70] . The results of this study, and previous AI studies focusing on AMR 25,27 , suggest that these changes could be widespread throughout the genome. Indeed, although each model had a few genes with very high total feature importance values, most genes were contributing k-mers to the decision trees for each model.…”
Section: Discussionsupporting
confidence: 68%
“…The average very major error rate (VME), which is defined as resistant genomes that are erroneously predicted to be susceptible, and the average major error rate (ME), which is defined as susceptible genomes that are erroneously predicted to be resistant, tend to go down as gene set size increases. Although the core gene set models described in Figure 1 have lower F1 scores and higher error rates than full-genome models that have been published previously [21][22][23][24]27,29 , their accuracies are striking given the small sizes of the input data sets and the removal of well-annotated AMR genes.…”
Section: Amr Models Based On Core Genes Have Predictive Powermentioning
confidence: 84%
See 1 more Smart Citation
“…Databases such as the Pathosystems Resource Integration Center (PATRIC) (Wattam et al ; Davis et al ; Wattam et al ) and the Antibiotic Resistance Genes Database (ARDB) (Liu and Pop ) currently provide genomes with AR metadata. In some recent studies (Her and Wu ; Kavvas et al ; Moradigaravand et al ), machine learning techniques were applied to find resistance to varieties of antibiotics from whole genome sequences using databases of known AR genes. These studies first constructed a pan‐genome with identifying core and accessory gene clusters, then computed features such as presence–absence patterns and the population structures of gene clusters, and finally applied a machine learning algorithm to identity putative AR genes using the extracted features.…”
Section: Introductionmentioning
confidence: 99%
“…(12) Compounding the general problem of antibiotic resistance are challenges in developing and approving new antibiotics. (13) This has spurred research into understanding resistance mechanisms, (14,15) host defences, (16)(17)(18) diagnostics (19) and antibiotic generation. (20,21) Ultimately, without a rapid and perhaps general method to develop new targeted antibiotics, therapies might relapse towards those of the pre-antibiotic era.…”
Section: Introductionmentioning
confidence: 99%