Katherine E. Niehaus scite author profile

SummaryBackgroundDiagnosing drug-resistance remains an obstacle to the elimination of tuberculosis. Phenotypic drug-susceptibility testing is slow and expensive, and commercial genotypic assays screen only common resistance-determining mutations. We used whole-genome sequencing to characterise common and rare mutations predicting drug resistance, or consistency with susceptibility, for all first-line and second-line drugs for tuberculosis.MethodsBetween Sept 1, 2010, and Dec 1, 2013, we sequenced a training set of 2099 Mycobacterium tuberculosis genomes. For 23 candidate genes identified from the drug-resistance scientific literature, we algorithmically characterised genetic mutations as not conferring resistance (benign), resistance determinants, or uncharacterised. We then assessed the ability of these characterisations to predict phenotypic drug-susceptibility testing for an independent validation set of 1552 genomes. We sought mutations under similar selection pressure to those characterised as resistance determinants outside candidate genes to account for residual phenotypic resistance.FindingsWe characterised 120 training-set mutations as resistance determining, and 772 as benign. With these mutations, we could predict 89·2% of the validation-set phenotypes with a mean 92·3% sensitivity (95% CI 90·7–93·7) and 98·4% specificity (98·1–98·7). 10·8% of validation-set phenotypes could not be predicted because uncharacterised mutations were present. With an in-silico comparison, characterised resistance determinants had higher sensitivity than the mutations from three line-probe assays (85·1% vs 81·6%). No additional resistance determinants were identified among mutations under selection pressure in non-candidate genes.InterpretationA broad catalogue of genetic mutations enable data from whole-genome sequencing to be used clinically to predict drug resistance, drug susceptibility, or to identify drug phenotypes that cannot yet be genetically predicted. This approach could be integrated into routine diagnostic workflows, phasing out phenotypic drug-susceptibility testing while reporting drug resistance early.

show abstract

Machine Learning and Decision Support in Critical Care

Johnson

et al. 2016

View full text Add to dashboard Cite

Clinical data management systems typically provide caregiver teams with useful information, derived from large, sometimes highly heterogeneous, data sources that are often changing dynamically. Over the last decade there has been a significant surge in interest in using these data sources, from simply re-using the standard clinical databases for event prediction or decision support, to including dynamic and patient-specific information into clinical monitoring and prediction problems. However, in most cases, commercial clinical databases have been designed to document clinical activity for reporting, liability and billing reasons, rather than for developing new algorithms. With increasing excitement surrounding “secondary use of medical records” and “Big Data” analytics, it is important to understand the limitations of current databases and what needs to change in order to enter an era of “precision medicine.” This review article covers many of the issues involved in the collection and preprocessing of critical care data. The three challenges in critical care are considered: compartmentalization, corruption, and complexity. A range of applications addressing these issues are covered, including the modernization of static acuity scoring; on-line patient tracking; personalized prediction and risk assessment; artifact detection; state estimation; and incorporation of multimodal data sources such as genomic and free text data.

show abstract

Risk of Cardiovascular Disease from Antiretroviral Therapy for HIV: A Systematic Review

et al. 2013

View full text Add to dashboard Cite

BackgroundRecent studies suggest certain antiretroviral therapy (ART) drugs are associated with increases in cardiovascular disease.PurposeWe performed a systematic review and meta-analysis to summarize the available evidence, with the goal of elucidating whether specific ART drugs are associated with an increased risk of myocardial infarction (MI).Data SourcesWe searched Medline, Web of Science, the Cochrane Library, and abstract archives from the Conference on Retroviruses and Opportunistic Infections and International AIDS Society up to June 2011 to identify published articles and abstracts.Study SelectionEligible studies were comparative and included MI, strokes, or other cardiovascular events as outcomes.Data ExtractionEligibility screening, data extraction, and quality assessment were performed independently by two investigators.Data SynthesisRandom effects methods and Fisher’s combined probability test were used to summarize evidence.FindingsTwenty-seven studies met inclusion criteria, with 8 contributing to a formal meta-analysis. Findings based on two observational studies indicated an increase in risk of MI for patients recently exposed (usually defined as within last 6 months) to abacavir (RR 1.92, 95% CI 1.51–2.42) and protease inhibitors (PI) (RR 2.13, 95% CI 1.06–4.28). Our analysis also suggested an increased risk associated with each additional year of exposure to indinavir (RR 1.11, 95% CI 1.05–1.17) and lopinavir (RR 1.22, 95% CI 1.01–1.47). Our findings of increased cardiovascular risk from abacavir and PIs were in contrast to four published meta-analyses based on secondary analyses of randomized controlled trials, which found no increased risk from cardiovascular disease.ConclusionAlthough observational studies implicated specific drugs, the evidence is mixed. Further, meta-analyses of randomized trials did not find increased risk from abacavir and PIs. Our findings that implicate specific ARTs in the observational setting provide sufficient evidence to warrant further investigation of this relationship in studies designed for that purpose.

show abstract

Machine learning for classifying tuberculosis drug-resistance from DNA sequencing data

et al. 2017

View full text Add to dashboard Cite

MotivationCorrect and rapid determination of Mycobacterium tuberculosis (MTB) resistance against available tuberculosis (TB) drugs is essential for the control and management of TB. Conventional molecular diagnostic test assumes that the presence of any well-studied single nucleotide polymorphisms is sufficient to cause resistance, which yields low sensitivity for resistance classification.SummaryGiven the availability of DNA sequencing data from MTB, we developed machine learning models for a cohort of 1839 UK bacterial isolates to classify MTB resistance against eight anti-TB drugs (isoniazid, rifampicin, ethambutol, pyrazinamide, ciprofloxacin, moxifloxacin, ofloxacin, streptomycin) and to classify multi-drug resistance.ResultsCompared to previous rules-based approach, the sensitivities from the best-performing models increased by 2-4% for isoniazid, rifampicin and ethambutol to 97% (P < 0.01), respectively; for ciprofloxacin and multi-drug resistant TB, they increased to 96%. For moxifloxacin and ofloxacin, sensitivities increased by 12 and 15% from 83 and 81% based on existing known resistance alleles to 95% and 96% (P < 0.01), respectively. Particularly, our models improved sensitivities compared to the previous rules-based approach by 15 and 24% to 84 and 87% for pyrazinamide and streptomycin (P < 0.01), respectively. The best-performing models increase the area-under-the-ROC curve by 10% for pyrazinamide and streptomycin (P < 0.01), and 4–8% for other drugs (P < 0.01).Availability and implementationThe details of source code are provided at http://www.robots.ox.ac.uk/~davidc/code.php.Supplementary information Supplementary data are available at Bioinformatics online.

show abstract

Machine learning enables detection of early-stage colorectal cancer by whole-genome sequencing of plasma cell-free DNA

et al. 2019

View full text Add to dashboard Cite

Background Blood-based methods using cell-free DNA (cfDNA) are under development as an alternative to existing screening tests. However, early-stage detection of cancer using tumor-derived cfDNA has proven challenging because of the small proportion of cfDNA derived from tumor tissue in early-stage disease. A machine learning approach to discover signatures in cfDNA, potentially reflective of both tumor and non-tumor contributions, may represent a promising direction for the early detection of cancer. Methods Whole-genome sequencing was performed on cfDNA extracted from plasma samples ( N = 546 colorectal cancer and 271 non-cancer controls). Reads aligning to protein-coding gene bodies were extracted, and read counts were normalized. cfDNA tumor fraction was estimated using IchorCNA. Machine learning models were trained using k-fold cross-validation and confounder-based cross-validations to assess generalization performance. Results In a colorectal cancer cohort heavily weighted towards early-stage cancer (80% stage I/II), we achieved a mean AUC of 0.92 (95% CI 0.91–0.93) with a mean sensitivity of 85% (95% CI 83–86%) at 85% specificity. Sensitivity generally increased with tumor stage and increasing tumor fraction. Stratification by age, sequencing batch, and institution demonstrated the impact of these confounders and provided a more accurate assessment of generalization performance. Conclusions A machine learning approach using cfDNA achieved high sensitivity and specificity in a large, predominantly early-stage, colorectal cancer cohort. The possibility of systematic technical and institution-specific biases warrants similar confounder analyses in other studies. Prospective validation of this machine learning method and evaluation of a multi-analyte approach are underway. Electronic supplementary material The online version of this article (10.1186/s12885-019-6003-8) contains supplementary material, which is available to authorized users.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.