2020
DOI: 10.1136/bmj.m689

Artificial intelligence versus clinicians: systematic review of design, reporting standards, and claims of deep learning studies

Abstract: Objective To systematically examine the design, reporting standards, risk of bias, and claims of studies comparing the performance of diagnostic deep learning algorithms for medical imaging with that of expert clinicians. Design Systematic review. Data sources Medline, Embase, Cochrane Central Register of Controlled Trials, and the World Health Organization trial registry from 2010 to June 2019. …

Cited by 641 publications (576 citation statements)
References 35 publications
“…It has been recognised that most recent AI studies are inadequately reported and existing reporting guidelines do not fully cover potential sources of bias specific to AI systems [25]. The welcome emergence of randomised controlled trials (RCTs) seeking to evaluate newer interventions based on, or including, an AI component (hereafter “AI interventions”) [23, 26-31] has similarly been met with concerns about the design and reporting [25, 32-34]. This has highlighted the need to provide reporting guidance that is “fit-for-purpose” in this domain.…”
Section: Introduction (mentioning)
confidence: 99%
“…We will develop a data extraction instrument for study data based on several previous systematic reviews of prediction models [51][52][53]. As the reviewers have different levels of experience and knowledge, the items listed will be reviewed and discussed to ensure that all reviewers have a clear understanding of the procedures.…”
Section: Data Collection - Data Extraction (mentioning)
confidence: 99%
“…However, they include many items that may not be available for prediction models built with traditional statistical methods based on clinical characteristics, laboratory examinations, or genetic factors. On the other hand, TRIPOD, CHARMS, and PROBAST have already been shown to be suitable for assessing prediction models that use artificial intelligence methods [53]. Thus, we will choose these three more widely adopted and extensively accepted tools to develop our critical appraisal instrument.…”
Section: Critical Appraisal (mentioning)
confidence: 99%
“…It has been recognised that most recent AI studies are inadequately reported and existing reporting guidelines do not fully cover potential sources of bias specific to AI systems [17]. The welcome emergence of randomised controlled trials (RCTs) seeking to evaluate the clinical efficacy of newer interventions based on, or including, an AI component (hereafter ‘AI interventions’) [15, 18-23] has similarly been met with concerns about design and reporting [17, 24-26]. This has highlighted the need to provide reporting guidance that is ‘fit-for-purpose’ in this domain.…”
Section: Introduction (mentioning)
confidence: 99%