2024
DOI: 10.1101/2024.03.12.24303785
Preprint

Influence of a Large Language Model on Diagnostic Reasoning: A Randomized Clinical Vignette Study

Ethan Goh,
Robert Gallo,
Jason Hom
et al.

Abstract: Importance: Diagnostic errors are common and cause significant morbidity. Large language models (LLMs) have shown promise in their performance on both multiple-choice and open-ended medical reasoning examinations, but it remains unknown whether the use of such tools improves physician performance. Objective: To assess the impact of the GPT-4 LLM on physicians' diagnostic reasoning compared with conventional resources. Design: Multi-center, randomized clinical vignette study. Setting: The study was conducted using…

Cited by 4 publications
(3 citation statements)
References 37 publications
“…However, regardless of the specific performance metrics of any LLM-based tool [16], correct tool usage remains crucial. Effective prompting strategies [17, 18] and appropriate application by users are essential for optimizing the performance of these tools [19, 20].…”
Section: Discussion (mentioning)
confidence: 99%
“…There is also significant practical interest in examining whether ChatGPT exhibits a more pronounced beneficial effect on diagnostic accuracy and the quantity of differential diagnoses considered, potentially attributable to its heightened computational capabilities [12]. Additionally, we seek to assess whether brief instructional training emphasising the importance of expanding the hypothesis space augments these effects. To achieve this, our primary focus is on modelling the dependent variables diagnostic accuracy and number of generated differential diagnoses using linear mixed-effects models [54] in R [55].…”
Section: Methods and Analysis (mentioning)
confidence: 99%
“…[19, 23] It is, therefore, imperative to comprehensively explore the extent, application and constraints of LLMs in clinical decision support to guarantee their conscientious and efficient implementation in practice [12, 18, 24, 25]. To address these concerns, this prospective, randomised controlled clinical vignette study examines the influence of decision support using an LLM (ChatGPT) on the diagnostic process and outcomes compared with that of a human coach. This will advance the understanding of how human–AI collaboration can be leveraged to enhance diagnostic decision-making.…”
Section: Introduction (mentioning)
confidence: 99%