2023
DOI: 10.1371/journal.pdig.0000198

Performance of ChatGPT on USMLE: Potential for AI-assisted medical education using large language models

Abstract: We evaluated the performance of a large language model called ChatGPT on the United States Medical Licensing Exam (USMLE), which consists of three exams: Step 1, Step 2CK, and Step 3. ChatGPT performed at or near the passing threshold for all three exams without any specialized training or reinforcement. Additionally, ChatGPT demonstrated a high level of concordance and insight in its explanations. These results suggest that large language models may have the potential to assist with medical education, and pot…


Cited by 1,438 publications (1,042 citation statements)
References 20 publications
“…This raises concerns about fair access to the same educational material despite using the same prompt. For instance, Kung et al (2023) found the accuracy of ChatGPT to be around 60%, demanding careful assessment of its output before use. Therefore, more research should be focused on ensuring fairness, accuracy, and equity among students using chatbots generally and ChatGPT particularly, which might be achieved through, for instance, having transparent and open algorithms (Bulathwela et al, 2020).…”
Section: Nothing Should Be Taken For Granted (mentioning)
confidence: 99%
“…Moreover, ChatGPT showed moderate accuracy to determine the imaging steps needed in breast cancer screening and evaluation of breast pain which can be a promising application in decision making in radiology [50]. There is also the prospects of personalized medicine and improved health literacy by providing easily accessible and understandable health information for the general public [52,54,59,61,72]. This utility was demonstrated by ChatGPT responses highlighting the need to consult healthcare providers among other reliable sources on specific situations [16,56].…”
Section: Discussion (mentioning)
confidence: 99%
“…ChatGPT will change the pedagogy in management education. For example, formative assessments (including assignment reports) to evaluate how someone is learning from the material throughout a course is not a promising idea now (Kung et al, 2023). It is time to define guidelines to use ChatGPT in education.…”
Section: Role of AI and GPT in Academic Practices (mentioning)
confidence: 99%