2023
DOI: 10.1097/jcma.0000000000000946
|View full text |Cite
|
Sign up to set email alerts
|

ChatGPT failed Taiwan’s Family Medicine Board Exam

Abstract: Background: Chat Generative Pre-trained Transformer (ChatGPT), OpenAI Limited Partnership, San Francisco, CA, USA is an artificial intelligence language model gaining popularity because of its large database and ability to interpret and respond to various queries. Although it has been tested by researchers in different fields, its performance varies depending on the domain. We aimed to further test its ability in the medical field. Methods: We used ques… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

1
25
1

Year Published

2023
2023
2024
2024

Publication Types

Select...
10

Relationship

1
9

Authors

Journals

citations
Cited by 54 publications
(27 citation statements)
references
References 20 publications
1
25
1
Order By: Relevance
“…It did not pass the 2023 Japanese National Medical Licensing Examination with an overall correct answer rate of 55.0%. Furthermore, it did not succeed in the Taiwan Family Medicine Board Exam, 24 Taiwan internal medicine exams, 25 the Taiwan Pharmacist Licensing Examination, 21 Chinese Medical Licensing Examination, Chinese Pharmacist Licensing Examination, and Chinese Nurse Licensing Examination, 26 and the Chinese medical licensing exams in simplified Chinese. 17 Nevertheless, our results indicate that ChatGPT attained an accuracy of up to 93.75% in the Taiwan medical licensing exams, though there was a noticeable drop in performance in the July 2023 exam.…”
Section: Discussionmentioning
confidence: 98%
“…It did not pass the 2023 Japanese National Medical Licensing Examination with an overall correct answer rate of 55.0%. Furthermore, it did not succeed in the Taiwan Family Medicine Board Exam, 24 Taiwan internal medicine exams, 25 the Taiwan Pharmacist Licensing Examination, 21 Chinese Medical Licensing Examination, Chinese Pharmacist Licensing Examination, and Chinese Nurse Licensing Examination, 26 and the Chinese medical licensing exams in simplified Chinese. 17 Nevertheless, our results indicate that ChatGPT attained an accuracy of up to 93.75% in the Taiwan medical licensing exams, though there was a noticeable drop in performance in the July 2023 exam.…”
Section: Discussionmentioning
confidence: 98%
“…Therefore, it supported the fact that many attending staff and residents felt that the response by ChatGPT was superficial and did not show a deep understanding of the topic. For more advanced examination levels, such as resident-level examinations, ChatGPT performed more poorly [7,34,35]. For example, ChatGPT's score in the plastic surgery in-training examination was ranked at the 49th percentile compared with first-year residents but significantly worse than fifth-and sixth-year residents at the zeroth percentile [9].…”
Section: Discussionmentioning
confidence: 98%
“…Nuanced discrepancies in grammar rules and other aspects between the Chinese and English languages might affect ChatGPT’s effectiveness when used with Chinese. The current performance is restricted by the corpus, and further optimization and adjustment are required [ 17 ]. Consequently, the findings of this study provide an incomplete representation of ChatGPT’s overall performance level.…”
Section: Discussionmentioning
confidence: 99%