2023
DOI: 10.1101/2023.11.07.23298133
Preprint

Capability of GPT-4V(ision) in Japanese National Medical Licensing Examination

Takahiro Nakao,
Soichiro Miki,
Yuta Nakamura
et al.

Abstract: Background: Previous research applying large language models (LLMs) to medicine was focused on text-based information. Recently, multimodal variants of LLMs acquired the capability of recognizing images. Objective: To evaluate the capability of GPT-4V, a recent multimodal LLM developed by OpenAI, in recognizing images in the medical field by testing its capability to answer questions in the 117th Japanese National Medical Licensing Examination. Methods: We focused on 108 questions that had one or more images as part o…

Cited by 6 publications (3 citation statements)
References 19 publications
“…A total of 557 case reports were identified. The exclusion criteria were carefully chosen based on previous studies for CDSSs [32] and ChatGPT-4V [28] to ensure the focus remained on diagnostically challenging adult cases with relevant image data. Specifically, cases were excluded for the following reasons: nondiagnosis (130 cases), patients younger than 10 years (35 cases), and the absence of image data (29 cases).…”
Section: Methods (mentioning)
confidence: 99%
“…Preliminary studies in various fields, including medicine [26-28] and others [29-31], have shown the effectiveness of ChatGPT-4V. Some of these studies have highlighted its efficacy in interpreting medical images [26, 28], though they were limited in scope. However, clinical image data includes a wide range of elements, from physical examinations to various investigation results.…”
Section: Introduction (mentioning)
confidence: 99%
“…Recent studies have further explored the diagnostic application of multimodal LLMs (also called 'vision-language models') that are able to ingest not only text but also image data as input (12-20). However, several studies demonstrated low performance of Generative Pretrained Transformer 4 Vision (GPT-4V) by OpenAI in differential diagnosis based on various types of radiological images (12, 16, 18, 20, 21).…”
Section: Introduction (mentioning)
confidence: 99%