2025
Comparative Evaluation of Large Language and Multimodal Models in Detecting Spinal Stabilization Systems on X-Ray Images
Abstract: Background/Objectives: Open-source AI models are increasingly applied in medical imaging, yet their effectiveness in detecting and classifying spinal stabilization systems remains underexplored. This study compares ChatGPT-4o (a large language model) and BiomedCLIP (a multimodal model) in their analysis of posturographic X-ray images (AP projection) to assess their accuracy in identifying the presence, type (growing vs. non-growing), and specific system (MCGR vs. PSF). Methods: A dataset of 270 X-ray images (9…
Search citation statements
Paper Sections
Select...
4
0
0
0
Citation Types
0
1
0
0
Year Published
2025
2026
Publication Types
Select...
4
Relationship
0
4
Authors
Journals
Cited by 2 publications
(1 citation statement)
References 38 publications
0
1
0
0
“…Commercial models such as GPT-4 have been evaluated and demonstrated to be able to generate accurate, safe, and helpful neurosurgical information [ 37 ]. Therefore, LLMs hold significant promise in streamlining and enhancing the workflow for various pediatric spine care [ 38 , 39 , 40 ]. In the future, these may include a surgeon performance program from the Setting Scoliosis Straight Foundation or registry participation in Harms, or the Pediatric Spine Study Group.…”
Section: Large Language Models (Llm) In Practicementioning
confidence: 99%
“…Commercial models such as GPT-4 have been evaluated and demonstrated to be able to generate accurate, safe, and helpful neurosurgical information [ 37 ]. Therefore, LLMs hold significant promise in streamlining and enhancing the workflow for various pediatric spine care [ 38 , 39 , 40 ]. In the future, these may include a surgeon performance program from the Setting Scoliosis Straight Foundation or registry participation in Harms, or the Pediatric Spine Study Group.…”
Section: Large Language Models (Llm) In Practicementioning
confidence: 99%
