“…Large language models (LLMs), such as ChatGPT (Brown et al, 2020; OpenAI, 2023b), LLaMA (Touvron et al, 2023a), and PaLM (Chowdhery et al, 2022), are increasingly being recognized for their potential in healthcare to aid clinical decision-making and provide innovative solutions for complex healthcare problems (Patel et al, 2023; Shen et al, 2023), e.g., discharge summary generation (Patel and Lam, 2023), health education (Safranek et al, 2023), and care planning (Fleming et al, 2023). Several recent efforts have been made to fine-tune publicly available general LLMs, e.g., LLaMA (Touvron et al, 2023b) and ChatGLM (Tsinghua KEG, 2023), to develop medical LLMs (Singhal et al, 2023a,c), resulting in ChatDoctor (Li et al, 2023b), MedAlpaca (Han et al, 2023), BenTsao (Wang et al, 2023a), and ClinicalCamel (Toma et al, 2023). Previous research shows that medical LLMs outperform human experts across a variety of medical tasks.…”