Challenges and barriers of using large language models (LLM) such as ChatGPT for diagnostic medicine with a focus on digital pathology – a recent scoping review

Ullah, Ehsan; Parwani, Anil; Baig, Mirza Mansoor; Singh, Rajendra

doi:10.1186/s13000-024-01464-7

Cited by 12 publications

(4 citation statements)

References 21 publications

“…Additional guardrails and training incorporating human feedback is envisioned to improve LLM and MLLM performance, while reducing the incidence of erroneous responses [93]. Some errors produced by LLMs may be attributable to biases and errors present in its training set, so efforts will be undertaken to improve training data quality [111]. More publicly available pathology text and text/image datasets will become available, which will open up opportunities for training more accurate and powerful models.…”

Section: Future Directions (mentioning)

confidence: 99%

Applications of Large Language Models in Pathology

Cheng

2024

Bioengineering

Large language models (LLMs) are transformer-based neural networks that can provide human-like responses to questions and instructions. LLMs can generate educational material, summarize text, extract structured data from free text, create reports, write programs, and potentially assist in case sign-out. LLMs combined with vision models can assist in interpreting histopathology images. LLMs have immense potential in transforming pathology practice and education, but these models are not infallible, so any artificial intelligence generated content must be verified with reputable sources. Caution must be exercised on how these models are integrated into clinical practice, as these models can produce hallucinations and incorrect results, and an over-reliance on artificial intelligence may lead to de-skilling and automation bias. This review paper provides a brief history of LLMs and highlights several use cases for LLMs in the field of pathology.

“…Doubts also persist about the dependability of their outputs for making clinical decisions [6, 7]. As LLMs become more common in healthcare, the necessity to test their applications increases. This review evaluates the application of LLMs in the field of hematology, systematically assessing their benefits, limitations, and potential risks in medical training, education, and diagnosis.…”

Section: Introduction (mentioning)

confidence: 99%

Exploring the role of Large Language Models (LLMs) in hematology: a systematic review of applications, benefits, and limitations

Mudrik, Nadkarni, Efros et al.

2024

Preprint

Rationale and Objectives: Large Language Models (LLMs) have the potential to enhance medical training, education, and diagnosis. However, since these models were not originally designed for medical purposes, there are concerns regarding their reliability and safety in clinical settings. This review systematically assesses the utility, advantages, and potential risks of employing LLMs in the field of hematology. Materials and Methods: We searched PubMed, Web of Science, and Scopus databases for original publications on LLM applications in hematology. We limited the search to articles published in English from December 1, 2022 to March 25, 2024, coinciding with the introduction of ChatGPT. To evaluate the risk of bias, we used the adapted version of the Quality Assessment of Diagnostic Accuracy Studies criteria (QUADAS-2). Results: Eleven studies fulfilled the eligibility criteria. The studies varied in their goals and methods, covering medical education, diagnosis, and clinical practice. GPT-3.5 and GPT-4 demonstrated superior performance in diagnostic tasks and medical information propagation compared to other models like Google's Bard (currently called Gemini). GPT-4 demonstrated particularly high accuracy in tasks such as interpreting hematology cases and diagnosing hemoglobinopathy, with performance metrics of 76% diagnostic accuracy and 88% accuracy in identifying normal blood cells. However, the study also revealed discrepancies in model consistency and the accuracy of provided references, indicating variability in their reliability. Conclusion: While LLMs present significant opportunities for advancing clinical hematology, their incorporation into medical practice requires careful evaluation of their benefits and limitations. Key Words: Hematology; Large Language Models; ChatGPT; Microsoft Bing; Google Bard; PaLM; LLaMA.

“…In this way, LLM responses can be modulated and compared after being fed with accurate data and evidence-based clinical practice guidelines to meet patient needs (26). A recent scoping review by Ullah et al assessed the challenges and barriers to using LLMs in diagnostic medicine in the field of pathology (27). Many language models, such as Claude, Command, and Bloomz, have been programmed for creating accurate medical advice (28) (Figure 2).…”

Section: Introduction (mentioning)

confidence: 99%

Large language models in physical therapy: time to adapt and adept

Naqvi, Shaikh, Mishra

2024

Front. Public Health

Healthcare is experiencing a transformative phase, with artificial intelligence (AI) and machine learning (ML). Physical therapists (PTs) stand on the brink of a paradigm shift in education, practice, and research. Rather than visualizing AI as a threat, it presents an opportunity to revolutionize. This paper examines how large language models (LLMs), such as ChatGPT and BioMedLM, driven by deep ML can offer human-like performance but face challenges in accuracy due to vast data in PT and rehabilitation practice. PTs can benefit by developing and training an LLM specifically for streamlining administrative tasks, connecting globally, and customizing treatments using LLMs. However, human touch and creativity remain invaluable. This paper urges PTs to engage in learning and shaping AI models by highlighting the need for ethical use and human supervision to address potential biases. Embracing AI as a contributor, and not just a user, is crucial by integrating AI, fostering collaboration for a future in which AI enriches the PT field provided data accuracy, and the challenges associated with feeding the AI model are sensitively addressed.
