CC BY-NC-ND 4.0 · Appl Clin Inform 2024; 15(02): 306-312
DOI: 10.1055/a-2281-7092
Research Article

A Survey of Clinicians' Views of the Utility of Large Language Models

Matthew Spotnitz
1   Department of Biomedical Informatics, Columbia University Irving Medical Center, New York, New York, United States
,
Betina Idnay
1   Department of Biomedical Informatics, Columbia University Irving Medical Center, New York, New York, United States
,
Emily R. Gordon
1   Department of Biomedical Informatics, Columbia University Irving Medical Center, New York, New York, United States
2   Department of Dermatology, Vagelos College of Physicians and Surgeons, Columbia University Irving Medical Center, New York, New York, United States
,
Rebecca Shyu
1   Department of Biomedical Informatics, Columbia University Irving Medical Center, New York, New York, United States
,
Gongbo Zhang
1   Department of Biomedical Informatics, Columbia University Irving Medical Center, New York, New York, United States
,
Cong Liu
1   Department of Biomedical Informatics, Columbia University Irving Medical Center, New York, New York, United States
,
James J. Cimino
1   Department of Biomedical Informatics, Columbia University Irving Medical Center, New York, New York, United States
3   Department of Biomedical Informatics and Data Science, Informatics Institute, Heersink School of Medicine, University of Alabama at Birmingham, Birmingham, Alabama, United States
,
Chunhua Weng
1   Department of Biomedical Informatics, Columbia University Irving Medical Center, New York, New York, United States
Funding This work was supported by National Library of Medicine (NLM) grants R01LM014344 and R01LM009886 to C.W., National Human Genome Research Institute grant R01HG012655 to C.L., and by National Center for Advancing Translational Sciences grant UL1TR001873 to Columbia University Irving Medical Center. B.I. and R.S. acknowledge support from NLM grant T15LM007079.

Abstract

Objectives Large language models (LLMs), such as the generative pretrained transformer (GPT) models underlying ChatGPT, are powerful algorithms that have been shown to produce human-like text from input data. Several potential clinical applications of this technology have been proposed and evaluated by biomedical informatics experts. However, few studies have surveyed health care providers for their opinions about whether the technology is fit for use.

Methods We distributed a validated mixed-methods survey to gauge practicing clinicians' comfort with LLMs across a breadth of clinical practice, research, and education tasks selected from the literature.

Results A total of 30 clinicians fully completed the survey. Of the 23 tasks, 16 were rated positively by more than 50% of the respondents. Based on our qualitative analysis, health care providers considered LLMs to have excellent synthesis skills and efficiency; however, respondents were concerned that LLMs could generate false information and propagate training data bias. They were most comfortable with scenarios that allow LLMs to function in an assistive role, like a physician extender or trainee.

Conclusion In this mixed-methods survey of clinicians about LLM use, health care providers were supportive of using LLMs in health care for many tasks, especially in assistive roles. There is a need for continued human-centered development of both LLMs and artificial intelligence in general.

Protection of Human Subjects

The study was performed in compliance with the World Medical Association Declaration of Helsinki on Ethical Principles for Medical Research Involving Human Subjects and was reviewed by the Columbia University Irving Medical Center Institutional Review Board (AAAU7954).



Publication History

Received: 01 December 2023

Accepted: 15 February 2024

Accepted Manuscript online:
05 March 2024

Article published online:
17 April 2024

© 2024. The Author(s). This is an open access article published by Thieme under the terms of the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License, permitting copying and reproduction so long as the original work is given appropriate credit. Contents may not be used for commercial purposes, or adapted, remixed, transformed, or built upon. (https://creativecommons.org/licenses/by-nc-nd/4.0/)

Georg Thieme Verlag KG
Rüdigerstraße 14, 70469 Stuttgart, Germany

 