Improving the use of LLMs in radiology through prompt engineering: from precision prompts to zero-shot learning

Maximilian Frederik Russe; Marco Reisert; Fabian Bamberg; Alexander Rau

doi:10.1055/a-2264-5631

RSS-Feed abonnieren

Bitte kopieren Sie die angezeigte URL und fügen sie dann in Ihren RSS-Reader ein.

https://www.thieme-connect.de/rss/thieme/de/10.1055-s-00000066.xml

PDF herunterladen

Rofo 2024; 196(11): 1166-1170
DOI: 10.1055/a-2264-5631

Technical Innovations

Improving the use of LLMs in radiology through prompt engineering: from precision prompts to zero-shot learning

Verbesserung des Einsatzes von Großen Sprachmodellen in der Radiologie durch „Prompt Engineering“: von präzisen Prompts zu Zero-Shot Learning

Authors

Maximilian Frederik Russe

Department of Radiology, University Hospital Freiburg, Freiburg, Germany
Marco Reisert

Department of Radiology, University Hospital Freiburg, Freiburg, Germany
Fabian Bamberg

Department of Radiology, University Hospital Freiburg, Freiburg, Germany
Alexander Rau

Department of Radiology, University Hospital Freiburg, Freiburg, Germany

Weitere Informationen

Auch verfügbar auf

Lizenzen und Reprints

Abstract

Purpose Large language models (LLMs) such as ChatGPT have shown significant potential in radiology. Their effectiveness often depends on prompt engineering, which optimizes the interaction with the chatbot for accurate results. Here, we highlight the critical role of prompt engineering in tailoring the LLMs’ responses to specific medical tasks.

Materials and Methods Using a clinical case, we elucidate different prompting strategies to adapt the LLM ChatGPT using GPT4 to new tasks without additional training of the base model. These approaches range from precision prompts to advanced in-context methods such as few-shot and zero-shot learning. Additionally, the significance of embeddings, which serve as a data representation technique, is discussed.

Results Prompt engineering substantially improved and focused the chatbot’s output. Moreover, embedding of specialized knowledge allows for more transparent insight into the model’s decision-making and thus enhances trust.

Conclusion Despite certain challenges, prompt engineering plays a pivotal role in harnessing the potential of LLMs for specialized tasks in the medical domain, particularly radiology. As LLMs continue to evolve, techniques like few-shot learning, zero-shot learning, and embedding-based retrieval mechanisms will become indispensable in delivering tailored outputs.

Key Points

Large language models might impact radiological practice and decision-masking.
However, implementation and performance are dependent on the assigned task.
Optimization of prompting strategies can substantially improve model performance.
Strategies for prompt engineering range from precision prompts to zero-shot learning.

Citation Format

Russe MF, Reisert M, Bamberg F et al. Improving the use of LLMs in radiology through prompt engineering: from precision prompts to zero-shot learning . Fortschr Röntgenstr 2024; 196: 1166 – 1170

Zusammenfassung

Ziel Große Sprachmodelle (engl. LLMs) wie ChatGPT haben ein erhebliches Potenzial in der Radiologie gezeigt. Ihre Effektivität hängt oft vom sog. Prompt-Engineering ab, das die Interaktion mit der künstlichen Intelligenz für genaue Ergebnisse optimiert. Hier wird die kritische Rolle des Prompt-Engineerings bei der Anpassung der Antworten der LLMs an spezifische medizinische Aufgaben hervorgehoben.

Material und Methoden Anhand eines klinischen Falles erläutern wir verschiedene Prompting-Strategien zur Anpassung des LLM ChatGPT mit GPT4 an neue Aufgaben ohne zusätzliches Training des Basismodells. Diese Ansätze reichen von präzisierten Prompts bis hin zu fortgeschrittenen In-Kontext-Methoden wie „few-shot“- und „zero-shot“-Lernen. Zusätzlich wird die Bedeutung des „Embeddings“ als Datenrepräsentationstechnik diskutiert.

Ergebnisse Das Prompt-Engineering verbesserte und fokussierte den Output des Chatbots erheblich. Darüber hinaus ermöglicht die Einbettung von Fachwissen einen transparenteren Einblick in die Entscheidungsfindung des Modells und stärkt so das Vertrauen.

Schlussfolgerung Trotz gewisser Herausforderungen spielt das Prompt-Engineering eine zentrale Rolle bei der Nutzung des Potenzials von LLMs für spezialisierte Aufgaben im medizinischen Bereich, insbesondere in der Radiologie. Im Zuge der Weiterentwicklung von LLMs werden Techniken wie „few-shot learning“, „zero-shot learning“ und „Embedding“ für die Bereitstellung maßgeschneiderter Ergebnisse unverzichtbar werden.

Kernaussagen

Große Sprachmodelle könnten die radiologische Routine und Entscheidungsfindung beeinflussen.
Die Implementierung und Leistung hängen jedoch von der zugewiesenen Aufgabe ab.
Die Optimierung von Prompting-Strategien kann die Modellleistung erheblich verbessern.
Strategien für das Prompt-Engineering reichen von Präzision-Prompts bis zum Zero-Shot-Lernen.

Keywords

radiology - artificial Intelligence - natural Language Processing

Publikationsverlauf

Eingereicht: 24. Oktober 2023

Angenommen: 30. Januar 2024

Artikel online veröffentlicht:
26. Februar 2024

Georg Thieme Verlag KG
Rüdigerstraße 14, 70469 Stuttgart, Germany

References
1 OpenAI Platform [Internet]. [zitiert 31. August 2023]. Verfügbar unter: https://platform.openai.com

Reference Link Ris
PubMed
2 Kung TH, Cheatham M, Medenilla A. et al. Performance of ChatGPT on USMLE: Potential for AI-assisted medical education using large language models. PLOS Digit Health 2023; 2 (02) e0000198

Reference Link Ris
Crossref PubMed Suche in Google Scholar
3 Lyu Q, Tan J, Zapadka ME. et al. Translating radiology reports into plain language using ChatGPT and GPT-4 with prompt learning: results, limitations, and potential. Vis Comput Ind Biomed Art. 18 2023; 6 (01) 9

Reference Link Ris
Crossref PubMed Suche in Google Scholar
4 Amin KS, Davis MA, Doshi R. et al. Accuracy of ChatGPT, Google Bard, and Microsoft Bing for Simplifying Radiology Reports. Radiology 2023; 309 (02) e232561

Reference Link Ris
Crossref PubMed Suche in Google Scholar
5 Schmidt S, Zimmerer A, Cucos T. et al. Simplifying radiologic reports with natural language processing: a novel approach using ChatGPT in enhancing patient understanding of MRI results. Arch Orthop Trauma Surg [Internet] 2023;

Reference Link Ris
Crossref PubMed Suche in Google Scholar
6 Jeblick K, Schachtner B, Dexl J. et al. ChatGPT Makes Medicine Easy to Swallow: An Exploratory Case Study on Simplified Radiology Reports [Internet]. arXiv 2022; http://arxiv.org/abs/2212.14882

Reference Link Ris
Crossref PubMed Suche in Google Scholar
7 Sushil M, Kennedy VE, Miao BY. et al. Extracting detailed oncologic history and treatment plan from medical oncology notes with large language models [Internet]. arXiv 2023; http://arxiv.org/abs/2308.03853

Reference Link Ris
Crossref PubMed Suche in Google Scholar
8 Russe MF, Fink A, Ngo H. et al. Performance of ChatGPT, human radiologists, and context-aware ChatGPT in identifying AO codes from radiology reports. Sci Rep 2023; 13 (01) 14215

Reference Link Ris
Crossref PubMed Suche in Google Scholar
9 Wang J, Shi E, Yu S. et al. Prompt Engineering for Healthcare: Methodologies and Applications [Internet]. arXiv 2023; http://arxiv.org/abs/2304.14670

Reference Link Ris
Crossref PubMed Suche in Google Scholar
10 White J, Fu Q, Hays S. et al. A Prompt Pattern Catalog to Enhance Prompt Engineering with ChatGPT [Internet]. arXiv 2023; http://arxiv.org/abs/2302.11382

Reference Link Ris
Crossref PubMed Suche in Google Scholar
11 Pinto dos Santos D, Brodehl S, Baeßler B. et al. Structured report data can be used to develop deep learning algorithms: a proof of concept in ankle radiographs. Insights into Imaging 2019; 10 (01) 93

Reference Link Ris
Crossref PubMed Suche in Google Scholar
12 Ye S, Hwang H, Yang S. et al. In-Context Instruction Learning [Internet]. arXiv 2023; http://arxiv.org/abs/2302.14691

Reference Link Ris
Crossref PubMed Suche in Google Scholar
13 Brown TB, Mann B, Ryder N. et al. Language Models are Few-Shot Learners [Internet]. arXiv 2020; http://arxiv.org/abs/2005.14165

Reference Link Ris
Crossref PubMed Suche in Google Scholar
14 Liu Z, Yu X, Zhang L. et al. DeID-GPT: Zero-shot Medical Text De-Identification by GPT-4 [Internet]. arXiv 2023; http://arxiv.org/abs/2303.11032

Reference Link Ris
Crossref PubMed Suche in Google Scholar
15 Jin Q, Dhingra B, Cohen WW. et al. Probing Biomedical Embeddings from Language Models [Internet]. arXiv 2019; http://arxiv.org/abs/1904.02181

Reference Link Ris
Crossref PubMed Suche in Google Scholar
16 Rau A, Rau S, Zoeller D. et al. A Context-based Chatbot Surpasses Trained Radiologists and Generic ChatGPT in Following the ACR Appropriateness Guidelines. Radiology 2023; 308 (01) e230970

Reference Link Ris
Crossref PubMed Suche in Google Scholar
17 Geis JR, Brady AP, Wu CC. et al. Ethics of Artificial Intelligence in Radiology: Summary of the Joint European and North American Multisociety Statement. Radiology 2019; 293 (02) 436-440

Reference Link Ris
Crossref PubMed Suche in Google Scholar
18 Keskinbora KH. Medical ethics considerations on artificial intelligence. Journal of Clinical Neuroscience 2019; 64: 277-282

Reference Link Ris
Crossref PubMed Suche in Google Scholar
19 Goddard J. Hallucinations in ChatGPT: A Cautionary Tale for Biomedical Researchers. The American Journal of Medicine 2023; 136 (11) 1059-1060

Reference Link Ris
Crossref PubMed Suche in Google Scholar

Bücher zum Thema

RSS-Feed abonnieren

Teilen / Bookmarken

Improving the use of LLMs in radiology through prompt engineering: from precision prompts to zero-shot learning

Authors

Abstract

Zusammenfassung

Keywords

Publikationsverlauf

Leserbrief zu diesem Artikel:

References