DOI: 10.1055/s-0043-1774399
Improved Performance of ChatGPT-4 on the OKAP Examination: A Comparative Study with ChatGPT-3.5
Funding/Acknowledgment: No financial support was received for this research.
Abstract
Introduction: This study aims to evaluate the performance of ChatGPT-4, an advanced artificial intelligence (AI) language model, on the Ophthalmology Knowledge Assessment Program (OKAP) examination compared to its predecessor, ChatGPT-3.5.
Methods: Both models were tested on 180 OKAP practice questions covering various ophthalmology subject categories.
Results: ChatGPT-4 significantly outperformed ChatGPT-3.5 (81% vs. 57%; p<0.001), indicating improvements in medical knowledge assessment.
Discussion: The superior performance of ChatGPT-4 suggests potential applicability in ophthalmologic education and clinical decision support systems. Future research should focus on refining AI models, ensuring a balanced representation of fundamental and specialized knowledge, and determining the optimal method of integrating AI into medical education and practice.
Keywords
artificial intelligence - Ophthalmology Knowledge Assessment Program - OKAP - ChatGPT - medical education
Publication History
Received: 13 April 2023
Accepted: 10 August 2023
Article published online:
11 September 2023
© 2023. The Author(s). This is an open access article published by Thieme under the terms of the Creative Commons Attribution-NonCommercial-NoDerivatives License, permitting copying and reproduction so long as the original work is given appropriate credit. Contents may not be used for commercial purposes, or adapted, remixed, transformed, or built upon. (https://creativecommons.org/licenses/by-nc-nd/4.0/)
Thieme Medical Publishers, Inc.
333 Seventh Avenue, 18th Floor, New York, NY 10001, USA