Open Access
CC BY-NC-ND 4.0 · Endosc Int Open 2024; 12(07): E849-E853
DOI: 10.1055/a-2333-8138
Original article

Assessment of colonoscopy skill using machine learning to measure quality: Proof-of-concept and initial validation

Matthew Wittbrodt (1), Matthew Klug (1), Mozziyar Etemadi (1, 2), Anthony Yang (3), John E. Pandolfino (4), Rajesh N. Keswani (4)

Author affiliations
1   Information Services, Northwestern Medicine, Chicago, United States
2   Anesthesiology, Northwestern University Feinberg School of Medicine, Chicago, United States (Ringgold ID: RIN12244)
3   Surgery, Indiana University School of Medicine, Indianapolis, United States (Ringgold ID: RIN12250)
4   Medicine, Northwestern University Feinberg School of Medicine, Chicago, United States (Ringgold ID: RIN12244)

Supported by: Betty and Gordon Moore Foundation
Supported by: Digestive Health Foundation

Abstract

Background and study aims Low-quality colonoscopy increases cancer risk, but measuring quality remains challenging. We developed an automated, interactive assessment of colonoscopy quality (AI-CQ) using machine learning (ML).

Methods Based on quality guidelines, the metrics selected for AI development were insertion time (IT), withdrawal time (WT), polyp detection rate (PDR), and polyps per colonoscopy (PPC). Two novel metrics were also developed: HQ-WT (time during withdrawal with a clear image) and WT-PT (withdrawal time minus polypectomy time). The model, a vision transformer, was pre-trained with self-supervised learning on unlabeled colonoscopy images and then fine-tuned for multi-label classification on a separate, non-overlapping colonoscopy image dataset. A timeline of video predictions and the calculated metrics were presented to clinicians alongside the raw video in a web-based application. The model was externally validated using 50 colonoscopies at a second hospital.
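
As an illustration of how the timing and detection metrics defined above relate to one another, the following is a minimal sketch that derives them from per-frame labels. It is not the authors' implementation: the `Frame` fields, the function names, the fixed frame rate, and the toy values are all assumptions made for this example. PDR is taken as the share of procedures with at least one polyp and PPC as the mean number of polyps per procedure.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Frame:
    """One per-frame prediction. All field names are hypothetical."""
    withdrawal: bool   # True once the scope is being withdrawn
    clear: bool        # True if the image is judged clear
    polypectomy: bool  # True if a polypectomy is in progress


def timing_metrics(frames, fps):
    """Return IT, WT, HQ-WT and WT-PT in seconds from per-frame labels."""
    insertion = sum(1 for f in frames if not f.withdrawal)
    withdrawal = sum(1 for f in frames if f.withdrawal)
    clear = sum(1 for f in frames if f.withdrawal and f.clear)
    polypectomy = sum(1 for f in frames if f.withdrawal and f.polypectomy)
    return {
        "IT": insertion / fps,
        "WT": withdrawal / fps,
        "HQ-WT": clear / fps,                       # withdrawal time with a clear image
        "WT-PT": (withdrawal - polypectomy) / fps,  # withdrawal time minus polypectomy time
    }


def detection_metrics(polyps_per_procedure):
    """Return PDR (share of procedures with >= 1 polyp) and PPC (mean polyps)."""
    n = len(polyps_per_procedure)
    return {
        "PDR": sum(1 for p in polyps_per_procedure if p > 0) / n,
        "PPC": sum(polyps_per_procedure) / n,
    }


# Toy video at 1 frame per second (illustrative values, not study data):
# 3 insertion frames followed by 5 withdrawal frames.
frames = (
    [Frame(False, True, False)] * 3
    + [Frame(True, True, False)] * 2
    + [Frame(True, False, False)] * 1
    + [Frame(True, True, True)] * 2
)
print(timing_metrics(frames, fps=1.0))
# {'IT': 3.0, 'WT': 5.0, 'HQ-WT': 4.0, 'WT-PT': 3.0}
print(detection_metrics([0, 2, 1, 0]))
# {'PDR': 0.5, 'PPC': 0.75}
```

Counting frames and dividing by the frame rate keeps every metric on the same time base, so HQ-WT and WT-PT can be read directly as fractions of WT.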

Results The AI-CQ identified cecal intubation with 88% accuracy. IT (ρ = 0.99) and WT (ρ = 0.99) were highly correlated between manual and AI-CQ measurements, with median differences of 1.5 seconds and 4.5 seconds, respectively. AI-CQ PDR did not significantly differ from manual PDR (47.6% versus 45.5%, P = 0.66). Retroflexion was correctly identified in 95.2% of colonoscopies and the number of right colon evaluations in 100%. HQ-WT was 45.9% of, and significantly correlated with (ρ = 0.85), WT.
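
The agreement statistics reported above (a correlation coefficient and a median difference between manual and automated timings) can be sketched as follows. The abstract does not say how either statistic was computed, so this example assumes Spearman's rank correlation and the median of absolute differences. The function names and the timing values are made up for illustration and are not study data.

```python
from math import sqrt
from statistics import mean, median


def median_abs_difference(manual, automated):
    """Median of the absolute paired differences, in the input units."""
    return median(abs(m - a) for m, a in zip(manual, automated))


def average_ranks(values):
    """Rank values from 1, giving tied values the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        for k in range(i, j + 1):
            ranks[order[k]] = (i + j) / 2 + 1
        i = j + 1
    return ranks


def spearman_rho(x, y):
    """Spearman's rank correlation: Pearson correlation of the ranks."""
    rx, ry = average_ranks(x), average_ranks(y)
    mx, my = mean(rx), mean(ry)
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    sxx = sum((a - mx) ** 2 for a in rx)
    syy = sum((b - my) ** 2 for b in ry)
    return cov / sqrt(sxx * syy)


# Made-up withdrawal times in seconds for five procedures.
manual_wt = [420.0, 515.0, 610.0, 380.0, 700.0]
ai_wt = [424.0, 510.0, 612.0, 385.0, 697.0]

print(median_abs_difference(manual_wt, ai_wt))  # 4.0
print(spearman_rho(manual_wt, ai_wt))
```

A rank-based coefficient is insensitive to a few large timing outliers, which is one reason it might be preferred for procedure durations; a Pearson coefficient would be an equally plausible reading of the abstract.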

Conclusions An interactive AI tool for assessing colonoscopy skill can measure quality automatically. We propose that this tool can be used to rapidly identify and train providers in need of remediation.



Publication History

Received: 22 April 2024

Accepted after revision: 14 May 2024

Accepted Manuscript online: 27 May 2024

Article published online: 03 July 2024

© 2024. The Author(s). This is an open access article published by Thieme under the terms of the Creative Commons Attribution-NonDerivative-NonCommercial-License, permitting copying and reproduction so long as the original work is given appropriate credit. Contents may not be used for commercial purposes, or adapted, remixed, transformed or built upon. (https://creativecommons.org/licenses/by-nc-nd/4.0/).

Georg Thieme Verlag KG
Rüdigerstraße 14, 70469 Stuttgart, Germany