German CheXpert Chest X-ray Radiology Report Labeler

Alessandro Wollek; Sardi Hyska; Thomas Sedlmeyr; Philip Haitzer; Johannes Rueckel; Bastian O. Sabel; Michael Ingrisch; Tobias Lasser

doi:10.1055/a-2234-8268

RSS-Feed abonnieren

Bitte kopieren Sie die angezeigte URL und fügen sie dann in Ihren RSS-Reader ein.

https://www.thieme-connect.de/rss/thieme/de/10.1055-s-00000066.xml

PDF herunterladen

Rofo 2024; 196(09): 956-965
DOI: 10.1055/a-2234-8268

Chest

German CheXpert Chest X-ray Radiology Report Labeler

Deutscher CheXpert-Röntgenthorax-Befundlabeler

Autoren

Alessandro Wollek

¹Munich Institute of Biomedical Engineering, Technical University of Munich, Garching b. München, Germany

²School of Computation, Information and Technology, Technical University of Munich, Garching b. München, Germany
Sardi Hyska

³Department of Radiology, Ludwig-Maximilians-University Hospital Munich, München, Germany
Thomas Sedlmeyr

¹Munich Institute of Biomedical Engineering, Technical University of Munich, Garching b. München, Germany

²School of Computation, Information and Technology, Technical University of Munich, Garching b. München, Germany
Philip Haitzer

¹Munich Institute of Biomedical Engineering, Technical University of Munich, Garching b. München, Germany

²School of Computation, Information and Technology, Technical University of Munich, Garching b. München, Germany
Johannes Rueckel

³Department of Radiology, Ludwig-Maximilians-University Hospital Munich, München, Germany

⁴Institute of Neuroradiology, Ludwig-Maximilians-University Hospital Munich, München, Germany
Bastian O. Sabel

⁵Institute for Clinical Radiology, Ludwig-Maximilians-University Hospital Munich, Germany, München, Germany
Michael Ingrisch

³Department of Radiology, Ludwig-Maximilians-University Hospital Munich, München, Germany
Tobias Lasser

¹Munich Institute of Biomedical Engineering, Technical University of Munich, Garching b. München, Germany

²School of Computation, Information and Technology, Technical University of Munich, Garching b. München, Germany

Bundesministerium für Gesundheit grant agreement no. 2520DAT920

Weitere Informationen

Auch verfügbar auf

Lizenzen und Reprints

Abstract

Purpose The aim of this study was to develop an algorithm to automatically extract annotations from German thoracic radiology reports to train deep learning-based chest X-ray classification models.

Materials and Methods An automatic label extraction model for German thoracic radiology reports was designed based on the CheXpert architecture. The algorithm can extract labels for twelve common chest pathologies, the presence of support devices, and “no finding”. For iterative improvements and to generate a ground truth, a web-based multi-reader annotation interface was created. With the proposed annotation interface, a radiologist annotated 1086 retrospectively collected radiology reports from 2020–2021 (data set 1). The effect of automatically extracted labels on chest radiograph classification performance was evaluated on an additional, in-house pneumothorax data set (data set 2), containing 6434 chest radiographs with corresponding reports, by comparing a DenseNet-121 model trained on extracted labels from the associated reports, image-based pneumothorax labels, and publicly available data, respectively.

Results Comparing automated to manual labeling on data set 1: “mention extraction” class-wise F1 scores ranged from 0.8 to 0.995, the “negation detection” F1 scores from 0.624 to 0.981, and F1 scores for “uncertainty detection” from 0.353 to 0.725. Extracted pneumothorax labels on data set 2 had a sensitivity of 0.997 [95 % CI: 0.994, 0.999] and specificity of 0.991 [95 % CI: 0.988, 0.994]. The model trained on publicly available data achieved an area under the receiver operating curve (AUC) for pneumothorax classification of 0.728 [95 % CI: 0.694, 0.760], while the models trained on automatically extracted labels and on manual annotations achieved values of 0.858 [95 % CI: 0.832, 0.882] and 0.934 [95 % CI: 0.918, 0.949], respectively.

Conclusion Automatic label extraction from German thoracic radiology reports is a promising substitute for manual labeling. By reducing the time required for data annotation, larger training data sets can be created, resulting in improved overall modeling performance. Our results demonstrated that a pneumothorax classifier trained on automatically extracted labels strongly outperformed the model trained on publicly available data, without the need for additional annotation time and performed competitively compared to manually labeled data.

Key Points

An algorithm for automatic German thoracic radiology report annotation was developed.
Automatic label extraction is a promising substitute for manual labeling.
The classifier trained on extracted labels outperformed the model trained on publicly available data.

Zitierweise

Wollek A, Hyska S, Sedlmeyr T et al. German CheXpert Chest X-ray Radiology Report Labeler. Fortschr Röntgenstr 2024; 196: 956 – 965

Zusammenfassung

Ziel Das Ziel dieser Studie war die Entwicklung eines Algorithmus zur automatischen Extraktion von Labels aus deutschen Röntgenthoraxbefunden, um damit tiefe neuronale Netze zur Klassifikation von Röntgenthoraxaufnahmen zu trainieren.

Material und Methoden Basierend auf der CheXpert-Architektur wurde ein Modell zur automatischen Label-Extraktion für deutsche Röntgenthoraxbefunde entworfen. Der Algorithmus kann Labels für zwölf häufige Thoraxpathologien, die Anwesenheit von Fremdmaterial und „Normalbefund“ extrahieren. Zur iterativen Verbesserung und Generierung eines Referenzstandards wurde ein webbasiertes Multi-Reader-Annotationsinterface erstellt. Mit dem vorgeschlagenen Programm hat ein Radiologe 1086 retrospektiv gesammelte Befunde aus dem Zeitraum 2020–2021 (Datensatz 1) annotiert. Die Auswirkungen der automatisch extrahierten Labels auf die Leistung der Röntgenbildklassifikation wurden an einem zusätzlichen internen Pneumothorax-Datensatz (Datensatz 2) mit 6434 Thorax-Röntgenaufnahmen und entsprechenden Befunden bewertet, indem ein DenseNet-121-Modell verglichen wurde, das auf extrahierten Labels basierend auf zugehörigen Befunden, bildbasierten Pneumothorax-Labels oder öffentlich verfügbaren Daten trainiert wurde.

Ergebnisse Beim Vergleich automatischer mit manueller Annotation des Datensatzes 1 ergaben sich für die klassenspezifischen F1-Scores der Erwähnungsextraktion Werte zwischen 0,8 und 0,995, für die F1-Scores der Negationserkennung zwischen 0,624 und 0,981 und für die F1-Scores der Unsicherheitserkennung zwischen 0,353 und 0,725. Die extrahierten Pneumothorax-Labels des Datensatzes 2 hatten eine Sensitivität von 0,997 [95 %-KI: 0,994, 0,999] und eine Spezifität von 0,991 [95 %-KI: 0,988, 0,994]. Das auf öffentlich verfügbaren Daten trainierte Modell erreichte eine Fläche unter der Operationscharakteristik-Kurve (AUC) für die Pneumothorax-Klassifikation von 0,728 [95 %-KI: 0,694, 0,760], das Modell, das auf automatisch extrahierten Labels trainiert wurde, erreichte 0,858 [95 %-KI: 0,832, 0,882] und auf manuellen Annotationen 0,934 [95 %-KI: 0,918, 0,949].

Schlussfolgerung Die automatische Annotation von deutschen Röntgenthoraxbefunden ist ein vielversprechender Ersatz für die manuelle Annotation. Durch die schnellere Annotation können größere Trainingsdatensätze erstellt werden, was eine höhere Modellleistung verspricht. Unsere Ergebnisse zeigten, dass ein Pneumothorax-Klassifikator, der auf automatisch extrahierten Labels trainiert wurde, das Modell, das auf öffentlich verfügbaren Daten trainiert wurde, deutlich übertraf, ohne zusätzliche Annotationszeit. Verglichen mit manuell annotierten Daten klassifiziert das Modell vielversprechend.

Kernaussagen

Ein Algorithmus für das automatische Labeln von Röntgehnthoraxbefunden wurde entwickelt.
Das automatische Labeln ist ein vielversprechender Ersatz für das manuelle Labeln.
Der mit den extrahierten Labels trainierte Klassifikator übertraf das mit öffentlich verfügbaren Daten trainierte Modell.

Keywords

thorax - radiography - pneumothorax - neural networks - label extraction - annotation

Publikationsverlauf

Eingereicht: 01. Juni 2023

Angenommen: 15. Dezember 2023

Artikel online veröffentlicht:
31. Januar 2024

Georg Thieme Verlag KG
Rüdigerstraße 14, 70469 Stuttgart, Germany

References
1 Directorate-General for Energy (European Commission). Medical radiation exposure of the European population. LU: Publications Office of the European Union; 2015

PubMed Suche in Google Scholar
Download RIS citation
2 Gershan V, Madjunarova SN, Stikova E. Survey on the frequency of typical x-ray examinations and estimation of associated population doses in the Republic of Macedonia. In: CONFERENCE ON MEDICAL PHYSICS AND BIOMEDICAL ENGINEERING. 2013: 14

PubMed Suche in Google Scholar
Download RIS citation
3 Idowu B, Okedere T. Diagnostic Radiology in Nigeria: A Country Report. Journal of Global Radiology 2020; 6

Crossref Suche in Google Scholar
Download RIS citation
4 Rimmer A. Radiologist shortage leaves patient care at risk, warns royal college. BMJ: British Medical Journal (Online) 2017; 359

Suche in Google Scholar
Download RIS citation
5 Rosenkrantz AB, Hughes DR, Duszak JrR. The US radiologist workforce: an analysis of temporal and geographic variation by using large national datasets. Radiology 2016; 279: 175-184

Crossref PubMed Suche in Google Scholar
Download RIS citation
6 Majkowska A, Mittal S, Steiner DF. et al. Chest radiograph interpretation with deep learning models: assessment with radiologist-adjudicated reference standards and population-adjusted evaluation. Radiology 2020; 294: 421-431

Crossref PubMed Suche in Google Scholar
Download RIS citation
7 Rajpurkar P, Irvin J, Ball RL. et al. Deep learning for chest radiograph diagnosis: A retrospective comparison of the CheXNeXt algorithm to practicing radiologists. PLOS Medicine 2018; 15: e1002686

Crossref PubMed Suche in Google Scholar
Download RIS citation
8 Russakovsky O, Deng J, Su H. et al. ImageNet Large Scale Visual Recognition Challenge. International Journal of Computer Vision 2015; 115: 211-252

Crossref Suche in Google Scholar
Download RIS citation
9 Krizhevsky A, Sutskever I, Hinton GE. Imagenet classification with deep convolutional neural networks. Advances in neural information processing systems 2012; 25: 1097-1105

Suche in Google Scholar
Download RIS citation
10 Huang G, Liu Z, Van Der Maaten L. et al. Densely connected convolutional networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition. 2017: 4700-4708

PubMed Suche in Google Scholar
Download RIS citation
11 Deng J, Dong W, Socher R. et al. ImageNet: A large-scale hierarchical image database. In: 2009 IEEE Conference on Computer Vision and Pattern Recognition. 2009: 248-255

PubMed Suche in Google Scholar
Download RIS citation
12 Wang X, Peng Y, Lu L. et al. ChestX-ray8: Hospital-scale Chest X-ray Database and Benchmarks on Weakly-Supervised Classification and Localization of Common Thorax Diseases. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 2017: 3462-3471

PubMed Suche in Google Scholar
Download RIS citation
13 Rajpurkar P, Irvin J, Zhu K. et al Chexnet: Radiologist-level pneumonia detection on chest x-rays with deep learning. arXiv preprint arXiv:171105225 2017;

Download RIS citation
14 Dosovitskiy A, Beyer L, Kolesnikov A. et al. An Image is Worth 16×16 Words: Transformers for Image Recognition at Scale. In: International Conference on Learning Representations. 2020

PubMed Suche in Google Scholar
Download RIS citation
15 Wollek A, Graf R, Čečatka S. et al. Attention-based Saliency Maps Improve Interpretability of Pneumothorax Classification. Radiology: Artificial Intelligence 2023; e220187

Crossref PubMed Suche in Google Scholar
Download RIS citation
16 Irvin J, Rajpurkar P, Ko M. et al. Chexpert: A large chest radiograph dataset with uncertainty labels and expert comparison. In: Proceedings of the AAAI Conference on Artificial Intelligence. 2019: 590-597

PubMed Suche in Google Scholar
Download RIS citation
17 Johnson AE, Pollard TJ, Berkowitz SJ. et al. MIMIC-CXR, a de-identified publicly available database of chest radiographs with free-text reports. Scientific Data 2019; 6

PubMed Suche in Google Scholar
Download RIS citation
18 Reis EP, de Paiva JPQ, da Silva MCB. et al. BRAX, Brazilian labeled chest x-ray dataset. Sci Data 2022; 9: 487

Crossref PubMed Suche in Google Scholar
Download RIS citation
19 Nguyen TTB, Vo TM, Nguyen TV. et al. Learning to diagnose common thorax diseases on chest radiographs from radiology reports in Vietnamese. PLoS ONE 2022; 17: e0276545

Crossref PubMed Suche in Google Scholar
Download RIS citation
20 Nowak S, Biesner D, Layer YC. et al. Transformer-based structuring of free-text radiology report databases. Eur Radiol 2023;

Crossref PubMed Suche in Google Scholar
Download RIS citation
21 Cotik V, Roller R, Xu F. et al. Negation Detection in Clinical Reports Written in German. In: Proceedings of the Fifth Workshop on Building and Evaluating Resources for Biomedical Text Mining (BioTxtM2016). Osaka, Japan: The COLING 2016 Organizing Committee; 2016: 115-124

Suche in Google Scholar
Download RIS citation
22 Hansell DM, Bankier AA, MacMahon H. et al. Fleischner Society: glossary of terms for thoracic imaging. Radiology 2008; 246: 697-722

Crossref PubMed Suche in Google Scholar
Download RIS citation
23 Rueckel J, Trappmann L, Schachtner B. et al. Impact of Confounding Thoracic Tubes and Pleural Dehiscence Extent on Artificial Intelligence Pneumothorax Detection in Chest Radiographs. Invest Radiol 2020; 55: 792-798

Crossref PubMed Suche in Google Scholar
Download RIS citation

Bücher zum Thema

RSS-Feed abonnieren

Teilen / Bookmarken

German CheXpert Chest X-ray Radiology Report Labeler

Autoren

Abstract

Zusammenfassung

Keywords

Publikationsverlauf

References