Methods Inf Med 2016; 55(04): 381-386
DOI: 10.3414/ME15-02-0015
Focus Theme – Original Articles
Schattauer GmbH

Data Linkage from Clinical to Study Databases via an R Data Warehouse User Interface

Experiences from a Large Clinical Follow-up Study
Mathias Kaspar
1   Comprehensive Heart Failure Center (CHFC), University Hospital of Würzburg, Würzburg, Germany
,
Maximilian Ertl
2   Service Center Medical Informatics, University Hospital of Würzburg, Würzburg, Germany
,
Georg Fette
1   Comprehensive Heart Failure Center (CHFC), University Hospital of Würzburg, Würzburg, Germany
3   Chair of Computer Science VI, University of Würzburg, Würzburg, Germany
,
Georg Dietrich
3   Chair of Computer Science VI, University of Würzburg, Würzburg, Germany
,
Martin Toepfer
3   Chair of Computer Science VI, University of Würzburg, Würzburg, Germany
,
Christiane Angermann
1   Comprehensive Heart Failure Center (CHFC), University Hospital of Würzburg, Würzburg, Germany
,
Stefan Störk*
1   Comprehensive Heart Failure Center (CHFC), University Hospital of Würzburg, Würzburg, Germany
,
Frank Puppe*
3   Chair of Computer Science VI, University of Würzburg, Würzburg, Germany
› Author Affiliations
Fundings This research was funded by grant of German Federal Ministry of Education and Research (Comprehensive Heart Failure Center Würzburg, grants #01EO1004 and #01EO1504).
Further Information

Publication History

received: 15 December 2015

accepted in revised form: 15 June 2016

Publication Date:
08 January 2018 (online)

Summary

Background: Data that needs to be documented for clinical studies has often been acquired and documented in clinical routine. Usually this data is manually transferred to Case Report Forms (CRF) and/or directly into an electronic data capture (EDC) system.

Objectives: To enhance the documentation process of a large clinical follow-up study targeting patients admitted for acutely decompensated heart failure by accessing the data created during routine and study visits from a hospital information system (HIS) and by transferring it via a data warehouse (DWH) into the study‘s EDC system.

Methods: This project is based on the clinical DWH developed at the University of Würzburg. The DWH was extended by several new data domains including data created by the study team itself. An R user interface was developed for the DWH that allows to access its source data in all its detail, to transform data as comprehensively as possible by R into study-specific variables and to support the creation of data and catalog tables.

Results: A data flow was established that starts with labeling patients as study patients within the HIS and proceeds with updating the DWH with this label and further data domains at a daily rate. Several study-specific variables were defined using the implemented R user interface of the DWH. This system was then used to export these variables as data tables ready for import into our EDC system. The data tables were then used to initialize the first 296 patients within the EDC system by pseudonym, visit and data values. Afterwards, these records were filled with clinical data on heart failure, vital parameters and time spent on selected wards.

Conclusions: This solution focuses on the comprehensive access and transformation of data for a DWH-EDC system linkage. Using this system in a large clinical study has demonstrated the feasibility of this approach for a study with a complex visit schedule.

* These authors contributed equally to this work


 
  • References

  • 1 Prather JC, Lobach DF, Goodwin LK, Hales JW, Hage ML, Hammond WE. Medical data mining: knowledge discovery in a clinical data warehouse. Proceedings of the AMIA Annual Fall Symposium 1997; 101-105.
  • 2 Bréant C, Thurler G, Borst F, Geissbuhler A. Design of a Multi Dimensional Database for the Archimed DataWarehouse. Stud Health Technol Inform 2005; 116: 169-174.
  • 3 Bruland P, Forster C, Breil B, Ständer S, Dugas M, Fritz F. Does single-source create an added value? Evaluating the impact of introducing x4T into the clinical routine on workflow modifications, data quality and cost-benefit. Int J Med Inform 2014; 83: 915-928.
  • 4 Duftschmid G, Gall W, Eigenbauer E, Dorda W. Management of data from clinical trials using the ArchiMed system. Med Inform Internet 2002; 27: 85-98.
  • 5 Köpcke F, Kraus S, Scholler A, Nau C, Schüttler J, Prokosch HU, Ganslandt T. Secondary use of routinely collected patient data in a clinical trial: an evaluation of the effects on patient recruitment and data acquisition. Int J Med Inform 2013; 82: 185-192.
  • 6 Brammen D, Katzer C, Röhrig R, Weismüller K, Maier M, Hossain H, Menges T, Hempelmann G, Chakraborty T. An integrated data-warehouse-concept for clinical and biological information. Stud Health Technol Inform 2005; 116: 9-14.
  • 7 Lowe HJ, Ferris TA, Hernandez PM, Weber SC. STRIDE – An integrated standards-based translational research informatics platform. AMIA Annual Symposium proceedings 2009; 391-395.
  • 8 Murphy SN, Weber G, Mendis M, Gainer V, Chueh HC, Churchill S, Kohane I. Serving the enterprise and beyond with informatics for integrating biology and the bedside (i2b2). J Am Med In-format Assoc 2010; 17: 124-130.
  • 9 Wiesenauer M, Johner C, Röhrig R. Secondary use of clinical data in healthcare providers – an overview on research, regulatory and ethical requirements. Stud Health Technol Inform 2012; 180: 614-618.
  • 10 Fette G, Ertl M, Wörner A, Kluegl P, Störk S, Puppe F. Information Extraction from Unstructured Electronic Health Records and Integration into a Data Warehouse. Lecture Notes in Informatics 2012; 208: 1238-52.
  • 11 Danciu I, Cowan JD, Basford M, Wang X, Saip A, Osgood S, Shirey-Rice J, Kirby J, Harris PA. Secondary use of clinical data: the Vanderbilt approach. J Biomed Inform 2014; 52: 28-35.
  • 12 Prokosch HU, Ganslandt T. Perspectives for medical informatics. Reusing the electronic medical record for clinical research. Methods Inf Med 2009; 48: 38-44.
  • 13 Ganslandt T, Mate S, Helbing K, Sax U, Prokosch H. Unlocking Data for Clinical Research – The German i2b2 Experience. Appl Clin Inform 2011; 2: 116-127.
  • 14 Gall W, Sachs P, Duftschmid G, Dorda W. A retrieval system for the selection and statistical analysis of clinical data. Med Inform Internet 1999; 24: 201-212.
  • 15 R Development Core Team.. R: A Language and Environment for Statistical Computing. Vienna: the R Foundation for Statistical Computing. 2011 Available from: http://www.R-project.org.
  • 16 Toepfer M, Corovic H, Fette G, Klügl P, Störk S, Puppe F. Fine-grained information extraction from German transthoracic echocardiography reports. Med Inform Decis 2015; 15: 91.
  • 17 Fette G, Dietrich G, Ertl M, Toepfer M, Kaspar M, Störk S, Puppe F. Die grafische Benutzeroberfläche PaDaWaN für das klinische Data Warehouse für Routinedaten an der Universitätsklinik Würzburg [The graphical user interface PaDaWaN for the clinical data warehouse of the Würzburg University Hospital]. 60 Jahrestagung GMDS 2015
  • 18 Vawdrey DK, Weng C, Herion D, Cimino JJ. Enhancing electronic health records to support clinical research. AMIA Joint Summits on Translational Science proceedings, Summit on Translational Science 2014; 102-108.
  • 19 Bauer CR, Ganslandt T, Baum B, Christoph J, Engel I, Löbe M, Mate S, Stäubert S, Drepper J, Prokosch HU, Winter A, Sax U. Integrated Data Repository Toolkit (IDRT). A Suite of Programs to Facilitate Health Analytics on Heterogeneous Medical Data. Methods Inf Med 2016; 55 (Suppl. 02) 125-35.
  • 20 Klann JG, Abend A, Raghavan VA, Mandl KD, Murphy SN. Data interchange using i2b2. J Am Med Inform Assoc. 2016 [Epub ahead of print]
  • 21 Bae CJ, Griffith S, Fan Y, Dunphy C, Thompson N, Urchek J, Parchman A, Katzan IL. The Challenges of Data Quality Evaluation in a Joint Data Warehouse. EGEMS, Washington, DC 2015; 3: 1125.