Methods Inf Med 2006; 45(02): 146-152
DOI: 10.1055/s-0038-1634058
Original Article
Schattauer GmbH

A Generic Concept for Large-scale Microarray Analysis Dedicated to Medical Diagnostics

M. Dugas
1   Department of Medical Informatics and Biomathematics, University of Münster, Münster, Germany
,
F. Weninger
1   Department of Medical Informatics and Biomathematics, University of Münster, Münster, Germany
,
S. Merk
1   Department of Medical Informatics and Biomathematics, University of Münster, Münster, Germany
,
A. Kohlmann
2   Roche Molecular Systems Inc., Pleasanton, CA, USA
,
T. Haferlach
3   Munich Leukemia Laboratory, Munich, Germany
› Author Affiliations
Further Information

Publication History

Publication Date:
06 February 2018 (online)

Summary

Background: The development of diagnostic procedures based on microarray analysis confronts the bioinformatician and the biomedical researcher with a variety of challenges. Microarrays generate a huge amount of data. There are many, not yet clearly defined, data processing steps and many clinical response variables which may not match gene expression patterns.

Objectives: To design a generic concept for large-scale microarray experiments dedicated to medical diagnostics; to create a system capable of handling several 1000 microarrays per analysis and more than 100 clinical response variables; to design a standardized workflow for quality control, data calibration, identification of differentially expressed genes and estimation of classification accuracy; and to provide a user-friendly interface for clinical researchers with respect to biomedical interpretation.

Methods: We designed a database structure suitable for the storage of microarray data and analysis results. We applied statistical procedures to identify differential genes and developed a technique to estimate classification accuracy of gene patterns with confidence intervals.

Results: We implemented a Gene Analysis Management System (GAMS) based on this concept, using MySQL for data storage, R/Bioconductor for analysis and PHP for a web-based front-end for the exploration of microarray data and analysis results. This system was utilized with large data sets from several medical disciplines, mainly from oncology (~ 2000 micro-arrays).

Conclusions: A systematic approach is necessary for the analysis of microarray experiments in a medical diagnostics setting to get comprehensible results. Due to the complexity of the analysis, data processing (by bioinformaticians) and interactive exploration of results (by biomedical experts) should be separated.