Subscribe to RSS
![](/products/assets/desktop/img/oa-logo.png)
DOI: 10.1055/a-1914-1985
The Leipzig Health Atlas—An Open Platform to Present, Archive, and Share Biomedical Data, Analyses, and Models Online
Funding/Acknowledgments The LHA project was funded by the German Ministry of Education and Research with the reference number 031L0026 within the program i:DSem—Integrative Data Semantics in Systems Medicine. The project further has received funding from the Innovative Medicines Initiative 2 Joint Undertaking (JU) under grant agreement No. 853988 including the EU (Horizon 2020)-funded imSAVAR project (to H.L-.W., H.B., and M.L.).![](https://www.thieme-connect.de/media/10.1055-s-00035037/2022S02/lookinside/thumbnails/10-1055-a-1914-1985_22010027-1.jpg)
Abstract
Background Clinical trials, epidemiological studies, clinical registries, and other prospective research projects, together with patient care services, are main sources of data in the medical research domain. They serve often as a basis for secondary research in evidence-based medicine, prediction models for disease, and its progression. This data are often neither sufficiently described nor accessible. Related models are often not accessible as a functional program tool for interested users from the health care and biomedical domains.
Objective The interdisciplinary project Leipzig Health Atlas (LHA) was developed to close this gap. LHA is an online platform that serves as a sustainable archive providing medical data, metadata, models, and novel phenotypes from clinical trials, epidemiological studies, and other medical research projects.
Methods Data, models, and phenotypes are described by semantically rich metadata. The platform prefers to share data and models presented in original publications but is also open for nonpublished data. LHA provides and associates unique permanent identifiers for each dataset and model. Hence, the platform can be used to share prepared, quality-assured datasets and models while they are referenced in publications. All managed data, models, and phenotypes in LHA follow the FAIR principles, with public availability or restricted access for specific user groups.
Results The LHA platform is in productive mode (https://www.health-atlas.de/). It is already used by a variety of clinical trial and research groups and is becoming increasingly popular also in the biomedical community. LHA is an integral part of the forthcoming initiative building a national research data infrastructure for health in Germany.
Keywords
FAIR - data semantics - research data management - metadata - clinical trials - biomathematical models - ontology - risk prediction models - omics dataEthical Consideration
All personal data provided by the LHA are from studies and projects, for which ethical approval has been obtained, together with informed written consent of all individuals.
* These authors contributed equally.
§ These authors shared senior authorship.
# Further LHA team members in alphabetical order: Anika Groß,5,6 Ying-Chi Lin,5 Katja Rillich,2 Samira Zeynalova,2 Marita Ziepert2.
Publication History
Received: 10 March 2022
Accepted: 11 June 2022
Accepted Manuscript online:
01 August 2022
Article published online:
23 December 2022
© 2022. The Author(s). This is an open access article published by Thieme under the terms of the Creative Commons Attribution-NonDerivative-NonCommercial License, permitting copying and reproduction so long as the original work is given appropriate credit. Contents may not be used for commercial purposes, or adapted, remixed, transformed or built upon. (https://creativecommons.org/licenses/by-nc-nd/4.0/)
Georg Thieme Verlag KG
Rüdigerstraße 14, 70469 Stuttgart, Germany
-
References
- 1 Dryad. Homepage Dryad Digital Repository. Accessed February 10, 2022, at: https://datadryad.org
- 2 Edgar R, Domrachev M, Lash AE. Gene Expression Omnibus: NCBI gene expression and hybridization array data repository. Nucleic Acids Res 2002; 30 (01) 207-210
- 3 National Center for Biotechnology Information. Homepage Gene Expression Omnibus. Accessed February 10, 2022, at: https://www.ncbi.nlm.nih.gov/geo/
- 4 European Parliament. European Council. Regulation (EU) 2016/679 of the European Parliament and of the Council of 27 April 2016 on the protection of natural persons with regard to the processing of personal data and on the free movement of such data, and repealing Directive 95/46/EC (General Data Protection Regulation). Official Journal of the European Union. 2016;(L 119):1–88. Accessed December 10, 2021, at: https://eur-lex.europa.eu/eli/reg/2016/679/oj
- 5 Wilkinson MD, Dumontier M, Aalbersberg IJJ. et al. The FAIR Guiding Principles for scientific data management and stewardship. Sci Data 2016; 3: 160018
- 6 FAIR Principles in Health Research; 2020. Available at: https://www.thieme-connect.de/products/ejournals/issue/10.1055/s-011-50674
- 7 Herre H. Towards a new foundational ontology of properties, attributives and data. In: Borgo S, ed. Ontology Makes Sense: Essays in Honor of Nicola Guarino. Amsterdam: IOS Press Incorporated; 2019: 194-210
- 8 ISO. Space Data and Information Transfer Systems—Open Archival Information System (OAIS)—Reference Model. 2nd ed. 2012;14721:2012. Published 2012–09. Accessed December 10, 2021, at: https://www.iso.org/standard/57284.html
- 9 CDISC. Operational Data Model (ODM). Accessed December 10, 2021, at: https://www.cdisc.org/standards/data-exchange/odm
- 10 Kirsten T, Kiel A, Wagner J, Rühle M, Löffler M. Selecting, Packaging, and Granting Access for Sharing Study Data. In: Eibl M. and Gaedke M., eds. INFORMATIK 2017. Bonn: Gesellschaft für Informatik; 2017: 1381-1392
- 11 Semler SC, Wissing F, Heyder R. German Medical Informatics Initiative. Methods Inf Med 2018; 57 (Suppl. 01) e50-e56
- 12 Loeffler M, Engel C, Ahnert P. et al. The LIFE-Adult-Study: objectives and design of a population-based cohort study with 10,000 deeply phenotyped adults in Germany. BMC Public Health 2015; 15: 691
- 13 Murphy S, Churchill S, Bry L. et al. Instrumenting the health care enterprise for discovery research in the genomic era. Genome Res 2009; 19 (09) 1675-1681
- 14 LIFE – Leipziger Forschungszentrum für Zivilisationserkrankungen. LIFE Datenportal. Accessed February 10, 2022, at: https://ldp.life.uni-leipzig.de/
- 15 Wolstencroft K, Owen S, Krebs O. et al. SEEK: a systems biology data and model management platform. BMC Syst Biol 2015; 9: 33
- 16 Schriml LM, Arze C, Nadendla S. et al. Disease Ontology: a backbone for disease semantic integration. Nucleic Acids Res 2012; 40 (Database issue): D940-D946
- 17 Schriml LM, Mitraka E. The Disease Ontology: fostering interoperability between biological and clinical human disease-related data. Mamm Genome 2015; 26 (9–10): 584-589
- 18 Schriml LM, Mitraka E, Munro J. et al. Human Disease Ontology 2018 update: classification, content and workflow expansion. Nucleic Acids Res 2019; 47 (D1): D955-D962
- 19 Whetzel PL, Noy NF, Shah NH. et al. BioPortal: enhanced functionality via new Web services from the National Center for Biomedical Ontology to access and use ontologies in software applications. Nucleic Acids Res 2011; 39 (Web Server issue): W541-5
- 20 Ong E, Xiang Z, Zhao B. et al. Ontobee: a linked ontology data server to support ontology term dereferencing, linkage, query and integration. Nucleic Acids Res 2017; 45 (D1): D347-D352
- 21 Nelson B. Data sharing: empty archives. Nature 2009; 461 (7261): 160-163
- 22 Evans DGR, Lalloo F, Cramer A. et al. Addition of pathology and biomarker information significantly improves the performance of the Manchester scoring system for BRCA1 and BRCA2 testing. J Med Genet 2009; 46 (12) 811-817
- 23 Kast K, Rhiem K, Wappenschmidt B. et al; German Consortium for Hereditary Breast and Ovarian Cancer (GC-HBOC). Prevalence of BRCA1/2 germline mutations in 21 401 families with breast and ovarian cancer. J Med Genet 2016; 53 (07) 465-471
- 24 Barnetson RA, Tenesa A, Farrington SM. et al. Identification and survival of carriers of mutations in DNA mismatch-repair genes in colon cancer. N Engl J Med 2006; 354 (26) 2751-2763
- 25 Kastrinos F, Steyerberg EW, Mercado R. et al. The PREMM(1,2,6) model predicts risk of MLH1, MSH2, and MSH6 germline mutations based on cancer history. Gastroenterology 2011; 140 (01) 73-81
- 26 Kastrinos F, Uno H, Ukaegbu C. et al. Development and validation of the PREMM5 model for comprehensive risk assessment of Lynch syndrome. J Clin Oncol 2017; 35 (19) 2165-2172
- 27 Baniasadi N, Rauscher FG, Li D. et al. Norms of interocular circumpapillary retinal nerve fiber layer thickness differences at 768 retinal locations. Transl Vis Sci Technol 2020; 9 (09) 23
- 28 Wang M, Elze T, Li D. et al. Age, ocular magnification, and circumpapillary retinal nerve fiber layer thickness. J Biomed Opt 2017; 22 (12) 1-19
- 29 Li D, Rauscher FG, Choi EY. et al. Sex-specific differences in circumpapillary retinal nerve fiber layer thickness. Ophthalmology 2020; 127 (03) 357-368
- 30 Peschel T, Wang M, Kirsten T, Rauscher FG, Elze T. A cloud-based infrastructure for interactive analysis of RNFLT data. [accepted for publication]. In: 17th IEEE eScience 2021 International Conference; 2021. Accessed February 10, 2022, at: https://www.escience2021.org/
- 31 Kiel A, Wagner J, Rühle M, Twrdik A. Lens - The system behind the LIFE Data Portal. In: Arendt T, Heiker JT, Magin T, Schaefer M, Schulz-Siegmund M, Thiery J, eds. 15th Leipzig Research Festival for Life Sciences; 2019:186. Accessed December 10, 2021, at: http://www.resfest.uniklinikum-leipzig.de/pdf/2019_01_16-Abstract-Book-Research-2019.pdf
- 32 Wagner J. Softwaregestützte Bereitstellung Von Epidemiologischen Forschungsdaten [Master's thesis]. Leipzig, Germany: Leipzig University of Applied Sciences; 2016
- 33 Uciteli A, Beger C, Wagner J. et al. Ontological modelling and execution of phenotypic queries in the Leipzig Health Atlas. Stud Health Technol Inform 2021; 278: 66-74
- 34 Uciteli A, Beger C, Kirsten T, Meineke FA, Herre H. Ontological representation, classification and data-driven computing of phenotypes. J Biomed Semantics 2020; 11 (01) 15
- 35 WHO Team Nutrition and Food Safety. Waist circumference and waist-hip ratio: report of a WHO expert consultation; 2008. Accessed February 10, 2022, at: https://www.who.int/publications/i/item/9789241501491
- 36 Loeffler-Wirth H, Reikowski J, Hakobyan S, Wagner J, Binder H. oposSOM-Browser: an interactive tool to explore omics data landscapes in health science. BMC Bioinformatics 2020; 21 (01) 465
- 37 Wirth H, Löffler M, von Bergen M, Binder H. Expression cartography of human tissues using self organizing maps. BMC Bioinformatics 2011; 12: 306
- 38 Löffler-Wirth H, Kalcher M, Binder H. oposSOM: R-package for high-dimensional portraying of genome-wide expression landscapes on bioconductor. Bioinformatics 2015; 31 (19) 3225-3227
- 39 Schmidt M, Hopp L, Arakelyan A. et al. The human blood transcriptome in a large population cohort and its relation to aging and health. Front Big Data 2020; 3: 548873
- 40 Loeffler-Wirth H, Schmidt M, Binder H. Covid-19 transmission trajectories-monitoring the pandemic in the worldwide context. Viruses 2020; 12 (07) E777
- 41 Schmidt M, Arshad M, Bernhart SH. et al. The evolving faces of the SARS-CoV-2 genome. Viruses 2021; 13 (09) 1764
- 42 re3data.org. Leipzig Health Atlas. 2018. Accessed February 10, 2022, at: https://www.re3data.org/repository/r3d100012652
- 43 HL7. FHIR Release 4 (Technical Correction #1) (v4.0.1). Accessed February 10, 2022, at: https://www.hl7.org/fhir/R4/
- 44 Observational Health Data Sciences and Informatics. Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM). 2022. Accessed February 10, at: https://www.ohdsi.org/data-standardization/the-common-data-model/
- 45 National Patient-Centered Clinical Research Network. PCORnet Common Data Model. Accessed February 10, 2022, at: https://pcornet.org/data/
- 46 World Health Organization. ICD-10: International Statistical Classification of Diseases and Related Health Problems: Tenth Revision. 2nd ed. 2004. Accessed December 10, 2021, at: https://apps.who.int/iris/handle/10665/42980
- 47 Regenstrief Institute. Logical Observation Identifiers Names and Codes (LOINC). Accessed February 10, 2022, at: https://loinc.org/
- 48 SNOMED International. Systematized Nomenclature Of Medicine Clinical Terms (SNOMED CT). Accessed February 10, 2022, at: https://www.snomed.org
- 49 HITS gGmbH. FAIRDOMHub. Accessed February 10, 2022, at: https://fairdomhub.org/
- 50 Open Knowledge Foundation. Comprehensive Knowledge Archive Network (CKAN). Accessed February 10, 2022, at: https://ckan.org/
- 51 King G. Homepage Dataverse. Accessed February 10, 2022, at: https://dataverse.org/
- 52 King G. An Introduction to the Dataverse Network as an infrastructure for data sharing. Sociol Methods Res 2007; 36 (02) 173-199
- 53 ETH Zurich Scientific IT Services. Homepage openBIS. Accessed February 10, 2022, at: https://openbis.ch
- 54 Bauch A, Adamczyk I, Buczek P. et al. openBIS: a flexible framework for managing and analyzing complex data in biology research. BMC Bioinformatics 2011; 12: 468
- 55 onedata. Homepage onedata. Accessed February 10, 2022, at: https://onedata.org