gms | German Medical Science

Gesundheit – gemeinsam. Kooperationstagung der Deutschen Gesellschaft für Medizinische Informatik, Biometrie und Epidemiologie (GMDS), Deutschen Gesellschaft für Sozialmedizin und Prävention (DGSMP), Deutschen Gesellschaft für Epidemiologie (DGEpi), Deutschen Gesellschaft für Medizinische Soziologie (DGMS) und der Deutschen Gesellschaft für Public Health (DGPH)

08.09. - 13.09.2024, Dresden

Towards sustainable research data management of longitudinal, heterogeneous data from cardiac tissue engineering

Meeting Abstract

  • Sabine Andrea Smolorz - University Medical Center Göttingen, Department of Medical Informatics, Göttingen, Germany
  • Gesine Marie Dittrich - University Medical Center Göttingen, Institute of Pharmacology and Toxicology, Göttingen, Germany; German Center for Cardiovascular Research (DZHK), Partner Site Lower Saxony, Göttingen, Germany
  • Robert Kossen - University Medical Center Göttingen, Department of Medical Informatics, Göttingen, Germany
  • Linus Weber - University Medical Center Göttingen, Department of Medical Informatics, Göttingen, Germany
  • Tim Meyer - University Medical Center Göttingen, Institute of Pharmacology and Toxicology, Göttingen, Germany; German Center for Cardiovascular Research (DZHK), Partner Site Lower Saxony, Göttingen, Germany
  • Wolfram-Hubertus Zimmermann - University Medical Center Göttingen, Institute of Pharmacology and Toxicology, Göttingen, Germany; German Center for Cardiovascular Research (DZHK), Partner Site Lower Saxony, Göttingen, Germany; University of Göttingen, Cluster of Excellence “Multiscale Bioimaging: from Molecular Machines to Networks of Excitable Cells” (MBExC), Göttingen, Germany; German Center for Neurodegenerative Diseases (DZNE), Göttingen, Germany; Fraunhofer Institute for Translational Medicine and Pharmacology (ITMP), Göttingen, Germany
  • Sara Yasemin Nussbeck - University Medical Center Göttingen, Department of Medical Informatics, Göttingen, Germany; University Medical Center Göttingen, Central Biobank UMG, Göttingen, Germany
  • Ulrich Sax - University Medical Center Göttingen, Department of Medical Informatics, Göttingen, Germany; University of Göttingen, Campus-Institute Data Science (CIDAS), Göttingen, Germany
  • Harald Kusch - University Medical Center Göttingen, Department of Medical Informatics, Göttingen, Germany; University of Göttingen, Campus-Institute Data Science (CIDAS), Göttingen, Germany; University of Göttingen, Cluster of Excellence “Multiscale Bioimaging: from Molecular Machines to Networks of Excitable Cells” (MBExC), Göttingen, Germany

Gesundheit – gemeinsam. Kooperationstagung der Deutschen Gesellschaft für Medizinische Informatik, Biometrie und Epidemiologie (GMDS), Deutschen Gesellschaft für Sozialmedizin und Prävention (DGSMP), Deutschen Gesellschaft für Epidemiologie (DGEpi), Deutschen Gesellschaft für Medizinische Soziologie (DGMS) und der Deutschen Gesellschaft für Public Health (DGPH). Dresden, 08.-13.09.2024. Düsseldorf: German Medical Science GMS Publishing House; 2024. DocAbstr. 604

doi: 10.3205/24gmds125, urn:nbn:de:0183-24gmds1257

Published: September 6, 2024

© 2024 Smolorz et al.
This is an Open Access article distributed under the terms of the Creative Commons Attribution 4.0 License. See license information at http://creativecommons.org/licenses/by/4.0/.


Outline

Text

Introduction: Developing heart patches for repairing the heart muscle in patients with heart failure as well as spontaneously contracting engineered human myocardium (EHM) for drug discovery are core expertise of the Institute of Pharmacology and Toxicology in Göttingen [1], [2]. This translational biomedical research presents challenges beyond the scope of standard tools for research data management (RDM) due to complex, heterogeneous data from highly specialized techniques. Here, we describe our conceptualization for managing data from cardiac tissue engineering, focusing on the feasibility of already established RDM tools from the infrastructure framework provided by our group in the context of various biomedical consortia [3]. ????

Methods: To understand the RDM needs within the described biomedical context, we initially conducted interviews with experts from relevant working groups. Their insights were then translated into user stories using personas to capture user needs effectively. In parallel, we analyzed the data structure of a clearly defined, yet heterogeneous and complex dataset studying EHMs for up to twelve months. Finally, we assessed the feasibility of applying locally existing RDM infrastructures and software tools. We implemented the EHM dataset into these systems and evaluated their ability to meet the diverse stakeholder requirements identified earlier.

Results: Our requirements analysis revealed that the scope of what RDM should provide differed greatly across user groups. For instance, users with programming skills prioritized automatic extraction of standardized data with machine-readable metadata for AI-assisted analysis. In contrast, non-programmers preferred user-friendly interfaces for data exploration and visualization across various data types. The exemplary EHM dataset included longitudinal microscopic and contraction data alongside end-point measurements, particularly from OMICs techniques, adding challenges concerning large file volumes and the need for extensive preprocessing workflow documentation. It also confirmed that while in most working groups there are established, if not necessarily standardized digital workflows focusing on each lab’s specific experimental expertise, there is still a lack of infrastructure for FAIR sharing of data with (internal) collaborators. Encouragingly, the EHM data structure could be successfully mapped to the ISA-tab format within SEEK [4], fulfilling user needs for comprehensive crosslinking and granular access control. Microscopic data was uploaded to OMERO [5] for enhanced visualization, while large data files were stored in a research data archive with cross-references and metadata maintained in SEEK. However, this approach revealed limitations regarding search functionalities and seamless data integration across experiments.

Discussion: Sustainably reusing existing RDM infrastructure offers cost advantages in software development and maintenance. However, it is crucial to balance this with providing sufficient additional values for users to foster their engagement in FAIR data sharing. In our case, SEEK serves as a valuable foundation for organized data storage. Yet, additional tools should address data integration, high-performance computing (HPC) connectivity, advanced search functionalities, and visualization approaches as expressed by the designated users. We have initiated explorative prototype development for further enhanced data integration, while also contributing valuable feedback to the collaborative open-source SEEK project based on our experience.

Acknowledgments: Funded by DFG through CRC1002 (INF), Z projects of CRC1190, CRC1565 and Germany’s Excellence Strategy - EXC2067/1- 390729940.

The authors declare that they have no competing interests.

The authors declare that an ethics committee vote is not required.


References

1.
Bundesministerium für Bildung und Forschung (BMBF). IndiHEART – personalisierte Herzpflaster für Menschen mit Herzmuskelschwäche. Aktuelle Meldungen 2023. 2023 Oct 30 [cited 2024 Apr 29]. Available from: https://www.gesundheitsforschung-bmbf.de/de/indiheart-personalisierte-herzpflaster-fur-menschen-mit-herzmuskelschwache-17081.php External link
2.
Tiburcy M, Meyer T, Liaw NY, Zimmermann WH. Generation of Engineered Human Myocardium in a Multi-well Format. STAR Protoc. 2020:1(1):100032. DOI: 10.1016/j.xpro.2020.100032 External link
3.
Kusch H, Kossen R, Suhr M, Freckmann L, Weber L, Henke C, et al. Management of Metadata Types in Basic Cardiological Research. Stud Health Technol Inform. 2021;283:59–68. DOI: 10.3233/SHTI210542 External link
4.
Wolstencroft K, Owen S, Krebs O, Nguyen Q, Stanford NJ, Golebiewski M, et al. SEEK: A Systems Biology Data and Model Management Platform. BMC Systems Biol. 2015;9(1):33. DOI: 10.1186/s12918-015-0174-y External link
5.
Allan C, Burel JM, Moore J, Blackburn C, Linkert M, Loynton S, et al. OMERO: Flexible, Model-Driven Data Management for Experimental Biology. Nature Methods. 2012:9(3):245–53. DOI: 10.1038/nmeth.1896 External link