Article
Towards sustainable research data management of longitudinal, heterogeneous data from cardiac tissue engineering
Search Medline for
Authors
Published: | September 6, 2024 |
---|
Outline
Text
Introduction: Developing heart patches for repairing the heart muscle in patients with heart failure as well as spontaneously contracting engineered human myocardium (EHM) for drug discovery are core expertise of the Institute of Pharmacology and Toxicology in Göttingen [1], [2]. This translational biomedical research presents challenges beyond the scope of standard tools for research data management (RDM) due to complex, heterogeneous data from highly specialized techniques. Here, we describe our conceptualization for managing data from cardiac tissue engineering, focusing on the feasibility of already established RDM tools from the infrastructure framework provided by our group in the context of various biomedical consortia [3]. ????
Methods: To understand the RDM needs within the described biomedical context, we initially conducted interviews with experts from relevant working groups. Their insights were then translated into user stories using personas to capture user needs effectively. In parallel, we analyzed the data structure of a clearly defined, yet heterogeneous and complex dataset studying EHMs for up to twelve months. Finally, we assessed the feasibility of applying locally existing RDM infrastructures and software tools. We implemented the EHM dataset into these systems and evaluated their ability to meet the diverse stakeholder requirements identified earlier.
Results: Our requirements analysis revealed that the scope of what RDM should provide differed greatly across user groups. For instance, users with programming skills prioritized automatic extraction of standardized data with machine-readable metadata for AI-assisted analysis. In contrast, non-programmers preferred user-friendly interfaces for data exploration and visualization across various data types. The exemplary EHM dataset included longitudinal microscopic and contraction data alongside end-point measurements, particularly from OMICs techniques, adding challenges concerning large file volumes and the need for extensive preprocessing workflow documentation. It also confirmed that while in most working groups there are established, if not necessarily standardized digital workflows focusing on each lab’s specific experimental expertise, there is still a lack of infrastructure for FAIR sharing of data with (internal) collaborators. Encouragingly, the EHM data structure could be successfully mapped to the ISA-tab format within SEEK [4], fulfilling user needs for comprehensive crosslinking and granular access control. Microscopic data was uploaded to OMERO [5] for enhanced visualization, while large data files were stored in a research data archive with cross-references and metadata maintained in SEEK. However, this approach revealed limitations regarding search functionalities and seamless data integration across experiments.
Discussion: Sustainably reusing existing RDM infrastructure offers cost advantages in software development and maintenance. However, it is crucial to balance this with providing sufficient additional values for users to foster their engagement in FAIR data sharing. In our case, SEEK serves as a valuable foundation for organized data storage. Yet, additional tools should address data integration, high-performance computing (HPC) connectivity, advanced search functionalities, and visualization approaches as expressed by the designated users. We have initiated explorative prototype development for further enhanced data integration, while also contributing valuable feedback to the collaborative open-source SEEK project based on our experience.
Acknowledgments: Funded by DFG through CRC1002 (INF), Z projects of CRC1190, CRC1565 and Germany’s Excellence Strategy - EXC2067/1- 390729940.
The authors declare that they have no competing interests.
The authors declare that an ethics committee vote is not required.
References
- 1.
- Bundesministerium für Bildung und Forschung (BMBF). IndiHEART – personalisierte Herzpflaster für Menschen mit Herzmuskelschwäche. Aktuelle Meldungen 2023. 2023 Oct 30 [cited 2024 Apr 29]. Available from:
https://www.gesundheitsforschung-bmbf.de/de/indiheart-personalisierte-herzpflaster-fur-menschen-mit-herzmuskelschwache-17081.php
- 2.
- Tiburcy M, Meyer T, Liaw NY, Zimmermann WH. Generation of Engineered Human Myocardium in a Multi-well Format. STAR Protoc. 2020:1(1):100032. DOI: 10.1016/j.xpro.2020.100032
- 3.
- Kusch H, Kossen R, Suhr M, Freckmann L, Weber L, Henke C, et al. Management of Metadata Types in Basic Cardiological Research. Stud Health Technol Inform. 2021;283:59–68. DOI: 10.3233/SHTI210542
- 4.
- Wolstencroft K, Owen S, Krebs O, Nguyen Q, Stanford NJ, Golebiewski M, et al. SEEK: A Systems Biology Data and Model Management Platform. BMC Systems Biol. 2015;9(1):33. DOI: 10.1186/s12918-015-0174-y
- 5.
- Allan C, Burel JM, Moore J, Blackburn C, Linkert M, Loynton S, et al. OMERO: Flexible, Model-Driven Data Management for Experimental Biology. Nature Methods. 2012:9(3):245–53. DOI: 10.1038/nmeth.1896