gms | German Medical Science

63. Jahrestagung der Deutschen Gesellschaft für Medizinische Informatik, Biometrie und Epidemiologie e. V. (GMDS)

Deutsche Gesellschaft für Medizinische Informatik, Biometrie und Epidemiologie

02. - 06.09.2018, Osnabrück

Application of basic research data management with FAIRDOM/SEEK from a medical informatics perspective

Meeting Abstract

Suche in Medline nach

  • Christian R. Bauer - Department of Medical Informatics, University Medical Center Göttingen, Göttingen, Deutschland
  • Cornelius Knopp - Department of Medical Informatics, University Medical Center Göttingen, Göttingen, Deutschland
  • Theresa Bender - Department of Medical Informatics, University Medical Center Göttingen, Göttingen, Deutschland
  • Harald Kusch - Department of Medical Informatics, University Medical Center Göttingen, Göttingen, Deutschland; Department of Molecular Biology, University Medical Center Göttingen, Göttingen, Deutschland
  • Ulrich Sax - Department of Medical Informatics, University Medical Center Göttingen, Göttingen, Deutschland

Deutsche Gesellschaft für Medizinische Informatik, Biometrie und Epidemiologie. 63. Jahrestagung der Deutschen Gesellschaft für Medizinische Informatik, Biometrie und Epidemiologie e.V. (GMDS). Osnabrück, 02.-06.09.2018. Düsseldorf: German Medical Science GMS Publishing House; 2018. DocAbstr. 205

doi: 10.3205/18gmds093, urn:nbn:de:0183-18gmds0935

Veröffentlicht: 27. August 2018

© 2018 Bauer et al.
Dieser Artikel ist ein Open-Access-Artikel und steht unter den Lizenzbedingungen der Creative Commons Attribution 4.0 License (Namensnennung). Lizenz-Angaben siehe http://creativecommons.org/licenses/by/4.0/.


Gliederung

Text

Introduction: Clinical experimental and observational research typically involves data streams between multiple sources and actors, for example from multiple software systems through statisticians to clinicians. As shown in our experience with openBIS [1] possible solutions to handle this complexity can be based on generic research management systems with file storage capabilities involving access management, dataset versioning and audit trails.

SEEK [2] is an open source software with roots in the systems biology community providing data management, sharing and exploration of data and biological models. SEEK offers a data storage solution with web-interface, fine-grained access permissions and metadata storage utilizing the Resource Description Format (RDF). SEEK is now bundled together with openBIS under the umbrella FAIRDOM [3] offering a toolbox implementing the FAIR Guiding Principles [4]. Based on our experience with openBIS [1] we examined the utility of employing the SEEK platform to provide basic data management support to ongoing local research projects.

Methods: A local SEEK instance was created at the University Medical Center Göttingen using Docker with four independent Docker volumes for easy application/data separation and backup. Furthermore, four application containers were utilized to separate the main SEEK instance from database, workers and the SOLR search service for flexible update and resource management. Customizing of our SEEK instance involved local branding and the connection to the local University Medical Center user management based on LDAP followed by development and implementation of a mapping strategy for local workflows onto the SEEK data model. Mapping was guided by comprehensive evaluation of implementation experience with openBIS.

Results: We established a SEEK instance restricted to our local hospital network. Utilizing a complete Docker setup while developing enabled an uncomplicated transfer into our productive instance. The data management of two long-term observational studies with data created in disparate data capture applications for study, image and biomaterial data was realized in this instance. Both comprise of continuous workflows collecting data from multiple sources, data-linkage through an external service, data cleaning and upload into a unifying research platform, i2b2/tranSMART [5]. We implemented this workflow with access rights complying with data protection concepts established within the projects: data creators may only upload their data while data stewards need broader authorization for the data integration process. In addition, configurations, data integration scripts, and instructions to communicate with external services were also represented and versioned in SEEK.

Discussion: Our implementation of SEEK to establish a basic research data management solution simplified daily management tasks and promises secure and traceable data handling of our use-cases. In particular, the built-in user-friendly visualization of SEEKs management and workflow features like interactive document link representation was effective in engaging users in the platform. Though parts of SEEKs functionality are specially adapted to systems biology data (e.g. emulation of lab notebook), they may be customizable to our environment. Our next steps to further establish SEEK as a general and easy to adapt data managing platform inside our biomedical informatics infrastructure [6] will focus on improved adaption and automation of our workflows as well as reviewing security aspects.

Acknowledgements: This work was supported by the German Federal Ministry of Education and Research (BMBF) within the framework of the research and funding concepts e:Med (01ZX1606C/sysINFLAME), i:DSem (031L0024A/MyPathSem) and the DFG for the Collaborative Research Centre 1002 on Modulatory Units in Heart Failure, subproject INF.

The authors declare that they have no competing interests.

The authors declare that an ethics committee vote is not required.


References

1.
Knopp C, Bauer CR, Baum B, Löhnhardt B, Nietert M, Beißbarth T, Sax U. A validatable data management solution for multi-center research consortia: research data management with openBIS. In: HEC 2016: Health — Exploring Complexity: 2016 Joint Conference of GMDS, DGEpi, IEA-EEF, EFMI.
2.
Wolstencroft K, Owen S, Krebs O, Nguyen Q, Stanford NJ, Golebiewski M, Weidemann A, Bittkowski M, An L, Shockley D, Snoep JL, Mueller W, Goble C. SEEK: A systems biology data and model management platform. BMC Syst Biol. 2015; 9:33.
3.
Wolstencroft K, Krebs O, Snoep JL, Stanford NJ, Bacall F, Golebiewski M, et al. FAIRDOMHub: A repository and collaboration environment for sharing systems biology research. Nucleic acids research. 2017; 45(D1):D404-D407.
4.
Wilkinson MD, Dumontier M, Aalbersberg IJJ, Appleton G, Axton M, Baak A, et al. The FAIR Guiding Principles for scientific data management and stewardship. Scientific data. 2016; 3:160018.
5.
Christoph J, Knell C, Naschberger E, Stürzl M, Maier C, Prokosch H-U, Sedlmayr M. Two Years of tranSMART in a University Hospital for Translational Research and Education. Stud Health Technol Inform. 2017; 236:70–9.
6.
Bauer CR, Umbach N, Baum B, Buckow K, Franke T, Grütz R, et al. Architecture of a Biomedical Informatics Research Data Management Pipeline. Stud Health Technol Inform. 2016; 228:262–6.