gms | German Medical Science

68. Jahrestagung der Deutschen Gesellschaft für Medizinische Informatik, Biometrie und Epidemiologie e. V. (GMDS)

17.09. - 21.09.23, Heilbronn

Providing interactive deep learning models in the browser: a case study of scRNA data exploration in the context of research data management

Meeting Abstract

  • Sara J. Al-Rawi - Institute of Medical Biometry and Statistics, Faculty of Medicine and Medical Center – University of Freiburg, Freiburg im Breisgau, Germany
  • Manuel Watter - Institute of Medical Biometry and Statistics, Faculty of Medicine and Medical Center – University of Freiburg, Freiburg im Breisgau, Germany
  • Martin Treppner - Institute of Medical Biometry and Statistics, Faculty of Medicine and Medical Center – University of Freiburg, Freiburg im Breisgau, Germany
  • Harald Binder - Institute of Medical Biometry and Statistics, Faculty of Medicine and Medical Center – University of Freiburg, Freiburg im Breisgau, Germany
  • Jochen Knaus - Institute of Medical Biometry and Statistics, Faculty of Medicine and Medical Center – University of Freiburg, Freiburg im Breisgau, Germany

Deutsche Gesellschaft für Medizinische Informatik, Biometrie und Epidemiologie. 68. Jahrestagung der Deutschen Gesellschaft für Medizinische Informatik, Biometrie und Epidemiologie e. V. (GMDS). Heilbronn, 17.-21.09.2023. Düsseldorf: German Medical Science GMS Publishing House; 2023. DocAbstr. 279

doi: 10.3205/23gmds126, urn:nbn:de:0183-23gmds1266

Published: September 15, 2023

© 2023 Al-Rawi et al.
This is an Open Access article distributed under the terms of the Creative Commons Attribution 4.0 License. See license information at http://creativecommons.org/licenses/by/4.0/.


Outline

Text

Introduction: Deep-learning techniques can be powerful, but the complex computational setup and software deployment required [1] have widened the gap between sophisticated data modelers and less tech-savvy users. To change this, offering deep learning methods via the web browser can be a solution that avoids complex setups but still allows interactive data exploration. An increasingly important issue in life science is modeling single-cell RNA sequencing (scRNA-seq). scRNA-seq captures the complex cellular heterogeneity of RNA transcripts by profiling the transcriptomes of individual cells. Deep-learning techniques based on variational autoencoders [2] facilitate biological insight [3], [4] by learning low dimensional latent representations of scRNA-seq data. We implement a novel dimensionality reduction method based on gradient optimization. Together with visualization, it allows users to interactively explore individual genes from scRNA-seq data locally in their web browsers. To improve reproducibility, the data and models are versioned together in a research data management system, and exploration results are eventually added to improve data documentation.

Methods: Gradient-based optimization is at the heart of deep learning and is usually provided by complex frameworks. Earlier approaches for deep learning methods in the browser [5] relied on reimplementing algorithms in Javascript to provide gradients; instead, we utilize the intermediate format WebAssembly [6], which is available in all modern browsers and can target different programming languages. We enhance the gradient optimization of our deep learning method by introducing a novel term that allows optimization with respect to a selected set of genes by domain experts. This approach enables exploration of the impact of these genes on low-dimensional latent representations. The gradient optimization is implemented in Swift combined with the Python-based Shinylive, a serverless framework for providing interactive interfaces. The bundle of all components is delivered via a research data management system and can therefore be used optionally to provide managed datasets. The user is provided with interactive visualizations, and results can be transferred back as data documentation.

Results: As a case study, we have shown that deploying deep learning via WebAssembly can make these methods more accessible in restricted environments or to less tech-savvy scientists. With this technology, domain experts can interactively explore the low-dimensional manifold produced by deep learning techniques in their web browsers. Combining method and data as revision-controlled digital objects further lower barriers to use increases reproducibility, and improves data documentation through optional reintegration of exploration results.

Discussion: WebAssembly is an emerging technology that simplifies the integration of methods developed in various programming languages. While it still has room for improvement, particularly in supporting data science languages, the solution demonstrates the feasibility of making complex models accessible and executable in local environments. Adoption to other deep learning-based models for Multiomics data integration is possible.

The authors declare that they have no competing interests.

The authors declare that an ethics committee vote is not required.


References

1.
Chen Z, Cao Y, Liu Y, Wang H, Xie T, Liu X. A comprehensive study on challenges in deploying deep learning based software. In: Proceedings of the 28th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering. Virtual Event. ACM; 2020. p. 750–62. DOI: 10.1145/3368089.3409759 External link
2.
Kingma DP and Welling M. Auto-Encoding Variational Bayes. In: Bengio Y, LeCun Y, editors. 2nd International Conference on Learning Representations (ICLR). Conference Track Proceedings. 2014.
3.
Lopez R, Regier J, Cole MB, Jordan MI, and Yosef N. Deep generative modeling for single-cell transcriptomics. Nature Methods. 2018;15:1053–8.
4.
Grønbech CH, Vording MF, Timshel PN, Sønderby CK, Pers TH, Winther O. scVAE: variational auto-encoders for single-cell gene expression data. Bioinformatics. 2020 Aug 15;36(16):4415–4422.
5.
Schmid-Burgk JL, Hornung V. BrowserGenome.org: web-based RNA-seq data analysis and visualization. Nat Methods. 2015 Nov;12(11):1001.
6.
Haas A, Rossberg A, Schuff DL, Titzer BL, Holman M, Gohman D, et al. Bringing the web up to speed with WebAssembly. SIGPLAN Not. 2017 Sep 14;52(6):185–200.