gms | German Medical Science

68. Jahrestagung der Deutschen Gesellschaft für Medizinische Informatik, Biometrie und Epidemiologie e. V. (GMDS)

17.09. - 21.09.23, Heilbronn

Integration of microbiome data in cBioPortal

Meeting Abstract

  • Lennart Graf - Department of Medical Informatics, University Medical Center Göttingen, Göttingen, Germany
  • Christoph Ammer-Herrmenau - Department of Gastroenterolgy, University Medical Center Göttingen, Germany, Göttingen, Germany
  • Nils Beyer - Department of Medical Informatics, University Medical Center Göttingen, Göttingen, Germany
  • Jonas Hügel - Department of Medical Informatics, University Medical Center Göttingen, Göttingen, Germany
  • Sophia Rheinländer - Department of Medical Informatics, University Medical Center Göttingen, Göttingen, Germany
  • Ulrich Sax - Department of Medical Informatics, University Medical Center Göttingen, Göttingen, Germany

Deutsche Gesellschaft für Medizinische Informatik, Biometrie und Epidemiologie. 68. Jahrestagung der Deutschen Gesellschaft für Medizinische Informatik, Biometrie und Epidemiologie e. V. (GMDS). Heilbronn, 17.-21.09.2023. Düsseldorf: German Medical Science GMS Publishing House; 2023. DocAbstr. 299

doi: 10.3205/23gmds019, urn:nbn:de:0183-23gmds0198

Published: September 15, 2023

© 2023 Graf et al.
This is an Open Access article distributed under the terms of the Creative Commons Attribution 4.0 License. See license information at http://creativecommons.org/licenses/by/4.0/.


Outline

Text

Introduction: Microbes have an influence on the outcome of cancer therapy. Despite this evidence, the number of microbes known to directly cause the development of cancer remains small, making further research necessary [1]. Within the context of the KFO5002, we loaded buccal and rectal samples from pancreatic cancer (PDAC) patients into our clinical data warehouse, cBioPortal, which enables holistic analyses on the cancer patient [2], [3]. The data derived from the gastroenterology department of the University Medical Center Göttingen and were stored as taxa counts. While the buccal samples were extracted by marker gene analysis, the rectal samples were extracted by whole metagenome analysis. Comparable research in this area is Poore et. al. who performed metatranscriptomic analysis to determine the microbial composition and loaded the taxa counts on a log2-scale in cBioPortal [4].

Methods: We developed ETL methods to extract and store microbiome and clinical data. Depending on the type of analysis, the data was available in a Phyloseq object in R, requiring the use of an R wrapper in our Python-ETL-script. The clinical data included in the Phyloseq object were comprehensive and provided information on, e.g., the patient's therapy, demographic details, operations or previous diseases of the patient. To annotate the clinical data, we used the suggested column names and descriptions from the clinical data dictionary of the OncoTree api recommended by cBioPortal. Since cBioPortal demands microbiome data on a logarithmic scale as described by Poore et. al. we had to transform the taxa counts. We used a transformation from the R package metagenomeSeq that performed log2(x+1) with taxa counts. We then loaded the microbiome data into cBioPortal and analyse it.

Results: We were able to load microbiome data and clinical data from 59 PDAC and 49 chronic pancreatitis (CP) patients into cBioPortal. The cBioPortal incorporated volcano plot does not reveal any differential abundant taxa between both groups regarding the buccal and rectal microbiome. That is contrary to the previous analysis in which differential abundances were obtained by LefSe [5]. Hereby, several taxa were differential abundant in rectal samples between CP and PDAC groups.

Discussion: The exploratory analysis techniques of cBioPortal allowed researchers to see their microbiome data from a different perspective. The group-comparison function and filtering of patients in cBioPortal were found to be useful. While the automatic generation of statistical plots such as p-value, Q-value or Volcano Plots saves time and benefits analysis workflows, it is nevertheless necessary for further microbiome analyses in cBioPortal to be able to use further customised statistical approaches (e.g. Mann-Whitney-U-Test, Alpha-Diversity, Beta-Diversity).

Conclusion: Integrating microbiome data and clinical data in cBioPortal makes data more accessible and discoverable for the clinicians, but lacks statistical tools for microbiome analysis in cBioPortal. Therefore, future research can work on implementing statistical tools for microbiome analyses in cBioPortal.

Funding: This work is funded by the DFG within CRU5002 (Project 426671079).

Ethics: The authors declare that a positive ethics committee vote has been obtained. All work involving patient data is carried out under ethics vote (11/5/17).

Acknowledgements: We would like to thank Prof. Dr. rer. nat. Andre Franke for his support.

The authors declare that they have no competing interests.


References

1.
Sepich-Poore GD, Zitvogel L, Straussman R, Hasty J, Wargo JA, Knight R. The microbiome and human cancer. Science. 2021 Mar 26;371(6536):eabc4552.
2.
Gao J, Aksoy BA, Dogrusoz U, Dresdner G, Gross B, Sumer SO, et al. Integrative analysis of complex cancer genomics and clinical profiles using the cBioPortal. Sci Signal. 2013 Apr 2;6(269):pl1.
3.
Cerami E, Gao J, Dogrusoz U, Gross BE, Sumer SO, Aksoy BA, et al. The cBio Cancer Genomics Portal: An Open Platform for Exploring Multidimensional Cancer Genomics Data. Cancer Discov. 2012 May;2(5):401–4.
4.
Poore GD, Kopylova E, Zhu Q, Carpenter C, Fraraccio S, Wandro S, et al. Microbiome analyses of blood and tissues suggest cancer diagnostic approach. Nature. 2020 Mar;579(7800):567-74.
5.
Segata N, Izard J, Waldron L, Gevers D, Miropolsky L, Garrett WS, et al. Metagenomic biomarker discovery and explanation. Genome Biol. 2011 Jun 24;12(6):R60.