gms | German Medical Science

Gesundheit – gemeinsam. Kooperationstagung der Deutschen Gesellschaft für Medizinische Informatik, Biometrie und Epidemiologie (GMDS), Deutschen Gesellschaft für Sozialmedizin und Prävention (DGSMP), Deutschen Gesellschaft für Epidemiologie (DGEpi), Deutschen Gesellschaft für Medizinische Soziologie (DGMS) und der Deutschen Gesellschaft für Public Health (DGPH)

08.09. - 13.09.2024, Dresden

Training Federated Liver Cancer Image Segmentation Models using PHT Infrastructure

Meeting Abstract

  • Macedo Maia - Leipzig University Medical Center, Dept. Medical Data Science, Leipzig, Germany
  • Mehrshad Jaberansary - Institute for Medical Informatics, University of Cologne, Cologne, Germany
  • Muhammad Hamza Akhtar - Department of Data Science and Artificial Intelligence, Fraunhofer FIT, Sankt Augustin, Germany; RWTH Aachen University, Aachen, Germany
  • Yeliz Ucer - Department of Data Science and Artificial Intelligence, Fraunhofer FIT, Sankt Augustin, Germany
  • Oya Beyan - Universität zu Köln, Cologne, Germany; Department of Data Science and Artificial Intelligence, Fraunhofer FIT, Sankt Augustin, Germany
  • Toralf Kirsten - Leipzig University Medical Center, Dept. Medical Data Science, Leipzig, Germany

Gesundheit – gemeinsam. Kooperationstagung der Deutschen Gesellschaft für Medizinische Informatik, Biometrie und Epidemiologie (GMDS), Deutschen Gesellschaft für Sozialmedizin und Prävention (DGSMP), Deutschen Gesellschaft für Epidemiologie (DGEpi), Deutschen Gesellschaft für Medizinische Soziologie (DGMS) und der Deutschen Gesellschaft für Public Health (DGPH). Dresden, 08.-13.09.2024. Düsseldorf: German Medical Science GMS Publishing House; 2024. DocAbstr. 1050

doi: 10.3205/24gmds225, urn:nbn:de:0183-24gmds2254

Published: September 6, 2024

© 2024 Maia et al.
This is an Open Access article distributed under the terms of the Creative Commons Attribution 4.0 License. See license information at http://creativecommons.org/licenses/by/4.0/.


Outline

Text

Introduction: The urgency for early detection of liver cancer is paramount, significantly increasing survival rates. Recent advances in artificial intelligence (AI) have proposed more effective early detection methods. By analysing medical imaging scans, such as CT scans, AI algorithms can identify abnormalities indicative of cancerous growths with remarkable accuracy. However, the efficacy of these algorithms heavily relies on the availability of labelled data and a massive amount of this available data for model training purposes.

Moreover, relevant data for model training is distributed in multiple sources. This work presents an implementation of federated liver and tumour segmentation models using Personal Health Train (PHT) infrastructure.

Federated Model using PHT Infrastructure: Our federated models consist of training a model based on distributed data. PHT [1], [2] infrastructure provides access to different endpoints and makes distributed operations (including federated learning).

We define three PHT Stations (clients) at different places (Leipzig, Cologne, and Aachen) and a central service (central server). Each PHT Station is an independent machine responsible for training a local model based on its locally stored data. Subsequently, the PHT Central Service is connected to each Station and aggregates the trained local models, proposing a global model through local models' weight averaging.

Data Setup: We started training a segmentation model based on the medical segmentation decathlon dataset [3] stored in a centralised server. It contains 131 3D-labelled NIFTI CT images converted into thousands of valid 2D scans and split into three equal parts (each part stored in a specific PHT Station). Each image contains segmentation masks that help identify the regions where the liver and tumours are. We also implemented some state-of-the-art segmentation models (UNET [4], DenseUNET [5], ResUNET++ [6], and Attention-Unet [7]).

We separated the CT scans into training (80%) and test (20%) sets for each PHT Station. We used the dice for liver and tumour regions and the IoU measures to evaluate each model's performance.

Results: We trained and evaluated the previously mentioned segmentation models on the specified dataset. Remarkably, all three segmentation models demonstrated comparable performance. Every model is active more than 90% of the time for dice score prediction of the liver region and more than 70% for dice score prediction of tumour regions for different endpoints.

Discussion: Predictive models such as image segmentation, particularly in sensitive medical domains such as liver cancer segmentation, demand high efficiency in real-world applications. Performance metrics such as the dice coefficient must exceed 90% for liver and tumour region prediction on new images to be adopted in medical institutions to help doctors in decision-making. Incorporating additional CT scans becomes imperative to enhance segmentation model robustness and generalisation. However, amalgamating diverse datasets into a centralised server poses significant challenges, primarily due to stringent data privacy regulations across different institutions, such as DIC Leipzig and Cologne. In the future, we want to join other data sources (e.g., hospitals) containing CT Liver images in our experiments to enrich our federated model.

Acknowledgement: This work is supported by the BMBF project FAIR Data Spaces (FAIRDS14).

The authors declare that they have no competing interests.

The authors declare that an ethics committee vote is not required.


References

1.
Welten S, Mou Y, Neumann L, Jaberansary M, Ucer Y, Kirsten T, et al. A Privacy-Preserving Distributed Analytics Platform for Health Care Data. Methods Inf Med. 2022;61(S 01):e1-e11. DOI: 10.1055/s-0041-1740564 External link
2.
Maia M, Jaberansary M, Ucer Y, Beyan O, Kirsten T. Providing Publicly Available Medical Data Access under FAIR Principles for Distributed Analysis. In: Deutsche Gesellschaft für Medizinische Informatik, Biometrie und Epidemiologie, editor. 67. Jahrestagung der Deutschen Gesellschaft für Medizinische Informatik, Biometrie und Epidemiologie e. V. (GMDS), 13. Jahreskongress der Technologie- und Methodenplattform für die vernetzte medizinische Forschung e.V. (TMF). 21.-25.08.2022. Düsseldorf: GMS; 2022. DocAbstr. 203. DOI: 10.3205/22gmds071 External link
3.
Antonell M, Reinke A, Bakas S, et al. The Medical Segmentation Decathlon. Nature Communication. 2022;13:4128. DOI: 10.1038/s41467-022-30695-9 External link
4.
Ronneberger O, Fischer P, Brox T. U-Net: Convolutional Networks for Biomedical Image Segmentation. ArXiv. 2015. arXiv:1505.04597. DOI: 10.48550/arXiv.1505.04597 External link
5.
Cao Y, Liu S, Peng Y, Li J. DenseUNet: densely connected UNet for electron microscopy image segmentation. IET Image Process. 2020;14:2682-2689. DOI: 10.1049/iet-ipr.2019.1527 External link
6.
Jha D, Smedsrud PH, Riegler MA, Johansen D, et al. ResUNet++: An Advanced Architecture for Medical Image Segmentation. In: Proceedings of the IEEE International Symposium on Multimedia (ISM), vol. 162. 2019. p. 225-230.
7.
Oktay O, Schlemper J, Folgoc LL, Lee MJ, Heinrich MP, et al. Attention U-Net: Learning Where to Look for the Pancreas. ArXiv. 2018. arXiv:1804.03999. DOI: 10.48550/arXiv.1804.03999 External link