gms | German Medical Science

66. Jahrestagung der Deutschen Gesellschaft für Medizinische Informatik, Biometrie und Epidemiologie e. V. (GMDS), 12. Jahreskongress der Technologie- und Methodenplattform für die vernetzte medizinische Forschung e. V. (TMF)

26. - 30.09.2021, online

Auditing Framework of Machine Learning Models Applied in Medicine – State of the Art and Idea of MeDICLMU Prototype

Meeting Abstract

  • Markus Schwarz - MeDICLMU – Medical Data Integration Center, Zentrum für Medizinische Datenintegration und -analyse, LMU Klinikum, Planegg, Germany
  • Ludwig Hinske - Institut für medizinische Informationsverarbeitung, Biometrie und Epidemiologie, LMU, München, Germany; Klinik für Anaesthesiologie, Klinikum der Universität München, München, Germany
  • Ulrich Mansmann - Institut für medizinische Informationsverarbeitung, Biometrie und Epidemiologie, LMU, München, Germany
  • Fady Albashiti - MeDICLMU – Medical Data Integration Center, Zentrum für Medizinische Datenintegration und -analyse, LMU Klinikum, Planegg, Germany

Deutsche Gesellschaft für Medizinische Informatik, Biometrie und Epidemiologie. 66. Jahrestagung der Deutschen Gesellschaft für Medizinische Informatik, Biometrie und Epidemiologie e. V. (GMDS), 12. Jahreskongress der Technologie- und Methodenplattform für die vernetzte medizinische Forschung e.V. (TMF). sine loco [digital], 26.-30.09.2021. Düsseldorf: German Medical Science GMS Publishing House; 2021. DocAbstr. 190

doi: 10.3205/21gmds010, urn:nbn:de:0183-21gmds0103

Veröffentlicht: 24. September 2021

© 2021 Schwarz et al.
Dieser Artikel ist ein Open-Access-Artikel und steht unter den Lizenzbedingungen der Creative Commons Attribution 4.0 License (Namensnennung). Lizenz-Angaben siehe http://creativecommons.org/licenses/by/4.0/.


Gliederung

Text

Introduction: Artificial intelligence (AI) methods, especially machine learning (ML), experiences a recent boost in medical applications, including diagnostics, treatment and post treatment evaluation [1], [2]. Algorithmic ML models promise a better performance and speed compared to human individuals or expert groups for those cases.

State of the Art

This research project is embedded in the field of Auditable AI, which is related to Explainable AI. Recently, two popular initiatives have been brought forth. The first addresses the question of who is performing the audit and how the process should look like [3]. The second is an AI regulatory framework proposal that has a stronger emphasis on risk assessment [4]. In the area of Explainable AI, there exist key concepts and tools like model diagnostics or experiment databases [5]. Those aim at ML designers, researchers or users to empirically investigate algorithms on different tasks and data in a collaborative manner.

Having identified an algorithm that works for a specific use case in conjunction with a specific patient population is an achievement. However, for potential users, more ingredients are required to avoid a black box effect and to increase trust in AI’s decision outcomes [6].

Concept: At the Medical Data Integration Center of LMU University (MeDICLMU), for our framework, we want to emphasize current and future medical use cases we have in scope. Additionally, we will consider those, where the LMU Klinikum has special medical expertise.

Our ML auditing catalog utilizes general auditing principles like being evidence based, being objective and having transparent criteria. The three components in scope of the framework are: medical use case, algorithm and the patient (population) data. As of current planning, the auditing will be conducted on a logically meaningful combination of the aforementioned components.

The genesis of the catalog shall be threefold: Definition of dimensions, standard auditing procedures and consolidation into result. So far, the following relevant dimensions have been identified: Maturity, Transparency, Clinical Acceptance, Risk, Performance and Interpretability.

Implementation: First, we do a fit and gap analysis of existing frameworks, leading to the MeDICLMU auditing criteria catalog. Second, we specify, experiment with and iteratively implement a minimum viable product (MVP). Finally, we validate the MVP at MeDICLMU in the LMU Klinikum. Important elements of the MVP include an assessment dashboard that clarifies the intended use, assumptions, internal structures, training data and audit results of the selected component combination. In a second version, we want to automate the assessment steps of the MVP.

Lessons Learned: The described hypothetical design of the ML auditing criteria and procedures is work in progress. Limited access to algorithms, data and proprietary studies can pose a challenge that might require creative solutions. Essential are also defined minimum quality requirements for each of the three components, as entry criteria to our framework.

Our intended contribution aims at making ML model predictions being used more often, more effectively and more safely in the healthcare market. We want to raise trust and transparency on AI/ML model’s decision, especially for clinicians or healthcare researchers.

The authors declare that they have no competing interests.

The authors declare that an ethics committee vote is not required.


References

1.
Napoli R. AI’s impact on healthcare industry. Healthcare Tech Outlook. 2019.
2.
Bresnick J. Artificial Intelligence in Healthcare Market to See 40% CAGR Surge. Investors and vendors are feeling very optimistic about the growth of artificial intelligence in healthcare. Available from: https://healthitanalytics.com/news/artificial-intelligence-in-healthcare-market-to-see-40-cagr-surge Externer Link
3.
U.S. Food & Drug Administration. Proposed Regulatory Framework for Modifications to Artificial Intelligence/Machine Learning (AI/ML)-Based Software as a Medical Device (SaMD). FDA; 2019. Available from: https://www.fda.gov/media/122535/download Externer Link
4.
Directorate-General for Communications Networks, Content and Technology European Commission.  Regulatory framework proposal on Artificial Intelligence. European Commission Digital Strategy. 2021. Available from: https://digital-strategy.ec.europa.eu/en/policies/regulatory-framework-ai Externer Link
5.
Vanschoren J. Understanding Machine Learning Performance with Experiment Databases. Eindhoven University of Technology; 2010. Available from: https://www.researchgate.net/publication/289380311_Understanding_Machine_Learning_Performance_with_Experiment_Databases Externer Link
6.
Kuan R. Adopting AI in Health Care Will Be Slow and Difficult. Harvard Business Review. 2019. Available from: https://hbr.org/2019/10/adopting-ai-in-health-care-will-be-slow-and-difficult Externer Link