gms | German Medical Science

65th Annual Meeting of the German Association for Medical Informatics, Biometry and Epidemiology (GMDS), Meeting of the Central European Network (CEN: German Region, Austro-Swiss Region and Polish Region) of the International Biometric Society (IBS)

06.09. - 09.09.2020, Berlin (online conference)

Preprocessing of direct infusion high resolution mass spectrometry data from metabolomics studies

Meeting Abstract

Search Medline for

  • Silke Szymczak - Christian-Albrechts-Universität zu Kiel, Kiel, Germany

Deutsche Gesellschaft für Medizinische Informatik, Biometrie und Epidemiologie. 65th Annual Meeting of the German Association for Medical Informatics, Biometry and Epidemiology (GMDS), Meeting of the Central European Network (CEN: German Region, Austro-Swiss Region and Polish Region) of the International Biometric Society (IBS). Berlin, 06.-09.09.2020. Düsseldorf: German Medical Science GMS Publishing House; 2021. DocAbstr. 218

doi: 10.3205/20gmds370, urn:nbn:de:0183-20gmds3700

Published: February 26, 2021

© 2021 Szymczak.
This is an Open Access article distributed under the terms of the Creative Commons Attribution 4.0 License. See license information at http://creativecommons.org/licenses/by/4.0/.


Outline

Text

The focus of metabolomics studies are the end products of cellular processes that provide a snapshot of the physiological state in a

cell, tissue, or organism. Direct infusion methods coupled with high resolution mass analyzers avoid chromatography based separation of samples.

However, preprocessing and annotation of the resulting mass spectrometry data pose additional challenges since the spectra contain only information on mass-to-charge ratio (mz values) and intensities.

A new pipeline is proposed to extract intensities of relevant features and to provide annotation to the most probable metabolite. First, metabolites are identified that would be theoretically measurable in the conducted experiment, e.g. based on Lipid Maps or the Human Metabolome Database. Possible adducts and isotopes are calculated using the R package enviPat, defining the set of features of interest. Second, intensities are extracted for each feature based on local maxima as implemented in the R package FTICRMS and combined across samples. Third, features are filtered, e.g. based on detection in quality control and blank samples. Fourth, features are annotated using a Bayesian approach that integrates information about isotope patterns as well as adduct relationships and is implemented in the R package IPA.

Data from a study available in the MetaboLights database is used to show that the new pipeline identifies a similar number of metabolites and that thedetection of metabolite differences between the two disease groups and healthy controls can be reproduced.

The pipeline is implemented as an R package and will be made publicly available.

The authors declare that they have no competing interests.

The authors declare that an ethics committee vote is not required.