gms | German Medical Science

68. Jahrestagung der Deutschen Gesellschaft für Medizinische Informatik, Biometrie und Epidemiologie e. V. (GMDS)

17.09. - 21.09.23, Heilbronn

DrugOn: A Comprehensive Drug Ontology for Precision Oncology

Meeting Abstract

  • Kevin Kornrumpf - Dept. of Medical Bioinformatics, University Medical Center Göttingen, Göttingen, Germany
  • Vera Gnass - Dept. of Medical Bioinformatics, University Medical Center Göttingen, Göttingen, Germany
  • Myrine Holm - Dept. of Medical Bioinformatics, University Medical Center Göttingen, Göttingen, Germany
  • Raphael Koch - Dept. for Hematology and Medical Oncology, University Medical Center Göttingen, Göttingen, Germany; Comprehensive Cancer Center Lower Saxony (CCC-N), Hannover, Germany
  • Jürgen Dönitz - Dept. of Medical Bioinformatics, University Medical Center Göttingen, Göttingen, Germany; Comprehensive Cancer Center Lower Saxony (CCC-N), Hannover, Germany; Campus Institute Data Science (CIDAS), Göttingen, Germany

Deutsche Gesellschaft für Medizinische Informatik, Biometrie und Epidemiologie. 68. Jahrestagung der Deutschen Gesellschaft für Medizinische Informatik, Biometrie und Epidemiologie e. V. (GMDS). Heilbronn, 17.-21.09.2023. Düsseldorf: German Medical Science GMS Publishing House; 2023. DocAbstr. 326

doi: 10.3205/23gmds140, urn:nbn:de:0183-23gmds1402

Veröffentlicht: 15. September 2023

© 2023 Kornrumpf et al.
Dieser Artikel ist ein Open-Access-Artikel und steht unter den Lizenzbedingungen der Creative Commons Attribution 4.0 License (Namensnennung). Lizenz-Angaben siehe http://creativecommons.org/licenses/by/4.0/.


Gliederung

Text

Introduction: In precision oncology and biomedical cancer research the support of tools is needed to select the best drugs based on the patients’ or cell lines’ genetic background. The challenge is to normalize the different annotations, i.e., substance, drug name or drug class to the same level, most often to the drug class. A manual curated classification is time consuming and may be biased towards the indented effect. Existing resources do not cover all levels, are missing granularity or unify drug class and target into the same classification. Decision support and large data research will benefit from a drug ontology that is built automatically and covers the characteristics of a drug in a structured way.

Methods: Here we introduce DrugOn, a drug ontology that incorporates information from multiple public data sources. By accessing and processing data from the Drugbank database [1] and other drug classification systems such as ATC codes and MeSH terms, we create a new ontology combining the most relevant drug-related information based on these sources. Using a selection process, we can identify the best options in terms of drug classes and categories, molecular targets as well as outlinks and other annotations. In this way, the most appropriate drug categories and targets can be identified for each drug based on the data sources.

Results: 192 drugs found in two lymphoma research projects served as test cases for the evaluation and were annotated by an expert. These drugs are assigned to 117 classes and 380 targets in the automated classification. Compared to manual classification, DrugOn had equivalent drug classes for 154 out of 192 oncology drugs. In 20 cases, no output was possible, due to missing data from public sources. DrugOn can be queried via a REST API. In addition, a front-end application provides the ability to view and discover the ontology and make individual queries about drugs.

Discussion: DrugOn is an automatically built drug ontology based on public data sources completed by some processing rules. The result is an unbiased and easy to update resource for oncological drugs. The ontology distinguishes between drug class and the target protein. Through the structured ontology format and the API DrugOn can be included in manual or automated pipelines.

Conclusion: DrugOn has its origin in lymphoma projects with large amount of data. It can be easily extended to other cancer entities. Possible extensions are to include more sources, e.g., to also include descriptions of gene functions. In the current form DrugOn is already used in research projects and in the variant interpretation framework Onkopus for molecular tumor boards. The front-end version is available at https://mtb.bioinf.med.uni-goettingen.de/drugon-web.

Kevin Kornrumpf and Vera Gnass: equal contribution

The authors declare that they have no competing interests.

The authors declare that an ethics committee vote is not required.


References

1.
Wishart DS, Knox C, Guo AC, Shrivastava S, Hassanali M, Stothard P, Chang Z, Woolsey J. DrugBank: a comprehensive resource for in silico drug discovery and exploration. Nucleic Acids Res. 2006 Jan 1;34(Database issue):D668-72. DOI: 10.1093/nar/gkj067 Externer Link