gms | German Medical Science

65th Annual Meeting of the German Association for Medical Informatics, Biometry and Epidemiology (GMDS), Meeting of the Central European Network (CEN: German Region, Austro-Swiss Region and Polish Region) of the International Biometric Society (IBS)

06.09. - 09.09.2020, Berlin (online conference)

Analysis of Interventional Workflow Phases based on Image Classification

Meeting Abstract

  • Jennifer Maier - Pattern Recognition Lab, Department of Computer Science, Friedrich-Alexander-Universität Erlangen-Nürnberg, Erlangen, Germany, Erlangen, Germany
  • Julia Schottenhamml - Pattern Recognition Lab, Department of Computer Science, Friedrich-Alexander-Universität Erlangen-Nürnberg, Erlangen, Germany, Erlangen, Germany
  • Prathmesh Madhu - Pattern Recognition Lab, Department of Computer Science, Friedrich-Alexander-Universität Erlangen-Nürnberg, Erlangen, Germany, Erlangen, Germany
  • Cristiano André da Costa - Software Innovation Laboratory, Universidade do Vale do Rio dos Sinos, Sao Leopoldo, Brazil, Sao Leopoldo, Brazil
  • Andreas Maier - Pattern Recognition Lab, Department of Computer Science, Friedrich-Alexander-Universität Erlangen-Nürnberg, Erlangen, Germany, Erlangen, Germany

Deutsche Gesellschaft für Medizinische Informatik, Biometrie und Epidemiologie. 65th Annual Meeting of the German Association for Medical Informatics, Biometry and Epidemiology (GMDS), Meeting of the Central European Network (CEN: German Region, Austro-Swiss Region and Polish Region) of the International Biometric Society (IBS). Berlin, 06.-09.09.2020. Düsseldorf: German Medical Science GMS Publishing House; 2021. DocAbstr. 119

doi: 10.3205/20gmds187, urn:nbn:de:0183-20gmds1876

Veröffentlicht: 26. Februar 2021

© 2021 Maier et al.
Dieser Artikel ist ein Open-Access-Artikel und steht unter den Lizenzbedingungen der Creative Commons Attribution 4.0 License (Namensnennung). Lizenz-Angaben siehe http://creativecommons.org/licenses/by/4.0/.


Gliederung

Text

Background: Interventional workflow analysis can help to increase the quality and efficiency of performed procedures, which are two important factors in the medical domain [1]. A useful tool for analyzing medical workflow is video-based phase annotation of procedures, since the duration of certain phases can give evidence about issues in the processes in order to propose suggestions for improvement [2]. We present an approach for classifying frames of intervention videos in order to label phases of these procedures.

Methods: The videos in our dataset show interventional procedures recorded at different locations during the LINC meeting congresses [3]. In each video, after an opening title the case is presented using slides, followed by the actual procedure. The procedure itself is a live catheter examination, a presentation of fluoroscopic images of a previously performed procedure, or a mixture of both. Occasionally, full screen images of the treating physicians are shown. Our data set consists of ten videos recorded at five locations A to E, where the number of videos recorded at the locations is A=3, B=3, C=2, D=1 and E=1. This low amount of data and the fact that some videos are compiled of recordings at different time points currently prevent a time sequence analysis for phase annotation. Instead, we extract one image per second, apply image classification, and define the phases of the videos based on the classes. All extracted video frames are manually assigned to one of the five classes opening (95), fluoroscopy (9376), fluoroscopy overview (196), humans (315), and presentation (1546). For an automatic classification of the images, we use a ResNet18 pre-trained on ImageNet data, since residual networks performed well on real world images and have shown positive results for transfer learning in the medical domain [4], [5]. The network is adapted to our task by training on a part of the available videos. The data set is split in training, validation and test set depending on the location in order to avoid overfitting on certain treating physicians. This results in 7044 frames for training (A, D and E), 1860 frames for validation (B), and 2624 frames for testing (C). The classified images are used to assign phase labels to the test videos. Figure 1 [Fig. 1].

Results: The trained network predicted the correct class for 99% of the test images. Because of this high accuracy, video phases were assigned reliably with average time deviations of 1.10 ± 0.94 seconds at phase changes. Incorrectly classified images led to short jumps in the phase annotations, which could in most cases be removed by requiring a minimum phase length of 3 seconds.

Conclusion: Our proposed method can classify frames of procedure videos with a high accuracy and enables an annotation of video phases. These phases are a first step towards interventional workflow analysis and could be improved by incorporating temporal information from more videos. Live procedures and previously recorded treatments are currently not differentiated, since the images in both cases are similar. Here, the fluoroscopy overview frames and the audio signal could give hints and will be included in a future analysis.

The authors declare that they have no competing interests.

The authors declare that an ethics committee vote is not required.


References

1.
Cheung YY, Goodman EM, Osunkoya TO. No More Waits and Delays: Streamlining Workflow to Decrease Patient Time of Stay for Image-guided Musculoskeletal Procedures. RadioGraphics. 2016;36(3):856-871.
2.
Hauptmann AG, Gao J, Yan R, Qi Y, Yang J, Wactlar HD. Automated analysis of nursing home observations. IEEE Pervasive Computing. 2004;3(2):15-21.
3.
Provascular GmbH. Leipzig Interventional Course – interdisciplinary global platform for endovascular interventions. 2020. Available from: https://www.leipzig-interventional-course.com/ Externer Link
4.
He K, Zhang X, Ren S, Sun J. Deep Residual Learning for Image Recognition. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR); 2016 Jun 27-30; Las Vegas, NV. p. 770-778.
5.
Yu Y, Lin H, Meng J, Wei X, Guo H, Zhao Z. Deep transfer learning for modality classification of medical images. Information. 2017;8(3):91.