gms | German Medical Science

66. Jahrestagung der Deutschen Gesellschaft für Medizinische Informatik, Biometrie und Epidemiologie e. V. (GMDS), 12. Jahreskongress der Technologie- und Methodenplattform für die vernetzte medizinische Forschung e. V. (TMF)

26. - 30.09.2021, online

Reporting on Data Quality Approaches in Health Information Systems

Meeting Abstract

  • Lucas Triefenbach - Medizinische Fakultät der RWTH Aachen, Aachen, Germany
  • Jonas Bienzeisler - Medizinische Fakultät der RWTH Aachen, Aachen, Germany
  • Atinkut Alamirrew Zeleke - Universität Greifswald, Greifswald, Germany
  • Alexander Kombeiz - Medizinische Fakultät der RWTH Aachen, Aachen, Germany
  • Ronny Otto - Universitätsklinikum Magdeburg A.ö.R., Magdeburg, Germany
  • Raphael W. Majeed - Medizinische Fakultät der RWTH Aachen, Aachen, Germany
  • Rainer Röhrig - Medizinische Fakultät der RWTH Aachen, Aachen, Germany

Deutsche Gesellschaft für Medizinische Informatik, Biometrie und Epidemiologie. 66. Jahrestagung der Deutschen Gesellschaft für Medizinische Informatik, Biometrie und Epidemiologie e. V. (GMDS), 12. Jahreskongress der Technologie- und Methodenplattform für die vernetzte medizinische Forschung e.V. (TMF). sine loco [digital], 26.-30.09.2021. Düsseldorf: German Medical Science GMS Publishing House; 2021. DocAbstr. 25

doi: 10.3205/21gmds066, urn:nbn:de:0183-21gmds0665

Published: September 24, 2021

© 2021 Triefenbach et al.
This is an Open Access article distributed under the terms of the Creative Commons Attribution 4.0 License. See license information at http://creativecommons.org/licenses/by/4.0/.


Outline

Text

Introduction: In healthcare research, the aggregation and processing of medical data for secondary use in health information systems such as registries or electronic health records is increasing [1]. Errors in data acquisition affect primary data level users [2], but also secondary users such as decisions support systems and research [3], [4]. In recent years, several reviews of data quality in healthcare have explored how data quality can be measured and is being measured [5], [6], [7], [8], [9]. While these works indicate a wealth of literature concerning the development and assessment of data quality indices, there seems to be a gap in literature with regard to how data quality issues are approached.

Methods: We conducted a preliminary scoping review using the Joanna Briggs Institute methodology [10]. Our approach consisted of a literature search using a Medline search limited to the works from 01.01.2018 till 25.03.2021 and including terms related to data quality and electronic health/medical records.

Inspired by the risk management for technology induced errors [11] we developed a classification to capture the procedural nature of different works. Fundamentally, a data quality management process consists of three basic components: Data Control (DC), the process of improving data quality by interventions, Data Quality Assessment (DQA), the process of quantifying and evaluating data quality, and Continuous Data Quality Monitoring (Monitoring), the process to control monitoring of data quality. The works were classified based on these categories by 2 team members independently, and a third team member was used to resolve conflicts.

Results: The searches in Medline and inclusion of manually searched literature resulted in 591 records to be screened by titles and abstracts. Out of these, 126 records were chosen for full-text assessment, with 20 being excluded for being inaccessible or out of scope. Finally, 104 records were classified using the previously presented classification. Out of 104 records 53 were describing DQA, 26 DC, 1 Monitoring, 2 DQA & Monitoring, 9 DQA & DC and 1 DC & Monitoring. 10 works included the complete process consisting of all 3 processes.

Discussion and Conclusion: As hypothesized, the main focus of current literature is on DQA methodology, while the second most popular type of works is DC on its own. Monitoring is not discussed in isolation, which seems to be a logical consequence of monitoring being dependent on DQA. Within its time limitation, our approach identifies a bias towards high level approaches in data quality reporting. Most of the current approaches focus on one to two aspects of the data quality management process. While raising awareness of data quality issues seems to be the current scope of literature, more evidence for the functioning of generic data quality methodology seems indicated. The current version of the scoping review is limited to the PubMed database search and the analysis of the theme to the main classification levels. We anticipate addressing those issues in the upcoming extended versions of this work and to publish them in a peer reviewed journal.

Supplemental Material: The classification is available from: https://github.com/rwth-imi/Supplemental_Material/raw/master/classifikation_papers_Reporting_on_data_quality_approaches.xlsx

The authors declare that they have no competing interests.

The authors declare that an ethics committee vote is not required.


References

1.
Schlegel DR, Ficheur G. Secondary Use of Patient Data: Review of the Literature Published in 2016. Yearb Med Inform. 2017; 26(1):68–71. DOI: 10.15265/IY-2017-032 External link
2.
Kohn LT, Corrigan JM, Donaldson MS. To Err is Human: Building a Safer Health System. Washington (DC): The National Academies Press; 2000. DOI: 10.17226/9728 External link
3.
Malin JL, Keating NL. The cost-quality trade-off: need for data quality standards for studies that impact clinical practice and health policy. J Clin Oncol. 2005; 23(21):4581–4. DOI: 10.1200/JCO.2005.01.912. External link
4.
Coiera E, Westbrook J, Wyatt J. The safety and quality of decision support systems. Yearb Med Inform. 2006:20–5.
5.
Kahn MG, Callahan TJ, Barnard J, Bauck AE, Brown J, Davidson BN, et al. A Harmonized Data Quality Assessment Terminology and Framework for the Secondary Use of Electronic Health Record Data. EGEMS (Wash DC). 2016; 4(1):1244. DOI: 10.13063/2327-9214.1244 External link
6.
Callahan TJ, Bauck AE, Bertoch D, Brown J, Khare R, Ryan PB, et al. A Comparison of Data Quality Assessment Checks in Six Data Sharing Networks. EGEMS (Wash DC). 2017; 5(1):8. DOI: 10.5334/egems.223 External link
7.
Bian J, Lyu T, Loiacono A, Viramontes TM, Lipori G, Guo Y, et al. Assessing the practice of data quality evaluation in a national clinical data research network through a systematic scoping review in the era of real-world data. J Am Med Inform Assoc. 2020; 27(12):1999–2010. DOI: 10.1093/jamia/ocaa245 External link
8.
Weiskopf NG, Bakken S, Hripcsak G, Weng C. A Data Quality Assessment Guideline for Electronic Health Record Data Reuse. EGEMS (Wash DC). 2017; 5(1):14. DOI: 10.5334/egems.218 External link
9.
Weiskopf NG, Weng C. Methods and dimensions of electronic health record data quality assessment: enabling reuse for clinical research. J Am Med Inform Assoc. 2013; 20(1):144–51. DOI: 10.1136/amiajnl-2011-000681 External link
10.
von Elm E, Schreiber G, Haupt CC. Methodische Anleitung für Scoping Reviews (JBI-Methodologie). Z Evid Fortbild Qual Gesundhwes. 2019; 143:1–7. DOI: 10.1016/j.zefq.2019.05.004 External link
11.
Borycki E, Dexheimer JW, Hullin Lucay Cossio C, Gong Y, Jensen S, Kaipio J, Kennebeck S, Kirkendall E, Kushniruk AW, Kuziemsky C, Marcilly R, Röhrig R, Saranto K, Senathirajah Y, Weber J, Takeda H. Methods for Addressing Technology-induced Errors: The Current State. Yearb Med Inform. 2016;25(01):30–40. DOI: 10.15265/IY-2016-029 External link