gms | German Medical Science

67. Jahrestagung der Deutschen Gesellschaft für Medizinische Informatik, Biometrie und Epidemiologie e. V. (GMDS), 13. Jahreskongress der Technologie- und Methodenplattform für die vernetzte medizinische Forschung e. V. (TMF)

21.08. - 25.08.2022, online

How hazardous are hazard ratios? An empirical investigation in individual patient data from 28 large randomized clinical trials

Meeting Abstract

Suche in Medline nach

  • Alexandra Strobel - Martin-Luther University Halle-Wittenberg, Institute of Medical Epidemiology, Biostatistics and Informatics, Halle (Saale), Germany
  • Andreas Wienke - Martin-Luther University Halle-Wittenberg, Institute of Medical Epidemiology, Biostatistics and Informatics, Halle (Saale), Germany
  • Oliver Kuß - German Diabetes Center, Leibniz Center for Diabetes Research at Heinrich-Heine-University Düsseldorf, Institute for Biometrics and Epidemiology, Düsseldorf, Germany

Deutsche Gesellschaft für Medizinische Informatik, Biometrie und Epidemiologie. 67. Jahrestagung der Deutschen Gesellschaft für Medizinische Informatik, Biometrie und Epidemiologie e. V. (GMDS), 13. Jahreskongress der Technologie- und Methodenplattform für die vernetzte medizinische Forschung e.V. (TMF). sine loco [digital], 21.-25.08.2022. Düsseldorf: German Medical Science GMS Publishing House; 2022. DocAbstr. 111

doi: 10.3205/22gmds086, urn:nbn:de:0183-22gmds0861

Veröffentlicht: 19. August 2022

© 2022 Strobel et al.
Dieser Artikel ist ein Open-Access-Artikel und steht unter den Lizenzbedingungen der Creative Commons Attribution 4.0 License (Namensnennung). Lizenz-Angaben siehe http://creativecommons.org/licenses/by/4.0/.


Gliederung

Text

Introduction: Hazard ratios are almost exclusively used to communicate treatment effects in survival analysis. As results of Cox regression models, they are easy to implement and summarize the effect in a single number. However, the hazard ratio has also been criticized for a number of reasons in recent years. One criticism is the implied selection bias in randomized controlled trials (RCTs), which is an inherent consequence of the partial likelihood approach to parameter estimation. While the first factor in the partial likelihood is still characterized by a randomized comparison, all subsequent factors can potentially generate bias in case of large treatment effects and unobserved heterogeneity [1]. Theoretical considerations showed that estimated hazard ratios could be heavily biased in such situations [2]. However, there is no evidence of the practical relevance of this problem. The aim of the present work is thus to analyse the extent of this selection bias through an empirical study of 28 large RCTs and to derive appropriate recommendations for dealing with the interpretation of the hazard ratio in RCTs.

Method: For measuring selection bias, observations are successively deleted from the datasets which were sorted by observation time. Then, a Cox model is fitted for each reduced data set. In addition, the balance of two confounders (age and gender) is measured by z-differences [3]. This procedure of successive cancellation and application of a Cox model is repeated until the studies no longer contain a sufficient number of events. If there is unobserved heterogeneity in the studies, a systematic selection of patients is expected, which results in a bias in the hazard ratio. A small proof-of-concept simulation showed that this method could indeed identify such a selection bias. Finally, we ran these analyses in the individual patient data of a previously used sample of 28 RCTs [4].

Results: No systematic bias in the estimated treatment effect could be found for most of the studies. Apart from random fluctuations, the hazard ratios showed rather stable estimates in the course of deleting observations. The associated balance in the measured covariables age and gender also remained stable, indicating no imbalance of these confounders.

Discussion: The analyses showed that the criticism of hazard ratios as an effect measure for evaluating treatment effects is not justified. Selection bias due to unobserved heterogeneity have no practical relevance when evaluating survival data with the Cox model, which is partly due to the rare occurrence of large treatment effects in RCTs [5].

Conclusion: In summary, it was shown that the impressive warnings from various authors are of a theoretical nature and standard analyses with the Cox model are generally not distorted by the described effect. However, all other reported problems of hazard ratios in terms of interpretability or non-collapsibility stay virulent and should be taken serious by the biostatistical community.

The authors declare that they have no competing interests.

The authors declare that a positive ethics committee vote has been obtained.


References

1.
Hernán MA. The hazards of hazard ratios. Epidemiology. 2010;21(1):13-15. DOI: 10.1097/EDE.0b013e3181c1ea43 Externer Link
2.
Aalen OO, Cook RJ, Roysland K. Does Cox analysis of a randomized survival study yield a causal treatment effect? Lifetime Data Analysis. 2015;21(4):579-593. DOI: 10.1007/s10985-015-9335-y Externer Link
3.
Kuss O. The z-difference can be used to measure covariate balance in matched propensity score analyses. Journal of Clinical Epidemiology. 2013;66(11):1302-1307. DOI: 10.1016/j.jclinepi.2013.06.001 Externer Link
4.
Kent DM, Nelson J, Dahabreh IJ, Rothwell PM, Altman DG, Hayward RA. Risk and treatment effect heterogeneity: re-analysis of individual participant data from 32 large clinical trials. International journal of epidemiology. 2016;45(6): 2075-2088. DOI: 10.1093/ije/dyw118 Externer Link
5.
Djulbegovic B, Kumar A, Glasziou PP, Perera R, Reljic T, et al. New treatments compared to established treatments in randomized trials. Cochrane Database Syst Rev. 2012;10(10). DOI: 10.1002/14651858.MR000024.pub3 Externer Link