gms | German Medical Science

66. Jahrestagung der Deutschen Gesellschaft für Medizinische Informatik, Biometrie und Epidemiologie e. V. (GMDS), 12. Jahreskongress der Technologie- und Methodenplattform für die vernetzte medizinische Forschung e. V. (TMF)

26. - 30.09.2021, online

Exact tests for binary endpoints

Meeting Abstract

Search Medline for

  • Samuel Kilian - Institute of Medical Biometry and Informatics, University of Heidelberg, Heidelberg, Germany
  • Laura Benner - Institute of Medical Biometry and Informatics, University of Heidelberg, Heidelberg, Germany
  • Meinhard Kieser - Institute of Medical Biometry and Informatics, University of Heidelberg, Heidelberg, Germany

Deutsche Gesellschaft für Medizinische Informatik, Biometrie und Epidemiologie. 66. Jahrestagung der Deutschen Gesellschaft für Medizinische Informatik, Biometrie und Epidemiologie e. V. (GMDS), 12. Jahreskongress der Technologie- und Methodenplattform für die vernetzte medizinische Forschung e.V. (TMF). sine loco [digital], 26.-30.09.2021. Düsseldorf: German Medical Science GMS Publishing House; 2021. DocAbstr. 239

doi: 10.3205/21gmds086, urn:nbn:de:0183-21gmds0867

Published: September 24, 2021

© 2021 Kilian et al.
This is an Open Access article distributed under the terms of the Creative Commons Attribution 4.0 License. See license information at http://creativecommons.org/licenses/by/4.0/.


Outline

Text

Introduction: The comparison of a binary endpoint between two groups is a common objective in biostatistics. In this case, the exact distribution of the number of successes in each group is known to be a binomial distribution. Since calculations with binomial probabilities are done easily and quickly, this situation suggests the use of exact tests. A number of exact tests for superiority have been proposed and shown to perform better than approximate tests, especially for small sample sizes [1]. Furthermore, exact tests for non-inferiority have been proposed for different effect measures [2]. However, usage of these methods is rare in practice. This might be due to the fact that commonly used software like R do not provide satisfying functionality to calculate p-values, power, sample size, and confidence regions for exact tests for superiority or non-inferiority of two independent binomial probabilities.

Methods: In this talk, we present a general method to construct and implement exact tests for difference, superiority, non-inferiority, or for any other null hypothesis formulated in terms of two independent binomial probabilities pE and pC. Starting with an arbitrary null hypothesis and an ordering criterion (i.e. test statistic), the critical value can be calculated exactly by performing a grid search over the null parameter space of (pE, pC). Analogously, the exact maximum p-value can be calculated. We explain Barnard’s convexity condition (BCC, [3]) and the advantage of test statistics satisfying BCC. Furthermore, we show the construction of confidence regions which are consistent with the test decision. For exact sample size calculation, we present a general iterative method based on the exact power for a specified alternative.

Results: We apply our general approach to commonly used test statistics for hypotheses formulated in terms of the risk difference, the risk ratio [4], and the odds ratio [5]. Calculations are less extensive since all test statistics satisfy BCC. We show that the maximum p-value can be calculated efficiently by finding the roots of the derivative. We compare type I error and power of the original tests and their exact alternatives. It is seen that the exact sample size calculation can be performed relatively fast. Furthermore, the exact sample sizes are compared with the approximate sample sizes.

Discussion: We see that the confidence regions which are consistent with the test decision are not always connected and we discuss the implications. Furthermore, we discuss the choice of the test statistic and its implications for the power.

Conclusion: We conclude that the exact calculation of critical value, p-value, sample size, and confidence region is straightforward. For commonly used test statistics, these calculations can be done efficiently and fast. This is especially true for small sample sizes where exact tests have a great advantage regarding type I error control as compared to approximate tests. In essence, there is no reason not to use exact tests for testing superiority or non-inferiority of two independent binomial proportions.

The authors declare that they have no competing interests.

The authors declare that an ethics committee vote is not required.


References

1.
Lydersen S, Fagerland MW, Laake P. Recommended tests for association in 2 x 2 tables. Stat Med. 2009 Mar 30;28(7):1159-75. DOI: 10.1002/sim.3531 External link
2.
Chan IS. Proving non-inferiority or equivalence of two treatments with dichotomous endpoints using exact methods. Stat Methods Med Res. 2003 Jan;12(1):37-58. DOI: 10.1191/0962280203sm314ra External link
3.
Barnard GA. Significance tests for 2×2 tables. Biometrika. 1947 Jan;34(1-2):123–138. DOI: 10.1093/biomet/34.1-2.123 External link
4.
Farrington CP, Manning G. Test statistics and sample size formulae for comparative binomial trials with null hypothesis of non‐zero risk difference or non-unity relative risk. Statist Med. 1990;9:1447-1454. DOI: 10.1002/sim.4780091208 External link
5.
Wellek S. Statistical methods for the analysis of two-arm non-inferiority trials with binary outcomes. Biom J. 2005 Feb;47(1):48-61; discussion 99-107. DOI: 10.1002/bimj.200410090 External link