gms | German Medical Science

65th Annual Meeting of the German Association for Medical Informatics, Biometry and Epidemiology (GMDS), Meeting of the Central European Network (CEN: German Region, Austro-Swiss Region and Polish Region) of the International Biometric Society (IBS)

06.09. - 09.09.2020, Berlin (online conference)

Estimating the number of non-exposed cases using risk factor prevalence and risk function

Meeting Abstract

Search Medline for

  • Annette Aigner - Institute of Biometry and Clinical Epidemiology, Charité – Universitätsmedizin Berlin, Berlin, Germany
  • Heiko Becher - Institut für Medizinische Biometrie und Epidemiologie, Universitätsklinikum Hamburg-Eppendorf, Hamburg, Germany

Deutsche Gesellschaft für Medizinische Informatik, Biometrie und Epidemiologie. 65th Annual Meeting of the German Association for Medical Informatics, Biometry and Epidemiology (GMDS), Meeting of the Central European Network (CEN: German Region, Austro-Swiss Region and Polish Region) of the International Biometric Society (IBS). Berlin, 06.-09.09.2020. Düsseldorf: German Medical Science GMS Publishing House; 2021. DocAbstr. 463

doi: 10.3205/20gmds239, urn:nbn:de:0183-20gmds2396

Published: February 26, 2021

© 2021 Aigner et al.
This is an Open Access article distributed under the terms of the Creative Commons Attribution 4.0 License. See license information at http://creativecommons.org/licenses/by/4.0/.


Outline

Text

National mortality statistics commonly provide disease-specific absolute and relative frequencies and rates of death by sex and age, but not by exposure status. However, it is often of interest to know how many of the diseased individuals, i.e. the cases, were exposed or not exposed to a specific risk factor.

For many risk factors a proportion of the population is not exposed, and among the exposed exposure has a continuous distribution, for example smoking or alcohol consumption. These variables are called semi-continuous or spike at zero variables.

We present a method to estimate the proportion and subsequently the absolute number of exposed and non-exposed cases. The method requires an estimate of the exposure prevalence in the non-diseased population and an estimate of the relative effect of the exposure. The latter is either a relative risk function given X has a continuous distribution for X>0, or a set of risk estimates for all exposure categories given X is categorical. In the continuous case, an estimate for the proportion of non-exposed cases (p01) is given by p01 = p00 / (p00 + ∫ R(x) f(x) dx), where p00 is the proportion of the non-exposed in the non-diseased, R(x) is the risk function denoting the relative risk given dose x compared to dose 0, and f(x) is the density function of X in the non-diseased.

We provide theoretical justification for the method and suggest approaches for sensitivity analyses. The method is applied to the estimation of the proportion and number of never smokers among lung cancer deaths in Germany.

The authors declare that they have no competing interests.

The authors declare that an ethics committee vote is not required.