gms | German Medical Science

65th Annual Meeting of the German Association for Medical Informatics, Biometry and Epidemiology (GMDS), Meeting of the Central European Network (CEN: German Region, Austro-Swiss Region and Polish Region) of the International Biometric Society (IBS)

06.09. - 09.09.2020, Berlin (online conference)

Shrinkage method for estimating the occurrence probability of a repeated measured binary variable

Meeting Abstract

Suche in Medline nach

  • Sven Knüppel - Federal Institute for Risk Assessment (BfR), Berlin, Germany
  • Christine Müller-Graf - Federal Institute for Risk Assessment (BfR), Berlin, Germany

Deutsche Gesellschaft für Medizinische Informatik, Biometrie und Epidemiologie. 65th Annual Meeting of the German Association for Medical Informatics, Biometry and Epidemiology (GMDS), Meeting of the Central European Network (CEN: German Region, Austro-Swiss Region and Polish Region) of the International Biometric Society (IBS). Berlin, 06.-09.09.2020. Düsseldorf: German Medical Science GMS Publishing House; 2021. DocAbstr. 129

doi: 10.3205/20gmds219, urn:nbn:de:0183-20gmds2196

Veröffentlicht: 26. Februar 2021

© 2021 Knüppel et al.
Dieser Artikel ist ein Open-Access-Artikel und steht unter den Lizenzbedingungen der Creative Commons Attribution 4.0 License (Namensnennung). Lizenz-Angaben siehe http://creativecommons.org/licenses/by/4.0/.


Gliederung

Text

In order to obtain more precise information, larger and larger studies with more and more measurements are being carried out. Suitable statistical methods are needed to analyze such a large amount of data. A special case is the analysis of repeated measurements. For example, in the large multi-center German National Cohort [1], also known as NAKO Gesundheitsstudie, the 24-hour food list [2] is repeatedly applied to estimate the individuals' consumption probability. For a large number of foods, only the question is asked whether or not they were consumed the day before. A first choice for modeling such repeatedly measured binary outcome variables is the logistic mixed model with random effects. The drawback is that this model has a very long run-time for large sample sizes. Therefore, as an alternative, the Multiple Source Method (MSM) [3] was revised to estimate the probability of the occurrence of a repeatedly measured binary variable for the application in large studies. The MSM uses a shrinkage technique of the residuals from ordinary logistic regression. These residuals are mathematically reduced to sums and thus no complex maximization algorithms are necessary, which are required when applying mixed models. Therefore, the MSM allows a fast calculation and can be applied in large studies. The result of a simulation study is used to represent the properties of the revised multiple source method.

The authors declare that they have no competing interests.

The authors declare that an ethics committee vote is not required.


References

1.
German National Cohort (GNC) Consortium. The German National Cohort: aims, study design and organization. Eur J Epidemiol. 2014 May;29(5):371-82. DOI: 10.1007/s10654-014-9890-7 Externer Link
2.
Knüppel S, Clemens M, Conrad J, Gastell S, Michels KB, Leitzmann M, Krist L, Pischon T, Krause G, Ahrens W, Ebert N, Jöckel KH, Kluttig A, Obi N, Kaaks R, Lieb W, Schipf S, Brenner H, Heuer T, Harttig U, Linseisen J, Nöthlings U, Boeing H. Design and characterization of dietary assessment in the German National Cohort. Eur J Clin Nutr. 2019 11;73(11):1480-1491. DOI: 10.1038/s41430-018-0383-8 Externer Link
3.
Haubrock J, Nöthlings U, Volatier JL, Dekkers A, Ocké M, Harttig U, Illner AK, Knüppel S, Andersen LF, Boeing H; European Food Consumption Validation Consortium. Estimating usual food intake distributions by using the multiple source method in the EPIC-Potsdam Calibration Study. J Nutr. 20