gms | German Medical Science

51. Jahrestagung der Deutschen Gesellschaft für Medizinische Informatik, Biometrie und Epidemiologie

Deutsche Gesellschaft für Medizinische Informatik, Biometrie und Epidemiologie e. V. (gmds)

10. - 14.09.2006, Leipzig

A new bivariate survival model including a non-susceptible fraction

Meeting Abstract

Suche in Medline nach

  • Andreas Wienke - Universität Halle, Halle

Deutsche Gesellschaft für Medizinische Informatik, Biometrie und Epidemiologie e.V. (gmds). 51. Jahrestagung der Deutschen Gesellschaft für Medizinische Informatik, Biometrie und Epidemiologie. Leipzig, 10.-14.09.2006. Düsseldorf, Köln: German Medical Science; 2006. Doc06gmds314

Die elektronische Version dieses Artikels ist vollständig und ist verfügbar unter:

Veröffentlicht: 1. September 2006

© 2006 Wienke.
Dieser Artikel ist ein Open Access-Artikel und steht unter den Creative Commons Lizenzbedingungen ( Er darf vervielfältigt, verbreitet und öffentlich zugänglich gemacht werden, vorausgesetzt dass Autor und Quelle genannt werden.



Traditional analysis of time-to-event data is based on two main assumptions: first, independence of observed event times and second, all individuals are susceptible to the event of interest and will eventually experience this event if the follow-up is sufficiently long.

Various approaches are investigated to overcome the first restriction of independence, known as multivariate survival models. Available statistical models fall into two broad classes - marginal and frailty models. Marginal methods of analysis specify models for the effect of covariates on the hazards of the individual events (the margins), taking into account the fact that observed event times are correlated but without the need for explicitly modelling this correlation. The association between the events is considered as a nuisance parameter. As with the analysis of longitudinal data, regression parameters are estimated from generalized estimating equations, and the corresponding variance-covariance estimators are corrected properly to account for the dependence structure.

The marginal approach is ideal for making inference on the population average effect of risk factors on failure time. However, it provides no insight into the multivariate relationship among failure times. These types of questions are answered by frailty models, considering the association between various events explicitly. In general, frailty models have an intuitive appeal and provide insight into the relationship between failures, and we will focus on this approach in the present paper.

Frailty models have been used frequently for modelling dependence in multivariate time-to-event data. The dependence usually arises because individuals in the same group (family, litter, study centre) are related to each other or because of the multiple recurrence of an event for the same individual. The traditional proportional hazards model can not be applied to these cases. A possible solution to this problem is the use of conditional proportional hazards given the frailty. Here, the variability of lifetimes is formulated as arising from two different sources: first, natural variability, which is included in the baseline hazard function; second, frailty. Lifetimes are conditionally independent given the frailty (as individual random effect), and the frailty term represents unobserved covariates. We assume that, given unobserved frailty, the hazard for each individual survival time follows a proportional hazards model, with the frailty variable (the random effect) acting multiplicatively on the baseline hazard function.

To overcome the second strong assumption in general survival models mentioned above cure models are introduced into survival analysis. Cure models allow for a fraction of individuals in the population who are not susceptible to the event under study. Univariate cure models are well established in the literature, but only a few papers exist dealing with bivariate cure models, combining the concepts of frailty and cure modelling [1], [2], [3]. Chatterjee and Shih used a shared frailty model with an extra latent variable to model a non-susceptible fraction. Extending this approach Wienke et al. applied a correlated gamma frailty model. Moger and Aalen suggest a compound Poisson frailty model with random scale, without additional latent variables. But their survival function needs a recursive representation. The main aim of the present paper is to establish a simple bivariate frailty model with cure fraction.

To avoid the limitations of the often used shared frailty models, correlated frailty models are being developed for the analysis of multivariate failure time data [2], [4], [5], [6], in which associated random variables are used to characterize the frailty effect for each cluster. Correlated frailty models provide not only variance parameters of the frailties as in shared frailty models, but they also contain additional parameters for modelling the correlation between frailties in each group.

Data on Swedish twins was collected by the Swedish Twin Registry, the largest population based twin registry in the world. Information about onset of cancer was obtained from the National Cancer Registry by regularly matching both registries. Unfortunately, such kind of registers does not commonly contain detailed information about individual risk factors. To account for this missing information in the analysis, frailty models can be used. In these models the frailty variable represents varying levels of risk of different individuals in the study population.

A new frailty model is suggested for analysis of bivariate time-to-event data. The model is an extension of the correlated PVF frailty model (correlated three-parameter frailty model) introduced by Yashin and Iachine [7] and has never before been considered in bivariate survival studies. It is based on a bivariate extension of the compound Poisson frailty model introduced to univariate survival analysis by Aalen [8]. It is also related to the extended correlated gamma frailty model by Wienke [2]. It allows for a non-susceptible fraction in the population, overcoming the common assumption in survival analysis that all individuals are susceptible to the event under study. The survival function of the correlated compound Poisson frailty model is given by Figure 1 [Fig. 1].

Parameter γ divides the class of distributions in two major subfamilies: For γ ≥ 0 the distribution is a power variance function distribution (PVF). The extension to γ < 0 in the univariate case was suggested by Aalen [8] and shown to yield the compound Poisson distribution. The two subclasses are separated by the gamma distribution γ = 0. The model is applied to breast cancer data of 5857 Swedish twin pairs to estimate the correlations between frailties of both monozygotic (ρMZ) and dizygotic (ρDZ) twins as well as the size of the susceptible fraction. Results of the analysis are given in Table 1 [Tab. 1].

The inverse Gaussian model is given by γ=0.5. Parameter values of γ < 0 imply the existence of a non-susceptible fraction in the population. Parameter φ describes the size of the susceptible fraction and σ is the standard deviation of the frailty. The log-likelihood is given in the last row of the table. The Results are compared to the analysis by Wienke et al. [2].


Chatterjee N, Shih J. A bivariate cure-mixture approach for modeling familial association in diseases. Biometrics. 2001; 57: 779 - 786.
Wienke A, Lichtenstein P, Yashin AI. A bivariate frailty model with a cure fraction for modeling familial correlations in diseases. Biometrics. 2003; 59: 1178 - 1183.
Moger TA, Aalen OO. A distribution for multivariate frailty based on the compound Poisson distribution with random scale. Lifetime Data Analysis. 2005; 11: 41 - 59.
Yashin AI, Vaupel JW, Iachine IA Correlated individual frailty: an advantageous approach to survival analysis of bivariate data. Mathematical Population Studies. 1995; 5: 145 - 159.
Ripatti S, Palmgren J. Estimation of multivariate frailty models using penalized partial likelihood. Biometrics. 2000; 56: 1016 - 1022.
Ripatti S, Larsen K, Palmgren J. Maximum likelihood inference for multivariate frailty models using an automated Monte Carlo EM algorithm. Lifetime Data Analysis. 2002; 8: 349-360.
Yashin AI, Begun A, Iachine IA. Genetic factors in susceptibility to death: a comparative analysis of bivariate survival models. Journal of Epidemiology and Biostatistics. 1999; 4: 53-60.
Aalen OO. Modelling heterogeneity in survival analysis by the compound Poisson distribution. Annals of Applied Probability. 1992; 4: 951 - 972