Genetic Mixed Effects Models for Twin Survival Data

Ha, Il-Do;Noh, Maengseok;Yoon, Sangchul

  • 발행 : 2005.12.01


Twin studies are one of the most widely used methods for quantifying the influence of genetic and environmental factors on some traits such as a life span or a disease. In this paper we propose a genetic mixed linear model for twin survival time data, which allows us to separate the genetic component from the environmental component. Inferences are based upon the hierarchical likelihood (h-likelihood), which provides a statistically efficient and simple unified framework for various random-effect models. We also propose a simple and fast computation method for analyzing a large data set on twin survival study. The new method is illustrated to the survival data in Swedish Twin Registry. A simulation study is carried out to evaluate the performance.


Environment effect;Genetic effect;Hierarchical likelihood;Random effects


  1. Agresti, A., Caffo, B. and Ohman-Strickland, P. (2004). Example in which misspecification of a random effects distribution reduces efficiency, and possible remedies. Computational Statistics and Data Analysis, 47, 639-653
  2. Ha, I. D. and Lee, Y. (2005a). Multilevel mixed linear models for survival data. Lifetime Data Analysis, 11, 131-142
  3. Ha, I. D. and Lee, Y. (2005b). Comparison of hierarchical likelihood versus orthodox best linear unbiased predictor approaches for frailty models. Biometrika, 92, 717-723
  4. Ha, I. D., Lee, Y. and Song, J-K (2002). Hierarchical likelihood approach for mixed linear models with censored data. Lifetime Data Analysis, 8, 163-176
  5. Henderson, C. R. (1975). Best linear unbiased estimation and prediction under a selection model. Biometrics, 31, 423-447
  6. Hougaard, P. (2000). Analysis of multivariate survival data. Springer-Verlag, New York
  7. Lambert, P., Collett, D., Kimber, A, and Johnson, R. (2004). Parametric accelerated failure time models with random effects and an application to kidney transplant survival. Statistics in Medicine. 23, 3177-3192
  8. Lee, Y. and NeIder, J A. (1996). Hierarchical generalized linear models (with discussion). Journal of the Royal Statistical Society, B, 58, 619-678
  9. Lee, Y. and Nelder, J A (2001a). Hierarchical generalised linear models: a synthesis of generalised linear models, random-effect models and structured dispersions. Biometrika, 88, 987-1006
  10. Lee, Y. and Nelder, J A. (2001b). Modelling and analysing correlated non-normal data. Statistical Modelling, 1, 3-16
  11. Neal, M. C. and Cardon, L. R. (1992). Methodology for genetic studies of twin and families. Kluwer Academic: Dordrecht
  12. Noh, M. and Lee, Y. (2004). REML estimation for binary data in GLMMs. A manuscript submitted for publication
  13. Noh, M., Ha, I. D. and Lee, Y. (2005). Dispersion frailty models and HGLMs. Statistics in Medicine, 24, in press
  14. Pawitan, Y., Reilly, M., Nilsson, E, Cnattingius, S. and Lichtenstein, P. (2004). Estimation of genetic and environmental factors for binary traits using family data. Statistics in Medicine, 23, 449-465
  15. Self, S. G. and Liang, K Y. (1987). Asymptotic properties of maximum likelihood estimators and likelihood ratio tests under nonstandard conditions. Journal of the American Statistical Association, 82, 605-610
  16. Sham, P. C. (1998). Statistics in Human Genetics. Arnold: London
  17. Vu, H. T. V. and Knuiman, M. W. (2002). A hybrid ML-EM algorithm for calculation of maximum likelihood estimates in semiparametrics shared frailty models. Computational Statistics and Data Analysis, 40, 173-187
  18. Wolfinger, R. D. (1999). Fitting nonlinear mixed models with the new NLMIXED procedure. Proceedings of the 99 Joint Statistical Meetings, 287
  19. Yashine, A. I., lachine, I. A. (1995). How long can humans live? Lower bound for biological limit of human longevity calculated from Danish twin data using correlated frailty model. Mechanisms of Ageing and Development, 80, 147-169
  20. Yashine, A I., Iachine, I. A and Harris, J. R (1999). Half of the variation in susceptibility to mortality is genetic: findings from Swedish twin survival data. Behavior Genetics, 29, 11-19
  21. Tobin, J. (1958). Estimation of relationship for limited dependent variables. Econometrica, 26, 24-36