Logistic Regression Method in Interval-Censored Data Yun, Eun-Young; Kim, Jin-Mi; Ki, Choong-Rak;
In this paper we propose a logistic regression method to estimate the survival function and the median survival time in interval-censored data. The proposed method is motivated by the data augmentation technique with no sacrifice in augmenting data. In addition, we develop a cross validation criterion to determine the size of data augmentation. We compare the proposed estimator with other existing methods such as the parametric method, the single point imputation method, and the nonparametric maximum likelihood estimator through extensive numerical studies to show that the proposed estimator performs better than others in the sense of the mean squared error. An illustrative example based on a real data set is given.
Cross validation;imputation method;Kaplan-Meier estimator;median survival time;non-parametric maximum likelihood estimation;survival function;
Efron, B. (1967). The two sample problem with censored data, In Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability, University of California Press, Berkeley, 831-853.
Fang, H., Sun, J. and Lee, M-L. T. (2002). Nonparametric survival comparison for interval-censored continuous data, Statistica Sinica, 12, 1073-1083.
Finkelstein, D. M. (1986). A proportional hazards model for interval-censored failure time data, Biometrics, 42, 845-854.
Finkelstein, D. M. and Wolfe, R. A. (1985). A semiparametric model for regression analysis of intervalcensored failure time data, Biometrics, 41, 933-945.
Groeneboom, P. and Wellner, J. A. (1992). Information Bounds and Nonparametric Maximum likelihood Estimation, DMV Seminar, Band 19, Birkhauser, New York.
Huang, J. (1996). Efficient estimation for the proportional hazards model with interval censoring, The Annals of Statistics, 24, 540-568.
Hudgens, M. G. (2005). On nonparametric maximum likelihood estimation with interval censoring and left truncation, Journal of the Royal Statistical Society, Series B, 67, 573-587.
Jongbloed, G. (1998). The iterative convex minorant algorithm for nonparametric estimation, Journal of Computational and Graphical Statistics, 7, 310-321.
Kaplan, E. L. and Meier, P. (1958). Nonparametric estimation from incomplete observations, Journal of the American Statistical Association, 53, 457-481.
Kim, C., Park, B. U., Kim, W. and Lim, C. (2003). Bezier curve smoothing of the Kaplan-Meier estimator, The Annals of the Institute of Statistical Mathematics, 55, 359-367.
Lawless, J. F. and Babineau, D. (2006). Models for interval censoring and simulation-based inference for lifetime distributions, Biometrika, 93, 671-686.
Lindsey, J. C. and Ryan, L. M. (1998). Tutorial in biostatistics: Methods for interval-censored data, Statistics in Medicine, 17, 219-238.
Peto, R. (1973). Experimental survival curves for interval-censored data, Applied Statistics, 22, 86-91.
Rubin, D. B. (1987). Multiple Imputation for Noresponse in Surveys, Wiley, New York.
Sun, J. (2006). The Statistical Analysis of Interval-Censored Failure Time Data, Springer.
Tanner, M. A. and Wong, W. H. (1987). The application of Imputation to an estimation problem in grouped lifetime analysis, Technometrics, 29, 23-32.
Tanner, M. A. (1991). Tools for Statistical Inference: Observed Data and Data Augmentation Methods, Springer-Verlag, New York.
Turnbull, B. W. (1976). The empirical distribution with arbitrarily grouped censored and truncated data, Journal of the Royal Statistical Society, Series B, 38, 290-295.
Wellner, J. and Zhan, Y. (1997). A hybrid algorithm for computation of the nonparametric maximum likelihood estimator from censored data, Journal of the American Statistical Association, 92, 945-959.