Statistical Analysis of Clustered Interval-Censored Data with Informative Cluster Size Kim, Yang-Jin; Yoo, Han-Na;
Interval-censored data are commonly found in studies of diseases that progress without symptoms, which require clinical evaluation for detection. Several techniques have been suggested with independent assumption. However, the assumption will not be valid if observations come from clusters. Furthermore, when the cluster size relates to response variables, commonly used methods can bring biased results. For example, in a study on lymphatic filariasis, a parasitic disease where worms make several nests in the infected person's lymphatic vessels and reside until adulthood, the response variable of interest is the nest-extinction times. Since the extinction times of nests are checked by repeated ultrasound examinations, exact extinction times are not observed. Instead, data are composed of two examination points: the last examination time with living worms and the first examination time with dead worms. Furthermore, as Williamson et al. (2008) pointed out, larger nests show a tendency for low clearance rates. This association has been denoted as an informative cluster size. To analyze the relationship between the numbers of nests and interval-censored nest-extinction times, this study proposes a joint model for the relationship between cluster size and clustered interval-censored failure data.
EM algorithm;frailty effect;informative cluster size;interval censored data;
Bellamy, S., Li, Y., Ryan, L. M., Lipsitz, S., Canner, M. and Wright, R. (2005). Analysis of clustered and interval censored data from a community-based study in asthma, Statistics in Medicine, 34, 3607–3621.
Catalano, P. and Ryan, L. M. (1992). Bivariate latent variable models for clustered discrete and continuous outcomes, Journal of the American Statistical Association, 87, 651–658.
Cong, X. J., Yin, G. and Shen, Y. (2007). Marginal analysis of correlated failure time data with informative cluster sizes, Biometrics, 63, 663–672.
Dunson, D. B., Chen, Z. and Harry, J. (2003). A bayesian approach for joint modeling of clusger size and subunit-specific outcome, Biometrics, 59, 521–530.
Finkelstein, D. M. Goggins,W. B. and Schoenfeld, D. A. (2002). Analysis of failure time data with dependent interval censoring, Biometrics, 58, 298–304.
Hoffman, E., Sen, P. and Weinberg, C. (2001). Within-cluster resampling, Biometrika, 88, 1121–1134.
Kim, Y. J. (2010). Regression Analysis of Clustered Interval-Censored Data with Informative Cluster Size, Technical Report
Liu, L., Huang, X. and O’Quigley, J. (2008). Analysis of longitudinal data in the presence of informative observational times and a dependent terminal event, with application to medical cost data, Biometrics, 64, 950–958.
Sun, J. (2006). The Statistical Analysis of Interval-censored Failure Time Data, Springer-Verlag, New-York.
Turnbull, B.W. (1976). The empirical distribution function with arbitrarily grouped censored and truncated data, Journal of the Royal Statistical Society, Series B, 38, 290–295.
Williamson, J., Datta, S. and Satten, G. (2003). Marginal analysis of clustered data when cluster size is informative, Biometrics, 59, 36–42.
Williamson, J., Kim, H., Manatunga, A. and Addiss, D. (2008). Modeling survival data with informative cluster size, Statistics in Medicine, 27, 543–555.
Zhang, X. and Sun, J. (2010). Regression analysis of clustered interval-censored failure time data with informative cluster size, Computational Statistics and Data Analysis, 54, 1817–1823.