Semiparametric support vector machine for accelerated failure time model

  • Hwang, Chang-Ha (Department of Statistics, Dankook University) ;
  • Shim, Joo-Yong (Department of Applied Statistics, Catholic University of Daegu)
  • Received : 2010.05.07
  • Accepted : 2010.07.05
  • Published : 2010.07.31

Abstract

For the accelerated failure time (AFT) model a lot of effort has been devoted to develop effective estimation methods. AFT model assumes a linear relationship between the logarithm of event time and covariates. In this paper we propose a semiparametric support vector machine to consider situations where the functional form of the effect of one or more covariates is unknown. The proposed estimating equation can be computed by a quadratic programming and a linear equation. We study the effect of several covariates on a censored response variable with an unknown probability distribution. We also provide a generalized approximate cross-validation method for choosing the hyper-parameters which affect the performance of the proposed approach. The proposed method is evaluated through simulations using the artificial example.

Keywords

Acknowledgement

Supported by : National Research Foundation of Korea (NRF)

References

  1. Buckley, J. and James, I. (1979). Linear regression with censored data. Biometrika, 66, 429-436. https://doi.org/10.1093/biomet/66.3.429
  2. Chen, K., Shen, J. and Ying, Z. (2005). Rank estimation in partial linear model with censored data. Statistica Sinica, 15, 767-779.
  3. Cox, D. R. (1972). Regression models and life-tables (with discussion). Journal of the Royal Statistical Society, 34, 187-220.
  4. Green, P. J. and Silverman, B. W. (1994). Nonparametric regression and generalized linear models, Chapman and Hall, London.
  5. Gunn, S. (1998). Support vector machines for classification and regression, ISIS Technical Report, Uni- versity of Southampton.
  6. Jin, Z., Lin, D. Y., Wei, L. J. and Ying, Z. L. (2003). Rank-based inference for the accelerated failure time model. Biometrika, 90, 341-353. https://doi.org/10.1093/biomet/90.2.341
  7. Jones, M. P. (1997). A class of semiparametric regressions for the accelerated failure time model. Biometrika, 84, 73-84. https://doi.org/10.1093/biomet/84.1.73
  8. Kalbfleisch, J. D. and Prentice, R. L. (2002). The statistical analysis of failure time data (2nd edn) , Wiley, Hoboken.
  9. Kaplan, E. L. and Meier, P. (1958). Nonparametric estimation from incomplete observations. Journal of the American Statistical Association, 66, 484-491.
  10. Kim, M. S., Park, H. J., Hwang, C. and Shim, J. (2008). Claims reserving via kernel machine. Journal of Korean Data & Information Science Society, 19, 1419-1427.
  11. Kimeldorf, G. S. and Wahba, G. (1971). Some results on tchebychean spline functions. Journal of Mathematical Analysis and its Applications, 33, 82-95. https://doi.org/10.1016/0022-247X(71)90184-3
  12. Koul, H., Susarla, V. and Van Ryzin, J. (1981). Regression analysis with randomly right censored data. Annals of Statistics, 9, 1276-1288. https://doi.org/10.1214/aos/1176345644
  13. Kuhn, H. W. and Tucker, A. W. (1951). Nonlinear programming, Proceedings of 2nd Berkeley Symposium, Berkeley, University of California Press, Berkeley.
  14. Li, Y., Liu, Y. and Ji, Z. (2007). Quantile regression in reproducing kernel Hilbert spaces. Journal of the American Statistical Association, 102, 255-268. https://doi.org/10.1198/016214506000000979
  15. Mercer, J. (1909). Functions of positive and negative type and their connection with theory of integral equations. Philosophical Transactions of the Royal Society, A, 415-446.
  16. Orbe, J., Ferreira, E. and Nunez-Anton, V. (2003). Censored partial regression. Biostatistics, 4, 109-121. https://doi.org/10.1093/biostatistics/4.1.109
  17. Shim, J. (2005a). Censored kernel ridge regression. Journal of Korean Data & Information Science Society, 16, 1045-1052.
  18. Shim, J. (2005b). E-SVR using IRWLS Procedure. Journal of Korean Data & Information Science Society, 16, 1087-1094.
  19. Shim, J. and Seok, K. H. (2008). Kernel Poisson regression for longitudinal data. Journal of Korean Data & Information Science Society, 19, 1353-1360.
  20. Stute, W. (1993). Consistent estimation under random censorship when covariables are available. Journal of Multivariate Analysis, 45, 89-103. https://doi.org/10.1006/jmva.1993.1028
  21. Tsiatis, A. A. (1990). Estimating regression parameters using linear rank tests for censored data. Annals of Statistics, 18, 303-328. https://doi.org/10.1214/aos/1176347502
  22. Vapnik, V. N. (1998). Statistical learning theory, Wiley, New York.
  23. Wei, L. J., Ying, Z. L. and Lin, D. Y. (1990). Linear regression analysis of censored survival data based on rank tests. Biometrika, 77, 845-851. https://doi.org/10.1093/biomet/77.4.845
  24. Yuan, M. (2006). GACV for quantile smoothing splines. Computational Statistics and Data Analysis, 50, 813-829. https://doi.org/10.1016/j.csda.2004.10.008
  25. Zhou, M. (1992). M-estimation in censored linear models. Biometrika, 79, 837-841. https://doi.org/10.1093/biomet/79.4.837