DOI QR코드

DOI QR Code

Conditional bootstrap confidence intervals for classification error rate when a block of observations is missing

  • Chung, Hie-Choon (Department of Healthcare Management, Gwangju University) ;
  • Han, Chien-Pai (Department of Mathematics, University of Texas at Arlington)
  • Received : 2012.11.29
  • Accepted : 2013.01.17
  • Published : 2013.01.31

Abstract

In this paper, it will be assumed that there are two distinct populations which are multivariate normal with equal covariance matrix. We also assume that the two populations are equally likely and the costs of misclassification are equal. The classification rule depends on the situation whether the training samples include missing values or not. We consider the conditional bootstrap confidence intervals for classification error rate when a block of observation is missing.

Keywords

References

  1. Anderson, T. W. (1951). Classification by multivariate analysis. Psychometrika, 16, 31-50. https://doi.org/10.1007/BF02313425
  2. Anderson, T. W. (1957). Maximum likelihood estimates for a multivariate normal distribution when some observations are missing. Journal of the American Statistical Association, 52, 200-203. https://doi.org/10.1080/01621459.1957.10501379
  3. Anderson, T. W. (1984). An introduction to multivariate statistical analysis, John Wiley and Sons, New York.
  4. Buckland, S. T. (1983). Monte Carlo methods for confidence interval estimation using the bootstrap technique. Bias, 10,194-212.
  5. Buckland, S. T. (1984). Monte Carlo confidence intervals. Biometrics, 40, 811-817. https://doi.org/10.2307/2530926
  6. Buckland, S. T. (1985). Calculation of Monte Carlo confidence intervals. Journal of the Royal Statistical Society C, 297-301.
  7. Chan, L. S. and Dunn, O. J. (1972). The treatment of missing values in discriminant analysis-1. The sampling experiment. Journal of the American Statistical Association, 67, 473-477.
  8. Chan, L. S. and Dunn, O. J. (1974). A note on the asymptotical aspect of the treatment of missing values in discriminant analysis. Journal of the American Statistical Association, 69, 672-673. https://doi.org/10.1080/01621459.1974.10480186
  9. Chung, H. and Han, C. (2000). Discriminant analysis when a block of observations is missing. Annals of the Institute of Statistical Mathematics, 52, 544-556. https://doi.org/10.1023/A:1004129706000
  10. Dempster, A. P., Laird, N. M. and Rubin, R. J. A. (1977). Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society B, 39, 302-306.
  11. Diciccio, T. J. and Efron, B. (1996). Bootstrap confidence intervals. Statistical Science, 11, 189-228. https://doi.org/10.1214/ss/1032280214
  12. DiCiccio, T. J. and Romano, J. P. (1988). A review of bootstrap confidence intervals. Journal of the Royal Statistical Society B, 50, 338-354.
  13. Dorvlo, A. S. S. (1992). An interval estimation of the probability of misclassification. Journal of Mathematical Analysis and Application, 171, 389-394. https://doi.org/10.1016/0022-247X(92)90352-E
  14. Efron, B. (1982). The jackknife, the bootstrap, and other resampling plans, CBMS-NSF Regional Conference Series in Applied Mathematics, 38, Society for Industrial and Applied Mathematics(SIAM), Philadelphia.
  15. Efron, B. (1987). Better bootstrap confidence intervals. Journal of the American Statistical Association, 82, 171-200. https://doi.org/10.1080/01621459.1987.10478410
  16. Fisher, R. A. (1936). The use of multiple measurements in taxonomic problems. Annals of Eugenics, 7, 179-188. https://doi.org/10.1111/j.1469-1809.1936.tb02137.x
  17. Hall, P. (1986a). On the bootstrap and confidence intervals. Annals of Statistics, 14, 1431-1452. https://doi.org/10.1214/aos/1176350168
  18. Hall, P. (1986b). On the number of bootstrap simulations required to construct a confidence interval. Annals of Statistics, 14, 1453-1462. https://doi.org/10.1214/aos/1176350169
  19. Hinkley, D. V. (1988). Bootstrap methods. Journal of the Royal Statistical Society B, 50, 321-337.
  20. Hocking, R. R. and Smith, W. B. (1968). Estimation of parameters in the mutivariate normal distribution with missing observation. Journal of the American Statistical Association, 63, 159-173. https://doi.org/10.2307/2283837
  21. Johnson, R. A. and Wichern, D. W. (2002). Applied multivariate statistical analysis, Prentice Hall, New Jersey.
  22. Twedt, D. J. and Gill, D. S. (1992). Comparison of algorithm for replacing missing data in discriminant analysis. Communications in Statistics-Theory and Methods, 21, 1567-1578. https://doi.org/10.1080/03610929208830864

Cited by

  1. Analysis of English abstracts in Journal of the Korean Data & Information Science Society using topic models and social network analysis vol.26, pp.1, 2015, https://doi.org/10.7465/jkdi.2015.26.1.151
  2. Estimation of confidence interval in exponential distribution for the greenhouse gas inventory uncertainty by the simulation study vol.24, pp.4, 2013, https://doi.org/10.7465/jkdi.2013.24.4.825