Conditional bootstrap confidence intervals for classification error rate when a block of observations is missing

Chung, Hie-Choon; Han, Chien-Pai

doi:10.7465/jkdi.2013.24.1.189

Journal of the Korean Data and Information Science Society

Volume 24, Issue 1, Pages 189-200, 2013, pISSN 1598-9402

The Korean Data and Information Science Society (한국데이터정보과학회)


Chung, Hie-Choon (Department of Healthcare Management, Gwangju University)
Han, Chien-Pai (Department of Mathematics, University of Texas at Arlington)

Received : 2012.11.29
Accepted : 2013.01.17
Published : 2013.01.31

https://doi.org/10.7465/jkdi.2013.24.1.189

Abstract

In this paper, we assume that there are two distinct populations, each multivariate normal, with a common covariance matrix. We also assume that the two populations are equally likely and that the costs of misclassification are equal. The classification rule depends on whether the training samples include missing values. We consider conditional bootstrap confidence intervals for the classification error rate when a block of observations is missing.
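For orientation, the assumptions above (two multivariate normal populations, common covariance matrix, equal priors, equal costs) lead, in the complete-data case, to the standard linear discriminant rule. The block below is a sketch of that rule and of its error rate conditional on the training samples. It is not the authors' rule for the missing-block case, and the symbols $\bar{x}_i$, $S$, $\mu_i$, $\Sigma$, $m_i$, $V$, and $\Phi$ are notation introduced here for illustration.

```latex
% Assumed notation: \bar{x}_1, \bar{x}_2 are the training sample means,
% S is the pooled sample covariance matrix, \mu_i and \Sigma are the true
% parameters, and \Phi is the standard normal distribution function.
\[
  W(x) = \Bigl[x - \tfrac{1}{2}(\bar{x}_1 + \bar{x}_2)\Bigr]' S^{-1} (\bar{x}_1 - \bar{x}_2),
  \qquad
  \text{assign } x \text{ to } \pi_1 \text{ if } W(x) \ge 0, \text{ otherwise to } \pi_2 .
\]
% Given the training samples, W(x) is normal when x comes from \pi_i, with
\[
  m_i = \Bigl[\mu_i - \tfrac{1}{2}(\bar{x}_1 + \bar{x}_2)\Bigr]' S^{-1} (\bar{x}_1 - \bar{x}_2),
  \qquad
  V = (\bar{x}_1 - \bar{x}_2)' S^{-1} \Sigma S^{-1} (\bar{x}_1 - \bar{x}_2).
\]
% Conditional misclassification probabilities and the overall error rate
% under equal prior probabilities:
\[
  P(2 \mid 1) = \Phi\!\left(-\frac{m_1}{\sqrt{V}}\right),
  \qquad
  P(1 \mid 2) = \Phi\!\left(\frac{m_2}{\sqrt{V}}\right),
  \qquad
  \text{error rate} = \tfrac{1}{2}\bigl[P(2 \mid 1) + P(1 \mid 2)\bigr].
\]
```

Because $\mu_i$ and $\Sigma$ are unknown in practice, this error rate has to be estimated, and a bootstrap confidence interval is one way to quantify the uncertainty of such an estimate across resampled training sets.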



