ROC Function Estimation
 Title & Authors
ROC Function Estimation
Hong, Chong-Sun; Lin, Mei Hua; Hong, Sun-Woo;
From the viewpoint of credit evaluation, in which the population is divided into default and non-default states, two methods are considered for estimating the conditional distribution functions: one assumes that the data follow a normal mixture distribution, and the other uses kernel density estimation. The parameters of the normal mixture are estimated using the EM algorithm. For kernel density estimation, five well-known kernel functions and four bandwidths are explored. The corresponding ROC functions are then obtained from the estimated distribution functions. The goodness-of-fit of the estimated distribution functions is discussed, and the performance of the ROC functions is compared. It is found that the kernel distribution functions show a better fit, while the ROC function obtained under the normal mixture assumption shows better performance.
Bandwidth;density estimation;goodness-of-fit;normal mixture;kernel;performance;ROC function;
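As a rough illustration of the kernel-based route described in the abstract, the Python sketch below estimates the two conditional distribution functions with a kernel estimator and reads ROC points off them. It is not the authors' implementation. The Gaussian kernel, the rule-of-thumb bandwidth, the synthetic scores, and the convention that low scores indicate default are all assumptions made for illustration, and every function name is hypothetical.

```python
import random
from statistics import NormalDist, stdev

_STD_NORMAL = NormalDist()  # standard normal, used as the (assumed) Gaussian kernel


def rule_of_thumb_bandwidth(sample):
    """One common rule-of-thumb bandwidth: h = 1.06 * s * n^(-1/5)."""
    return 1.06 * stdev(sample) * len(sample) ** (-1 / 5)


def kernel_cdf(sample, h=None):
    """Return a Gaussian-kernel estimate of the distribution function of `sample`."""
    h = rule_of_thumb_bandwidth(sample) if h is None else h

    def cdf(x):
        # Average of kernel CDFs centred at each observation.
        return sum(_STD_NORMAL.cdf((x - xi) / h) for xi in sample) / len(sample)

    return cdf


def roc_points(default_scores, nondefault_scores, n_grid=101):
    """ROC points (F_n(t), F_d(t)) over a grid of thresholds t.

    Assumption: a score <= t is classified as default, so F_n(t) is the
    false-alarm rate and F_d(t) is the hit rate.
    """
    f_d = kernel_cdf(default_scores)
    f_n = kernel_cdf(nondefault_scores)
    lo = min(min(default_scores), min(nondefault_scores))
    hi = max(max(default_scores), max(nondefault_scores))
    grid = [lo + (hi - lo) * k / (n_grid - 1) for k in range(n_grid)]
    return [(f_n(t), f_d(t)) for t in grid]


def auc(points):
    """Trapezoidal area under the ROC points, anchored at (0, 0) and (1, 1)."""
    pts = [(0.0, 0.0)] + sorted(points) + [(1.0, 1.0)]
    return sum((x1 - x0) * (y0 + y1) / 2 for (x0, y0), (x1, y1) in zip(pts, pts[1:]))


# Synthetic scores for illustration only (not data from the paper).
random.seed(0)
defaults = [random.gauss(-1.0, 1.0) for _ in range(200)]
nondefaults = [random.gauss(1.0, 1.0) for _ in range(200)]

points = roc_points(defaults, nondefaults)
area = auc(points)
print(f"Estimated AUC: {area:.3f}")
```

The same `roc_points` function would work with any pair of estimated distribution functions, so a normal mixture fitted by EM could be substituted for `kernel_cdf` to compare the two approaches.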
 Cited by
Alternative Optimal Threshold Criteria: MFR, Hong, Chong-Sun; Kim, Hyo-Min; Kim, Dong-Kyu; Korean Journal of Applied Statistics, 2014, vol. 27, no. 5, pp. 773-786