NONPARAMETRIC MAXIMUM LIKELIHOOD ESTIMATION OF A CONCAVE RECEIVER OPERATING CHARACTERISTIC CURVE VIA GEOMETRIC PROGRAMMING

- Journal title : Bulletin of the Korean Mathematical Society
- Volume 48, Issue 3, 2011, pp.523-537
- Publisher : The Korean Mathematical Society
- DOI : 10.4134/BKMS.2011.48.3.523

Title & Authors

NONPARAMETRIC MAXIMUM LIKELIHOOD ESTIMATION OF A CONCAVE RECEIVER OPERATING CHARACTERISTIC CURVE VIA GEOMETRIC PROGRAMMING

Lee, Kyeong-Eun; Lim, Johan;

Lee, Kyeong-Eun; Lim, Johan;

Abstract

A receiver operating characteristic (ROC) curve plots the true positive rate of a classier against its false positive rate, both of which are accuracy measures of the classier. The ROC curve has several interesting geometrical properties, including concavity which is a necessary condition for a classier to be optimal. In this paper, we study the nonparametric maximum likelihood estimator (NPMLE) of a concave ROC curve and its modification to reduce bias. We characterize the NPMLE as a solution to a geometric programming, a special type of a mathematical optimization problem. We find that the NPMLE is close to the convex hull of the empirical ROC curve and, thus, has smaller variance but positive bias at a given false positive rate. To reduce the bias, we propose a modification of the NPMLE which minimizes the distance from the empirical ROC curve. We numerically compare the finite sample performance of three estimators, the empirical ROC curve, the NMPLE, and the modified NPMLE. Finally, we apply the estimators to estimating the optimal ROC curve of the variance-threshold classier to segment a low depth of field image and to finding a diagnostic tool with multiple tests for detection of hemophilia A carrier.

Keywords

concavity;geometric programming;receiver operating characteristic curve;

Language

English

References

1.

S. Boyd, S. Kim, L. Vandenberghe, and A. Hassibi, A tutorial on geometric programming, Optim. Eng. 8 (2007), no. 1, 67-127.

2.

S. Boyd and L. Vandenberghe, Convex Optimization, Cambridge University Press, Cambridge, 2004.

3.

G. Campbell and M. V. Ratnaparkhi, An application of Lomax distributions in receiver operating characteristic (ROC) curve analysis, Communication in Statistics 22 (1993), 1681-1697.

4.

D. D. Dorfman, K. S. Berbaum, C. E. Metz, R. V. Length, J. A. Hanlet, and H. A. Dagga, Proper receiver operating characteristic analysis: The bigamma model, Academic Radiology 4 (1996), 138-140.

5.

C. Feltz and R. Dykstra, Maximum likelihood estimation of the survival functions of N stochastically ordered random variables, J. Amer. Statist. Assoc. 80 (1985), no. 392, 1012-1019.

6.

P. A. Flach and S. Wu, Repairing concavities in ROC curves, Proceedings of 2003 UK Workshop on Computational Intelligence, 38-44, 2003.

7.

M. Grant, S. Boyd, and Y. Ye, CVX: Matlab software for disciplined convex programming, 2005; Available from http://www.stanford.edu/-boyd/cvx/.

8.

S. Johansen, The product limit estimator as maximum likelihood estimator, Scand. J. Statist. 5 (1978), no. 4, 195-199.

9.

R. A. Johnson and D. W. Wichern, Applied Multivariate Statistical Analysis, Third edition. Prentice Hall, Inc., Englewood Cliffs, NJ, 1992.

10.

E. Kaplan and P. Meier, Nonparametric estimation from incomplete observations, J. Amer. Statist. Assoc. 53 (1958), 457-481.

11.

J. Lim, S.-J. Kim, and X. Wang, Estimation of stochastically ordered survival func- tions via geometric programming, To appear in Journal of Compuational and Graphical Statistics; Available from http://www.stat.snu.ac.kr/johanlim/sorder_v3.pdf.

12.

J. Lim, X. Wang, and W. Choi, Matimum likelihood estimation of ordered multinomial parameters by geometric programing, Compuational Statistics and Data Analysis 53 (2009), 889-893.

15.

J. Lofberg, YALMIP. Yet Another LMI Parser, Version 2.4, 2003; Available from http: //control.ee.ethz.ch/-joloef/yalmip.php.

16.

M. W. McIntosh and M. S. Pepe, Combining several screening tests: optimality of the risk score, Biometrics 58 (2002), no. 3, 657-664.

17.

C. E. Metz and X. Pan, "Proper" binormal ROC curves: theory and maximum-likelihood estimation, J. Math. Psych. 43 (1999), no. 1, 1-33.

18.

A. Mutapcic, K. Koh, S. Kim, and S. Boyd, ggplab: A Matlab toolbox for geometric programming, 2006; Available from http://www.stanford.edu/-boyd/ggplab/.

19.

Y. Nesterov and A. Nemirovsky, Interior-Point Polynomial Methods in Convex Programming, volume 13 of Studies in Applied Mathematics, SIAM Philadelphia, PA, 1994.

20.

J. Norcedal and S. Wright, Numerical Optimization, Springer Series in Operations Research. Springer-Verlag, New York, 1999.

21.

X. Pan and C. E. Metz, The "proper" binomial model: Parametric receiver operating characteristic curve estimation with degenerate data, Academic Radiology 4 (1997), 380-389.

22.

M. S. Pepe, The Statistical Evaluation of Medical Tests for Classication and Prediction, Oxford University Press, Oxford, 2003.

23.

F. Preparata and S. J. Hong, Convex hulls of finite sets of points in two and three dimensions, Comm. ACM 20 (1977), no. 2, 87-93.