DOI QR코드

DOI QR Code

Power comparison of distribution-free two sample goodness-of-fit tests

이표본 분포 동일성에 대한 분포무관 검정법 간 검정력 비교 연구

  • Received : 2017.02.08
  • Accepted : 2017.06.14
  • Published : 2017.08.31

Abstract

Statistics are often used to test two samples if they have been drawn from the same underlying distribution. In this paper, we introduce several well-known distribution-free tests to compare distributions and conduct an extensive Monte-Carlo simulation to specify their behaviors. We consider various circumstances of when two distributions vary in (1) location, (2) scale, (3) symmetry, (4) kurtosis, (5) tail weight. A practical guideline for two-sample goodness-of-fit test is presented based on the simulation result.

두 표본 집단이 동일한 분포를 따르는지 비교하기 위해 분포무관 검정이 많이 사용된다. 하지만 여러 검정법을 체계적으로 비교한 연구가 존재하지 않아서 각 검정법의 특성을 고려하여 연구 상황에 맞는 검정법을 선택하기가 어려웠다. 본 연구에서는 이표본 분포 동일성 검정에 해당하는 여러 분포무관 검정법들을 소개하고 체계적인 모의실험을 통해 그 성능을 비교하고자 한다. 두 표본이 각각 (1) 위치, (2) 척도, (3) 왜도, (4) 첨도, (5) 꼬리가중치가 다른 분포에서 추출된 상황에 대해 실험하였다. 실험 결과를 바탕으로 이표본 분포 동일성 검정법 사용에 대한 실용적인 지침을 제시하려고 한다.

Keywords

References

  1. Anderson, T. W. and Darling, D. A. (1952). Asymptotic theory of certain "goodness of fit" criteria based on stochastic processes, The Annals of Mathematical Statistics, 23, 193-212. https://doi.org/10.1214/aoms/1177729437
  2. Anderson, T. W. (1962). On the distribution of the two-sample Cramer-von Mises criterion, The Annals of Mathematical Statistics, 33, 1148-1159. https://doi.org/10.1214/aoms/1177704477
  3. Bick, R. L., Adams, T., and Schmalhorst, W. R. (1976). Bleeding times, platelet adhesion, and aspirin, American Journal of Clinical Pathology, 65, 69-72. https://doi.org/10.1093/ajcp/65.1.69
  4. Bowley, A. L. (1920). Elements of Statistics (4th ed), P.S. King & Son, London.
  5. Burr, E. J. (1964). Small-sample distributions of the two-sample Cramer-von Mises' $W^2$ and Watson's $U^2$, The Annals of Mathematical Statistics, 35, 1091-1098. https://doi.org/10.1214/aoms/1177703267
  6. Cramer, H. (1928). On the composition of elementary errors: first paper: mathematical deductions, Scan-dinavian Actuarial Journal, 1928, 13-74. https://doi.org/10.1080/03461238.1928.10416862
  7. Crow, E. L. and Siddiqui, M. M. (1967). Robust estimation of location, Journal of the American Statistical Association, 62, 353-389. https://doi.org/10.1080/01621459.1967.10482914
  8. Darling, D. A. (1957). The Kolmogorov-Smirnov, Cramer-von Mises tests, The Annals of Mathematical Statistics, 28, 823-838. https://doi.org/10.1214/aoms/1177706788
  9. Delse, F. C. and Feather, B. W. (1968). The effect of augmented sensory feedback on the control of salivation. Psychophysiology, 5, 15-21. https://doi.org/10.1111/j.1469-8986.1968.tb02796.x
  10. Engmann, S. and Cousineau, D. (2011). Comparing distributions: the two-sample Anderson-Darling test as an alternative to the Kolmogorov-Smirnoff test, Journal of Applied Quantitative Methods, 6, 1-17.
  11. Epps, T. W. and Singleton, K. J. (1986). An omnibus test for the two-sample problem using the empirical characteristic function, Journal of Statistical Computation and Simulation, 26, 177-203. https://doi.org/10.1080/00949658608810963
  12. Goerg, S. J. and Kaiser, J. (2009). Nonparametric testing of distributions - The Epps-Singleton two-sample test using the empirical characteristic function. The Stata Journal, 9, 454-465.
  13. Hollander, M., Wolfe, D. A., and Chicken, E. (2014). Nonparametric Statistical Methods (3rd ed), John Wiley & Sons, Hoboken.
  14. Lilliefors, H. W. (1969). On the Kolmogorov-Smirnov test for the exponential distribution with mean unknown, Journal of the American Statistical Association, 64, 387-389. https://doi.org/10.1080/01621459.1969.10500983
  15. Mann, H. B. and Whitney, D. R. (1947). On a test of whether one of two random variables is stochastically larger than the other, The Annals of Mathematical Statistics, 18, 50-60. https://doi.org/10.1214/aoms/1177730491
  16. Massey, F. J. (1951). The Kolmogorov-Smirnov test for goodness of fit, Journal of the American Statistical Association, 46, 68-78. https://doi.org/10.1080/01621459.1951.10500769
  17. Ozcomak, M. S., Kartal, M., Senger, O., and Celik, A. K. (2013). Comparison of the powers of the Kolmogorov-Smirnov two-sample test and the Mann-Whitney test for different kurtosis and skewness coefficients using the Monte Carlo simulation method, Journal of Statistical and Econometric Methods, 2, 81-98.
  18. Pettitt, A. N. (1976). A two-sample Anderson-Darling rank statistic, Biometrika, 63, 161-168.
  19. Scholz, F. W. and Stephens, M. A. (1987). K-sample Anderson-Darling tests, Journal of the American Statistical Association, 82, 918-924.
  20. Smirnoff, N. (1939). Sur les ecarts de la courbe de la distribution empirique, Receuil Mathemathique (Matematiceskii Sbornik), 6, 3-26.
  21. von Mises, R. (1928). Wahrscheinlichkeit Statistik und Wahrheit, Springer-Verlag, Berlin.
  22. Wilcoxon, F. (1945). Individual comparisons by ranking methods, Biometrics Bulletin, 1, 80-83. https://doi.org/10.2307/3001968