Power comparison of distribution-free two sample goodness-of-fit tests

Kim, Seon Bin;Lee, Jae Won;

doi:10.5351/KJAS.2017.30.4.513

The Korean Journal of Applied Statistics (응용통계연구)

Volume 30 Issue 4
/
Pages.513-528
/
2017
/
1225-066X(pISSN)
/
2383-5818(eISSN)

The Korean Statistical Society (한국통계학회)

DOI QR Code

Power comparison of distribution-free two sample goodness-of-fit tests

이표본 분포 동일성에 대한 분포무관 검정법 간 검정력 비교 연구

Kim, Seon Bin (Department of Statistics, Korea University) ;
Lee, Jae Won (Department of Statistics, Korea University)

김선빈 (고려대학교 통계학과) ;
이재원 (고려대학교 통계학과)

Received : 2017.02.08
Accepted : 2017.06.14
Published : 2017.08.31

https://doi.org/10.5351/KJAS.2017.30.4.513 Citation PDF KSCI

Download PDF

⟨ Previous Next ⟩

Abstract

Statistics are often used to test two samples if they have been drawn from the same underlying distribution. In this paper, we introduce several well-known distribution-free tests to compare distributions and conduct an extensive Monte-Carlo simulation to specify their behaviors. We consider various circumstances of when two distributions vary in (1) location, (2) scale, (3) symmetry, (4) kurtosis, (5) tail weight. A practical guideline for two-sample goodness-of-fit test is presented based on the simulation result.

두 표본 집단이 동일한 분포를 따르는지 비교하기 위해 분포무관 검정이 많이 사용된다. 하지만 여러 검정법을 체계적으로 비교한 연구가 존재하지 않아서 각 검정법의 특성을 고려하여 연구 상황에 맞는 검정법을 선택하기가 어려웠다. 본 연구에서는 이표본 분포 동일성 검정에 해당하는 여러 분포무관 검정법들을 소개하고 체계적인 모의실험을 통해 그 성능을 비교하고자 한다. 두 표본이 각각 (1) 위치, (2) 척도, (3) 왜도, (4) 첨도, (5) 꼬리가중치가 다른 분포에서 추출된 상황에 대해 실험하였다. 실험 결과를 바탕으로 이표본 분포 동일성 검정법 사용에 대한 실용적인 지침을 제시하려고 한다.

Keywords

References

Anderson, T. W. and Darling, D. A. (1952). Asymptotic theory of certain "goodness of fit" criteria based on stochastic processes, The Annals of Mathematical Statistics, 23, 193-212. https://doi.org/10.1214/aoms/1177729437
Anderson, T. W. (1962). On the distribution of the two-sample Cramer-von Mises criterion, The Annals of Mathematical Statistics, 33, 1148-1159. https://doi.org/10.1214/aoms/1177704477
Bick, R. L., Adams, T., and Schmalhorst, W. R. (1976). Bleeding times, platelet adhesion, and aspirin, American Journal of Clinical Pathology, 65, 69-72. https://doi.org/10.1093/ajcp/65.1.69
Bowley, A. L. (1920). Elements of Statistics (4th ed), P.S. King & Son, London.
Burr, E. J. (1964). Small-sample distributions of the two-sample Cramer-von Mises' $W^2$ and Watson's $U^2$, The Annals of Mathematical Statistics, 35, 1091-1098. https://doi.org/10.1214/aoms/1177703267
Cramer, H. (1928). On the composition of elementary errors: first paper: mathematical deductions, Scan-dinavian Actuarial Journal, 1928, 13-74. https://doi.org/10.1080/03461238.1928.10416862
Crow, E. L. and Siddiqui, M. M. (1967). Robust estimation of location, Journal of the American Statistical Association, 62, 353-389. https://doi.org/10.1080/01621459.1967.10482914
Darling, D. A. (1957). The Kolmogorov-Smirnov, Cramer-von Mises tests, The Annals of Mathematical Statistics, 28, 823-838. https://doi.org/10.1214/aoms/1177706788
Delse, F. C. and Feather, B. W. (1968). The effect of augmented sensory feedback on the control of salivation. Psychophysiology, 5, 15-21. https://doi.org/10.1111/j.1469-8986.1968.tb02796.x
Engmann, S. and Cousineau, D. (2011). Comparing distributions: the two-sample Anderson-Darling test as an alternative to the Kolmogorov-Smirnoff test, Journal of Applied Quantitative Methods, 6, 1-17.
Epps, T. W. and Singleton, K. J. (1986). An omnibus test for the two-sample problem using the empirical characteristic function, Journal of Statistical Computation and Simulation, 26, 177-203. https://doi.org/10.1080/00949658608810963
Goerg, S. J. and Kaiser, J. (2009). Nonparametric testing of distributions - The Epps-Singleton two-sample test using the empirical characteristic function. The Stata Journal, 9, 454-465.
Hollander, M., Wolfe, D. A., and Chicken, E. (2014). Nonparametric Statistical Methods (3rd ed), John Wiley & Sons, Hoboken.
Lilliefors, H. W. (1969). On the Kolmogorov-Smirnov test for the exponential distribution with mean unknown, Journal of the American Statistical Association, 64, 387-389. https://doi.org/10.1080/01621459.1969.10500983
Mann, H. B. and Whitney, D. R. (1947). On a test of whether one of two random variables is stochastically larger than the other, The Annals of Mathematical Statistics, 18, 50-60. https://doi.org/10.1214/aoms/1177730491
Massey, F. J. (1951). The Kolmogorov-Smirnov test for goodness of fit, Journal of the American Statistical Association, 46, 68-78. https://doi.org/10.1080/01621459.1951.10500769
Ozcomak, M. S., Kartal, M., Senger, O., and Celik, A. K. (2013). Comparison of the powers of the Kolmogorov-Smirnov two-sample test and the Mann-Whitney test for different kurtosis and skewness coefficients using the Monte Carlo simulation method, Journal of Statistical and Econometric Methods, 2, 81-98.
Pettitt, A. N. (1976). A two-sample Anderson-Darling rank statistic, Biometrika, 63, 161-168.
Scholz, F. W. and Stephens, M. A. (1987). K-sample Anderson-Darling tests, Journal of the American Statistical Association, 82, 918-924.
Smirnoff, N. (1939). Sur les ecarts de la courbe de la distribution empirique, Receuil Mathemathique (Matematiceskii Sbornik), 6, 3-26.
von Mises, R. (1928). Wahrscheinlichkeit Statistik und Wahrheit, Springer-Verlag, Berlin.
Wilcoxon, F. (1945). Individual comparisons by ranking methods, Biometrics Bulletin, 1, 80-83. https://doi.org/10.2307/3001968

The Korean Journal of Applied Statistics (응용통계연구)

Power comparison of distribution-free two sample goodness-of-fit tests

이표본 분포 동일성에 대한 분포무관 검정법 간 검정력 비교 연구

Abstract

Keywords

References

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)