DOI QR코드

DOI QR Code

A comparison of tests for homoscedasticity using simulation and empirical data

  • Anastasios Katsileros (Department of Crop Science, Agricultural University of Athens) ;
  • Nikolaos Antonetsis (Department of Crop Science, Agricultural University of Athens) ;
  • Paschalis Mouzaidis (Department of Forestry and Natural Resources, Democritus University of Thrace) ;
  • Eleni Tani (Department of Crop Science, Agricultural University of Athens) ;
  • Penelope J. Bebeli (Department of Crop Science, Agricultural University of Athens) ;
  • Alex Karagrigoriou (Department of Statistics and Actuarial-Financial Mathematics, University of the Aegean)
  • Received : 2023.04.23
  • Accepted : 2023.10.14
  • Published : 2024.01.31

Abstract

The assumption of homoscedasticity is one of the most crucial assumptions for many parametric tests used in the biological sciences. The aim of this paper is to compare the empirical probability of type I error and the power of ten parametric and two non-parametric tests for homoscedasticity with simulations under different types of distributions, number of groups, number of samples per group, variance ratio and significance levels, as well as through empirical data from an agricultural experiment. According to the findings of the simulation study, when there is no violation of the assumption of normality and the groups have equal variances and equal number of samples, the Bhandary-Dai, Cochran's C, Hartley's Fmax, Levene (trimmed mean) and Bartlett tests are considered robust. The Levene (absolute and square deviations) tests show a high probability of type I error in a small number of samples, which increases as the number of groups rises. When data groups display a nonnormal distribution, researchers should utilize the Levene (trimmed mean), O'Brien and Brown-Forsythe tests. On the other hand, if the assumption of normality is not violated but diagnostic plots indicate unequal variances between groups, researchers are advised to use the Bartlett, Z-variance, Bhandary-Dai and Levene (trimmed mean) tests. Assessing the tests being considered, the test that stands out as the most well-rounded choice is the Levene's test (trimmed mean), which provides satisfactory type I error control and relatively high power. According to the findings of the study and for the scenarios considered, the two non-parametric tests are not recommended. In conclusion, it is suggested to initially check for normality and consider the number of samples per group before choosing the most appropriate test for homoscedasticity.

Keywords

Acknowledgement

The authors extend their appreciation to the editor and referees for their valuable comments and suggestions, which has enhanced the quality of the current manuscript's presentation.

References

  1. Bhandary M and Dai H (2008). An alternative test for the equality of variances for several populations when the underlying distributions are normal, Communications in Statistics - Simulation and Computation, 38, 109-117, Available from: doi: 10.1080/03610910802431011 
  2. Bhandary M and Dai H (2013). An alternative test for the equality of variances for several populations in randomised complete block design, Statistical Methodology, 11, 22-35, Available from: https://doi.org/10.1016/j.stamet.2012.08.002 
  3. Bradley JV (1978). Robustness?, British Journal of Mathematical and Statistical Psychology, 31, 144-152.  https://doi.org/10.1111/j.2044-8317.1978.tb00581.x
  4. Bradley JV (1980). Nonrobustness in Z, t, and F tests at large sample sizes, Bulletin of the Psychonomic Society, 16, 333-336.  https://doi.org/10.3758/BF03329558
  5. Brown MB and Forsythe AB (1974). Robust tests for the equality of variances, Journal of the American Statistical Association, 69, 364-367, Available from: doi: 10.1080/01621459.1974.10482955 
  6. Cochran WG (1941). The distribution of the largest of a set of estimated variances as a fraction of their total, Annals of Human Genetics (London), 11, 47-52. 
  7. Conover WJ and Iman RL (1981). Rank transformations as a bridge between parametric and nonparametric statistics, The American Statistician, 35, 124-129, Available from: https://doi.org/10.2307/2683975 
  8. Dag O, Dolgun A, and Konar NM (2018). Onewaytests: An R package for one-way tests in independent groups designs, R Journal, 10, 175-199.  https://doi.org/10.32614/RJ-2018-022
  9. David HA (1952). Upper 5 % maximum F-ratio, Biometrika, 39, 422-424.  https://doi.org/10.1093/biomet/39.3-4.422
  10. de Mendiburu F and Yaseen M (2020). Agricolae: Statistical Procedures for Agricultural Research. R package version 1.4. Available from: https://cran.r-project.org/web/packages/agricolae 
  11. Delacre M, Leys C, Mora YL, and Lakens D (2019). Taking parametric assumptions seriously: Arguments for the use of Welch's F-test instead of the classical F-test in one-way ANOVA, International Review of Social Psychology, 32, 1-12, Available from: doi: https://doi.org/10.5334/irsp.198 
  12. Fligner MA and Killeen TJ (1976). Distribution-free two sample tests for scale, Journal of the American Statistical Association, 71, 210-213.  https://doi.org/10.1080/01621459.1976.10481517
  13. Gorbunova AA and Lemeshko BY (2012). Application of parametric homogeneity of variances tests under violation of classical assumption. In Proceedings of the 2nd Stochastic Modeling Techniques and Data Analysis International Conference, Chania, Crete, Greece, 1-9. 
  14. Hartley HO (1950). The maximum F-ratio as a short cut test for homogeneity of variance, Biometrika, 37, 308-312. 
  15. Hatchavanich D (2014). A comparison of type I error and power of Bartlett's test, Levene's test and O'Brien's test for homogeneity of variance tests, Southeast Asian Journal of Sciences, 3, 181-194. 
  16. Hennessy DA (2009). Crop yield skewness under law of the minimum technology, American Journal of Agricultural Economics, 91, 197-208, Available from: https://doi.org/10.1111/j.1467-8276.2008.01181.x 
  17. Hsiung TH and Olejnik S (1994). Contrast analysis for additive non-orthogonal two-factor designs in unequal variance cases, British Journal of Mathematical and Statistical Psychology, 47, 337-354, Available from: https://doi.org/10.1111/j.2044-8317.1994.tb01041.x 
  18. Kim SB, Kim DS, and Magana-Ramirez C (2021). Applications of statistical experimental designs to improve statistical inference in weed management, PLoS ONE, 16, e0257472, Available from: http://doi.org/10.1371/ journal.pone.0257472 
  19. Kozak M and Piepho HP (2018). What's normal anyway? residual plots are more telling than significance tests when checking ANOVA assumptions, Journal of Agronomy and Crop Science, 204, 86-98, Available from: doi.org/10.1111/jac.12220 
  20. Lee HB, Katz GS, and Restori AF (2010). A Monte Carlo study of seven homogeneity of variance tests, Journal of Mathematics and Statistics, 6, 359-366, Available from: doi.org/10.3844/jmssp.2010.359.366 
  21. Levene H (1960). Robust tests for equality of variances. In I Olkin (Eds), Contributions to Probability and Statistics; Essays in Honor of Harold Hotelling (pp. 278-292), Stanford University Press, Redwood City, CA. 
  22. Lix LM, Keselman JC, and Keselman HJ (1996). Consequences of assumption violations revisited: A quantitative review of alternatives to the one-way analysis of variance F-test, Review of Educational Research, 66, 579-619, Available from: https://doi.org/10.3102/00346543066004579 
  23. Mirtagioglu H, Yigit S, Mendes E, and Mendes M (2017). A Monte Carlo simulation study for comparing performances of some homogeneity of variances tests, Journal of Applied Quantitative Methods, 12, 1-11. 
  24. O'Brien RG (1979). A general ANOVA method for robust test of additive models for variance, Journal of the American Statistical Association, 74, 877-880.  https://doi.org/10.1080/01621459.1979.10481047
  25. O'Brien RG (1981). A simple test for variance effects in experimental designs, Psychological Bulletin, 89, 570-574.  https://doi.org/10.1037/0033-2909.89.3.570
  26. Onifade OC and Olanrewaju SO (2020). Investigating performances of some statistical tests for heteroscedasticity assumption in generalized linear model: A Monte Carlo simulations study, Open Journal of Statistics, 10, 453-493, Available from: https://doi.org/10.4236/ojs.2020.103029 
  27. Overall JE and Woodward JA (1974). A simple test for heterogeneity of variance in complex factorial designs, Psychometrika, 39, 311-318, Available from: https://doi.org/10.1007/BF02291705 
  28. Parra-Frutos I (2013). Testing homogeneity of variances with unequal sample sizes, Computational Statistics, 28, 1269-1297, Available from: 10.1007/s00180-012-0353-x 
  29. Piepho HP (1996). A Monte Carlo test for variance homogeneity in linear models, Biometrical Journal, 38, 461-473.  https://doi.org/10.1002/bimj.4710380411
  30. Pohlert T (2021). PMCMRplus: Calculate Pairwise Multiple Comparisons of Mean Rank Sums Extended. R package version 1.9.3. Available from: https://cran.r-project.org/web/packages/PMCMRplus 
  31. Raudonius S (2017). Application of statistics in plant and crop research: Important issues, ZemdirbysteAgriculture, 104, 377-382, Available from: http:// doi 10.13080/z-a.2017.104.048 
  32. Snedecor GW and Cochran WG (1983). Statistical Methods (6th ed), Oxford and IBH, New Delhi. 
  33. Rossoni D and Lima RR (2019). Autoregressive analysis of variance for experiments with spatial dependence between plots: A simulation study, Brazilian Journal of Biometrics, 37, 244-257, Available from: doi:https://doi.org/10.28951/rbb.v37i2.388 
  34. Stroup WW (2015). Rethinking the analysis of non-normal data in plant and soil science, Agronomy Journal, 107, 811-827, Available from: doi:10.2134/agronj2013.0342 
  35. Sharma D and Kibria BMG (2013). On some test statistics for testing homogeneity of variances: A comparative study, Journal of Statistical Computation and Simulation, 83, 1944-1963, Available from: doi: 10.1080/00949655.2012.675336 
  36. Tukey JW (1960). A survey of sampling from contaminated distributions. In I Olkin (Eds), Contributions to Probability and Statistics: Essays in Honor of Harold Hotelling (pp. 448-485), Stanford University Press, Redwood City, CA. 
  37. Wang Y, Rodriguez P, Chen YH, Kromrey JD, Kim ES, Pham T, Nguyen D, and Romano JL (2017). Comparing the performance of approaches for testing the homogeneity of variance assumption in one-factor ANOVA models, Educational and Psychological Measurement, 77, 305-329, Available from: doi: 10.1177/0013164416645162 
  38. Wilcox RR (2003). Applying Contemporary Statistical Methods, Academic Press, San Diego, CA. 
  39. Webster R and Lark RM (2019). Analysis of variance in soil research: Examining the assumptions, European Journal of Soil Science, 70, 990-1000, Available from: https://doi.org/10.1111/ejss.12804 
  40. Yamamotto ELM, Goncalves MC, Davide LMC, Rossoni DF, and Santos A. (2022). Spatial variability in evaluation experiments of corn genotypes in the state of Mato Grosso do Sul, Brazil, Acta Scientiarum. Agronomy, 44, e55972, Available from: https://doi.org/10.4025/actasciagron.v44i1.55972