DOI QR코드

DOI QR Code

Tests for homogeneity of proportions in clustered binomial data

  • Received : 2016.08.17
  • Accepted : 2016.09.23
  • Published : 2016.09.30

Abstract

When we observe binary responses in a cluster (such as rat lab-subjects), they are usually correlated to each other. In clustered binomial counts, the independence assumption is violated and we encounter an extra-variation. In the presence of extra-variation, the ordinary statistical analyses of binomial data are inappropriate to apply. In testing the homogeneity of proportions between several treatment groups, the classical Pearson chi-squared test has a severe flaw in the control of Type I error rates. We focus on modifying the chi-squared statistic by incorporating variance inflation factors. We suggest a method to adjust data in terms of dispersion estimate based on a quasi-likelihood model. We explain the testing procedure via an illustrative example as well as compare the performance of a modified chi-squared test with competitive statistics through a Monte Carlo study.

Keywords

References

  1. Agresti A (2013). Categorical Data Analysis (3rd ed), John Wiley & Sons, Hoboken, NJ.
  2. Cochran WG (1977). Sampling Techniques (3rd ed), John Wiley & Sons, Hoboken, NJ.
  3. Crowder MJ (1978). Beta-binomial ANOVA for proportions, Journal of the Royal Statistical Society, Series C: Applied Statistics, 27, 34-37.
  4. Donner A (1989). Statistical methods in ophthalmology: an adjusted chi-squared approach, Biometrics, 45, 605-611. https://doi.org/10.2307/2531501
  5. Jeong KM (2015). Goodness-of-fit for the clustered binomial models, Journal of the Korean Data Analysis Society, 17, 1725-1737.
  6. Jeong KM and Lee HY (2013). Modeling overdispersion for clustered binomial data, Journal of the Korean Data Analysis Society, 15, 2343-2356.
  7. Paul SR (1982). Analysis of proportions of affected foetuses in teratological experiments, Biometrics, 38, 361-370. https://doi.org/10.2307/2530450
  8. Rao JNK and Scott AJ (1992). A simple method for the analysis of clustered binary data, Biometrics, 48, 577-585. https://doi.org/10.2307/2532311
  9. Reed JF (2004). Adjusted chi-square statistics: application to clustered binary data in primary care, Annals of Family Medicine, 2, 201-203. https://doi.org/10.1370/afm.41
  10. Ridout MS, Demetrio CGB, and Firth D (1999). Estimating intraclass correlation for binary data, Biometrics, 55, 137-148. https://doi.org/10.1111/j.0006-341X.1999.00137.x
  11. Wedderburn RWM (1974). Quasi-likelihood functions, generalized linear models, and the Gauss-Newton method, Biometrika, 61, 439-447.
  12. White H (1982). Maximum likelihood estimation of misspecified models, Econometrica, 50, 1-25. https://doi.org/10.2307/1912526
  13. Williams DA (1982). Extra-binomial variation in logistic linear models, Journal of the Royal Statistical Society, Series C: Applied Statistics, 31, 144-148.