DOI QR코드

DOI QR Code

Optimizing the maximum reported cluster size for normal-based spatial scan statistics

  • Yoo, Haerin (Division of Biostatistics, Department of Biomedical Systems Informatics, Yonsei University College of Medicine) ;
  • Jung, Inkyung (Division of Biostatistics, Department of Biomedical Systems Informatics, Yonsei University College of Medicine)
  • Received : 2018.02.28
  • Accepted : 2018.06.12
  • Published : 2018.07.31

Abstract

The spatial scan statistic is a widely used method to detect spatial clusters. The method imposes a large number of scanning windows with pre-defined shapes and varying sizes on the entire study region. The likelihood ratio test statistic comparing inside versus outside each window is then calculated and the window with the maximum value of test statistic becomes the most likely cluster. The results of cluster detection respond sensitively to the shape and the maximum size of scanning windows. The shape of scanning window has been extensively studied; however, there has been relatively little attention on the maximum scanning window size (MSWS) or maximum reported cluster size (MRCS). The Gini coefficient has recently been proposed by Han et al. (International Journal of Health Geographics, 15, 27, 2016) as a powerful tool to determine the optimal value of MRCS for the Poisson-based spatial scan statistic. In this paper, we apply the Gini coefficient to normal-based spatial scan statistics. Through a simulation study, we evaluate the performance of the proposed method. We illustrate the method using a real data example of female colorectal cancer incidence rates in South Korea for the year 2009.

Keywords

References

  1. Duczmal L and Assuncao R (2004). A simulated annealing strategy for the detection of arbitrarily shaped spatial clusters, Computational Statistics & Data Analysis, 45, 269-286. https://doi.org/10.1016/S0167-9473(02)00302-X
  2. Dwass M (1957). Modified randomization tests for nonparametric hypotheses, The Annals of Mathematical Statistics, 20, 181-187.
  3. Gastwirth JL (1972). The estimation of the Lorenz curve and Gini index, The Review of Economics and Statistics, 54, 306-316. https://doi.org/10.2307/1937992
  4. Han J, Zhu L, Kulldorff M, Hostovich S, Stinchcomb DG, Tatalovich Z, Lewis DR, and Feuer EJ (2016). Using Gini coefficient to determining optimal cluster reporting sizes for spatial scan statistics, International Journal of Health Geographics, 15, 27. https://doi.org/10.1186/s12942-016-0056-6
  5. Huang L, Kulldorff M, and Gregorio D (2007). A spatial scan statistic for survival data, Biometrics, 63, 109-118. https://doi.org/10.1111/j.1541-0420.2006.00661.x
  6. Huang L, Tiwari RC, Zou Z, Kulldorff M, and Feuer EJ (2009). Weighted normal spatial scan statistic for heterogeneous population data, Journal of the American Statistical Association, 104, 886-898. https://doi.org/10.1198/jasa.2009.ap07613
  7. Jung I, Kulldorff M, and Klassen AC (2007). A spatial scan statistic for ordinal data, Statistics in Medicine, 26, 1594-1607. https://doi.org/10.1002/sim.2607
  8. Jung I, Kulldorff M, and Richard OJ (2010). A spatial scan statistic for multinomial data, Statistics in Medicine, 29, 1910-1918. https://doi.org/10.1002/sim.3951
  9. Kim S and Jung I (2017). Optimizing the maximum reported cluster size in the spatial scan statistic for ordinal data, PLoS ONE, 12, e0182234. https://doi.org/10.1371/journal.pone.0182234
  10. Kulldorff M (1997). A spatial scan statistic, Communications in Statistics - Theory and Methods, 26, 1481-1496. https://doi.org/10.1080/03610929708831995
  11. Kulldorff M and Information Management Services Inc (2018). SaTScan$^{TM}$ v9.5: Software for the spatial and space-time spatial scan statistics, from: http://www.satscan.org/
  12. Kulldorff M, Huang L, and Konty K (2009). A scan statistic for continuous data based on the normal probability model, International Journal of Health Geographics, 8, 58. https://doi.org/10.1186/1476-072X-8-58
  13. Kulldorff M, Huang L, Pickle L, and Duczmal L (2006). An elliptic spatial scan statistic, Statistics in Medicine, 25, 3929-3943. https://doi.org/10.1002/sim.2490
  14. Lorenz MO (1905). Methods of measuring the concentration of wealth, Publications of the American Statistical Association, 9, 209-219. https://doi.org/10.2307/2276207
  15. Ribeiro SHR and Costa MA (2012). Optimal selection of the spatial scan parameters for cluster detection: a simulation study, Spatial and Spatio-Temporal Epidemiology, 3, 107-120. https://doi.org/10.1016/j.sste.2012.04.004
  16. Patil GP and Taillie C (2004). Upper level set scan statistic for detecting arbitrarily shaped hotspots, Environmental and Ecological Statistics, 11, 183-197. https://doi.org/10.1023/B:EEST.0000027208.48919.7e
  17. Tango T and Takahashi K (2005). A flexibly shaped spatial scan statistic for detecting clusters, International Journal of Health Geographics, 4, 11. https://doi.org/10.1186/1476-072X-4-11