DOI QR코드

DOI QR Code

Model assessment with residual plot in logistic regression

로지스틱회귀에서 잔차산점도를 이용한 모형평가

  • 강명욱 (숙명여자대학교 통계학과)
  • Received : 2014.12.16
  • Accepted : 2015.01.10
  • Published : 2015.01.31

Abstract

Graphical paradigms for assessing the adequacy of models in logistic regression are discussed. The residual plot has been widely used as a graphical tool for evaluating the adequacy of the model. However, this approach works well only for linear models with constant variance, and the alternative approach, the marginal model plot, has its defects as well. We suggest a Chi-residual plot that overcomes the potential shortcomings of the marginal model plot.

로지스틱회귀에서 모형을 평가하거나 진단할 때 가설검정이 주로 사용되지만 이것만으로는 놓칠 수 있는 부분이 많고 이에 대한 보완을 위하여 그래픽적 방법의 사용이 요구된다. 그래프를 이용한 모형의 적절성 평가를 위한 도구로 잔차산점도가 널리 이용되고 있으나 적용 범위가 선형회귀에 국한되는 문제점이 있다. 해결 방안으로 주변모형산점도를 이용하여 모형의 적절성을 평가하는 방법이 있으나 역시 문제점을 가지고 있다. 본 논문에서는 주변모형산점도의 대안으로 카이잔차산점도를 제안하고 그 효용성을 알아본다.

Keywords

References

  1. Atkinson, A. C. (1985). Plots, transformations, and regression, Oxford University Press, Oxford.
  2. Belsley, D. A., Kuh, E. and Welsch, R. E. (1980). Regression diagnostics, Wiley, New York.
  3. Cleveland, W. S. (1987). Research in statistical graphics. Journal of the American Statistical Association, 82, 419-423. https://doi.org/10.1080/01621459.1987.10478444
  4. Cleveland, W. and Devlin, D. J. (1988). Locally weighted regression: An approach to regression analysis by local fitting. Journal of the American Statistical Association, 83, 596-610. https://doi.org/10.1080/01621459.1988.10478639
  5. Cook, R. D. (1998). Regression graphics: Idea for studying regressions through graphics, Wiley, New York.
  6. Cook, R. D. and Weisberg, S. (1982). Residuals and influence in regression, Chapman and Hall, New York.
  7. Cook, R. D. and Weisberg, S. (1994). An introduction to regression graphics, Wiley, New York.
  8. Cook, R. D. and Weisberg, S. (1997). Graphics for assessing the adequacy of regression models. Journal of the American Statistical Association, 92, 490-499. https://doi.org/10.1080/01621459.1997.10474002
  9. Cook, R. D. and Weisberg, S. (1999). Applied Regression including computing and graphics, Wiley, New York.
  10. Ezekiel, M. (1924). A method for handling curvilinear correlation for any number of variables. Journal of the American Statistical Association, 19, 431-453. https://doi.org/10.1080/01621459.1924.10502899
  11. Kahng, M. (2005). Exploring interaction in generalized linear models. Journal of the Korean Data & Information Science Society, 16, 13-18.
  12. Kahng, M. and Shin, E. (2012). A study on log-density with log-odds graph for variable selection in logistic regression. Journal of Korean Data & Information Science Society, 23, 99-111. https://doi.org/10.7465/jkdi.2012.23.1.099
  13. Kay, R. and Little, S. (1987). Transformations of the explanatory variables in the logistic regression model for binary data. Biometrika, 74, 495-501. https://doi.org/10.1093/biomet/74.3.495
  14. Scrucca, L. (2003). Graphics for studying logistic regression models. Statistical Methods and Applications, 11, 371-394
  15. Scrucca, L. and Weisberg, S. (2004). A simulation study to investigate the behavior of the log-density ratio under normality. Communication in Statistics Simulation and Computation, 33, 159-178. https://doi.org/10.1081/SAC-120028439
  16. Tierney, L. (1990). Lisp-Stat: An object-oriented environment for statistical computing and dynamic graphics, Wiley, New York.
  17. Weisberg, S. (2005). Applied linear regression, 3rd Ed, Wiley, New York.