DOI QR코드

DOI QR Code

A robust method for response variable transformations using dynamic plots

  • Seo, Han Son (Department of Applied Statistics, Konkuk University)
  • Received : 2019.02.24
  • Accepted : 2019.06.11
  • Published : 2019.09.30

Abstract

The variable transformations are useful ways to guarantee the functional relationships in the model. However, the presence of outliers may undermine the accuracy of transformation. This paper deals with response transformations in the partial linear models under the existence of outliers. A new procedure for response transformation and outliers detection is proposed. The procedure uses a sequential method for identifying outliers and dynamic graphical methods for an appropriate transformation. The graphical tools make it possible to catch diagnostic information by monitoring the movement of points in the data. The procedure is illustrated with several examples. Examples show that visual clues regarding the optimal transformation, the fittness of the model and the outlyness of the observations can be checked from the series of plots.

Keywords

References

  1. Atkinson AC (1994). Fast very robust methods for the detection of multiple outliers, Journal of the American Statistical Association, 89, 1329-1339. https://doi.org/10.1080/01621459.1994.10476872
  2. Atkinson AC and Riani M (2000). Robust Diagnostic Regression Analysis, Springer, New York.
  3. Chambers JM, Cleveland WS, Kleiner B, and Tukey P (1983). Graphical Methods for Data Analysis,Duxbury Press, Boston.
  4. Cheng T (2005). Robust regression diagnostics with data transformations, Computational Statistics and Data Analysis, 49, 875-891. https://doi.org/10.1016/j.csda.2004.06.010
  5. Cook RD (1993). Exploring partial residual plots, Technometrics, 35, 351-362. https://doi.org/10.1080/00401706.1993.10485350
  6. Cook RD and Weisberg S (1994). Transforming a response variable for linearity, Biometrika, 81, 731-737. https://doi.org/10.1093/biomet/81.4.731
  7. Hadi AS and Simonoff JS (1993). Procedures for the identification of multiple outliers in linear models, Journal of the American Statistical Association, 88, 1264-1272. https://doi.org/10.1080/01621459.1993.10476407
  8. Hartigan JA (1981). Consistency of single linkage for high-density clusters, Journal of the American Statistical Association, 76, 388-394. https://doi.org/10.1080/01621459.1981.10477658
  9. Larsen WA and McCleary SJ (1972). The use of partial residual plots in regression analysis, Technometrics, 14, 781-790. https://doi.org/10.1080/00401706.1972.10488966
  10. Mallows CL (1986). Augmented partial residual plots, Technometrics, 28, 313-320. https://doi.org/10.2307/1268980
  11. Rosner B (1975). On the Detection of Many Outliers, Technometrics, 17, 217-227.
  12. Rousseeuw PJ (1984). Least median of squares regression, Journal of American Statistical Association, 79, 871-880. https://doi.org/10.1080/01621459.1984.10477105
  13. Seo HS (2009). A visual procedure for optimal response transformation and curvature specifications, Optimization and Engineering, 10, 301-312. https://doi.org/10.1007/s11081-008-9070-2
  14. Seo HS, Lee GY, and Yoon M (2012). Robust response transformation using outlier detection in regression model, The Korean Journal of Applied Statistics, 25, 205-213. https://doi.org/10.5351/KJAS.2012.25.1.205
  15. Seo HS and Yoon M (2009). A dynamic graphical method for transformations and curvature specifications in regression, The Korean Journal of Applied Statistics, 22, 189-195. https://doi.org/10.5351/KJAS.2009.22.1.189
  16. Seo HS and Yoon M (2013). Regression diagnostics for response transformations in a partial linear model, Journal of the Korean Data & Information Science Society, 24, 33-39. https://doi.org/10.7465/jkdi.2013.24.1.33
  17. Simonoff JS (1984). The calculation of outlier detection statistics, Communications in Statistics, Part B-Simulation and Computation, 13, 275-285. https://doi.org/10.1080/03610918408812374
  18. Simonoff JS (1988). Detecting outlying cells in two-way contingency tables via backwards-stepping, Technometrics, 30, 339-345. https://doi.org/10.1080/00401706.1988.10488407
  19. Stromberg AJ (1993). Computation of high breakdown nonlinear regression parameters, Journal of the American Statistical Association, 88, 237-244. https://doi.org/10.2307/2290718
  20. Tierney L (1990). LISP-STAT: An Object-Oriented Environment for Statistical Computing and Dynamic Graphics, John Wiley & Sons, New York.
  21. Weisberg S (2005). Applied Linear Regression, Wiley, New York.
  22. Yohai VJ (1987). High breakdown-point and high efficiency robust estimate for regression, The Annals of Statistics, 15, 642-656. https://doi.org/10.1214/aos/1176350366