The Identification Of Multiple Outliers

  • Park, Jin-Pyo (Division of Information & Communication Engineering, Kyungnam University)
  • Published: 2000.10.31

Abstract

The classical method for regression analysis is the least squares method. However, if the data contain significant outliers, the least squares estimator can break down. To remedy this problem, robust methods are an important complement to the least squares method. Robust methods downweight or completely ignore the outliers. This is not always best, because the outliers can contain very important information about the population. If the outliers can be detected, they can be inspected further and appropriate action can be taken based on the results. In this paper, I propose a sequential outlier test to identify outliers. The test is based on the nonrobust and the robust estimates of scatter of the residuals from a robust regression. It is applied in a forward procedure that removes the most extreme observation at each step until the test fails to detect outliers. Unlike other forward procedures, the present one is unaffected by swamping or masking effects because the statistic is based on the robust regression residuals. I derive the asymptotic distribution of the test statistic and apply the test to several real and simulated data sets, on which it performs fairly well.
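The forward procedure described in the abstract can be illustrated with a minimal sketch. It is not the paper's test: the abstract does not give the exact statistic, the robust estimator, or the critical values, so every specific choice here is an assumption. The sketch assumes simple (one-predictor) regression, uses Siegel's repeated median line as a stand-in robust fit, takes the ratio of the sample standard deviation to the normalized MAD of the residuals as the statistic, keeps the robust fit fixed across steps, and uses a hypothetical `cutoff` in place of the asymptotic critical value. The function names are illustrative only.

```python
import numpy as np


def repeated_median_fit(x, y):
    """Repeated median line (stand-in robust fit; assumes distinct x values)."""
    n = len(x)
    slopes = np.empty(n)
    for i in range(n):
        dx = x - x[i]
        mask = dx != 0  # skip the point itself (and any ties in x)
        slopes[i] = np.median((y[mask] - y[i]) / dx[mask])
    slope = np.median(slopes)
    intercept = np.median(y - slope * x)
    return intercept, slope


def scatter_ratio(res):
    """Nonrobust scatter (sample SD) divided by robust scatter (normalized MAD)."""
    sd = np.std(res, ddof=1)
    mad = 1.4826 * np.median(np.abs(res - np.median(res)))
    return sd / mad


def sequential_outlier_test(x, y, cutoff=2.0, max_remove=None):
    """Flag outliers one at a time until the scatter ratio drops to the cutoff.

    `cutoff` is a hypothetical threshold, not the paper's critical value.
    """
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    if max_remove is None:
        max_remove = len(x) // 2  # never flag more than half of the data

    # Assumption: the robust fit is computed once on the full data set.
    intercept, slope = repeated_median_fit(x, y)
    res = y - (intercept + slope * x)

    active = np.arange(len(x))  # indices not yet flagged
    flagged = []
    while len(flagged) < max_remove and scatter_ratio(res[active]) > cutoff:
        # Remove the most extreme remaining observation.
        worst = active[np.argmax(np.abs(res[active]))]
        flagged.append(int(worst))
        active = active[active != worst]
    return flagged


if __name__ == "__main__":
    x = np.arange(20.0)
    y = 2.0 + 3.0 * x + 0.1 * np.sin(x)
    y[3] += 50.0   # planted outliers
    y[15] -= 40.0
    print(sequential_outlier_test(x, y))
```

Because the residuals come from a robust fit, the planted points keep large residuals even when several outliers are present, which is the property the abstract credits for avoiding masking and swamping. In practice the cutoff would be replaced by a critical value from the asymptotic distribution of the statistic.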
