• Title/Summary/Keyword: Influential observations

Search Result 73, Processing Time 0.028 seconds

A Study on Detection of Influential Observations on A Subset of Regression Parameters in Multiple Regression

  • Park, Sung Hyun;Oh, Jin Ho
    • Communications for Statistical Applications and Methods
    • /
    • v.9 no.2
    • /
    • pp.521-531
    • /
    • 2002
  • Various diagnostic techniques for identifying influential observations are mostly based on the deletion of a single observation. While such techniques can satisfactorily identify influential observations in many cases, they will not always be successful because of some mask effect. It is necessary, therefore, to develop techniques that examine the potentially influential effects of a subset of observations. The partial regression plots can be used to examine an influential observation for a single parameter in multiple linear regression. However, it is often desirable to detect influential observations for a subset of regression parameters when interest centers on a selected subset of independent variables. Thus, we propose a diagnostic measure which deals with detecting influential observations on a subset of regression parameters. In this paper, we propose a measure M, which can be effectively used for the detection of influential observations on a subset of regression parameters in multiple linear regression. An illustrated example is given to show how we can use the new measure M to identify influential observations on a subset of regression parameters.

Influential Points in GLMs via Backwards Stepping

  • Jeong, Kwang-Mo;Oh, Hae-Young
    • Communications for Statistical Applications and Methods
    • /
    • v.9 no.1
    • /
    • pp.197-212
    • /
    • 2002
  • When assessing goodness-of-fit of a model, a small subset of deviating observations can give rise to a significant lack of fit. It is therefore important to identify such observations and to assess their effects on various aspects of analysis. A Cook's distance measure is usually used to detect influential observation. But it sometimes is not fully effective in identifying truly influential set of observations because there may exist masking or swamping effects. In this paper we confine our attention to influential subset In GLMs such as logistic regression models and loglinear models. We modify a backwards stepping algorithm, which was originally suggested for detecting outlying cells in contingency tables, to detect influential observations in GLMs. The algorithm consists of two steps, the identification step and the testing step. In identification step we Identify influential observations based on influencial measures such as Cook's distances. On the other hand in testing step we test the subset of identified observations to be significant or not Finally we explain the proposed method through two types of dataset related to logistic regression model and loglinear model, respectively.

A Study on Detection of Outliers and Influential Observations in Linear Models

  • Kang, Eun M.;Park, Sung H.
    • Journal of Korean Society for Quality Management
    • /
    • v.16 no.2
    • /
    • pp.18-33
    • /
    • 1988
  • A new diagnostic statistic for detecting outliers and influential observations in linear models is suggested and studied in this paper. The proposed statistic is a weighted sum of two measures ; one is for detecting outliers and the other is for detecting influential ovservations. The merit of this statistic is that it is possible to distinguish outliers from influential observations. This statistic can be used for not only regression models but also factorial design models. A Monte Carlo simulation study is reported to suggest critical values for detecting outliers and influential observations for simple regression models when the number of observations is 11. 21, 31, 41 or 51.

  • PDF

LOCAL INFLUENCE ON THE GOODNESS-OF-FIT TEST STATISTIC IN MAXIMUM LIKELIHOOD FACTOR ANALYSIS

  • Jung, Kang-Mo
    • Journal of applied mathematics & informatics
    • /
    • v.5 no.2
    • /
    • pp.489-498
    • /
    • 1998
  • The influence of observations the on the goodness-of-fit test in maximum likelihood factor analysis is investigated by using the local influence method. under an appropriate perturbation the test statistic forms a surface. One of main diagnostics is the maximum slope of the perturbed surface the other is the direction vector cor-responding to the curvature. These influence measures provide the information about jointly influence measures provide the information about jointly influential observations as well as individ-ually influential observations.

Graphical method for evaluating the impact of influential observations in high-dimensional data (고차원 자료에서 영향점의 영향을 평가하기 위한 그래픽 방법)

  • Ahn, Sojin;Lee, Jae Eun;Jang, Dae-Heung
    • Journal of the Korean Data and Information Science Society
    • /
    • v.28 no.6
    • /
    • pp.1291-1300
    • /
    • 2017
  • In the high-dimensional data, the number of variables is very larger than the number of observations. In this case, the impact of influential observations on regression coefficient estimates can be very large. Jang and Anderson-Cook (2017) suggested the LASSO influence plot. In this paper, we propose the LASSO influence plot, LASSO variable selection ranking plot, and three-dimensional LASSO influence plot as graphical methods for evaluating the impact of influential observations in high-dimensional data. With real two high-dimensional data examples, we apply these graphical methods as the regression diagnostics tools for finding influential observations. It has been found that we can obtain influential observations with by these graphical methods.

Influence Analysis on a Test Statistic in Canonical Correlation Analysis

  • Jung, Kang-Mo
    • Communications for Statistical Applications and Methods
    • /
    • v.8 no.2
    • /
    • pp.347-355
    • /
    • 2001
  • We propose a method for detecting influential observations that have a large influence on the likelihood ratio test statistic for the two sets of variables are uncorrelated with one another. For this purpose we derive a local influence measure for the likelihood ratio test statistic under certain perturbation scheme. An illustrative example is given to show the effectiveness of the proposed method on the identification of influential observations.

  • PDF

Local Influence in Quadratic Discriminant Analysis

  • Jung, Kang-Mo
    • Communications for Statistical Applications and Methods
    • /
    • v.6 no.1
    • /
    • pp.43-52
    • /
    • 1999
  • The local influence method is adapted to quadratic discriminant analysis for the identification of influential observations affecting the estimation of probability density function probabilities and log odds. The method allows a simultaneous perturbation on all observations so that it can identify multiple influential observations. The proposed method is applied to a real data set and satisfactory result is obtained.

  • PDF

A Bayesian Diagnostic Measure and Stopping Rule for Detecting Influential Observations in Discriminant Analysis

  • Kim, Myung-Cheol;Kim, Hea-Jung
    • Journal of the Korean Statistical Society
    • /
    • v.29 no.3
    • /
    • pp.337-350
    • /
    • 2000
  • This paper suggests a new diagnostic measure and a stopping rule for detecting influential observations in multiple discriminant analysis (MDA). It is developed from a Bayesian point of view using a default Bayes factor obtained from the fractional Bayes factor methodology. The Bayes factor is taken as a discriminatory information in MDA. It is shown that the effect of an observation over the discriminatory information is fully explained by the diagnostic measure. Based on the measure, we suggest a stopping rule for detecting influential observations in a given training sample. As a tool for interpreting the measure a graphical method is sued. Performance of the method is used. Performance of the method is examined through two illustrative examples.

  • PDF

A study of a new statistic for detection of outliers and/or influential observations in regression diagnostics (회귀진단에서 이상치와 영향관측치를 동시에 발견하는 새로운 통계량에 관한 연구)

  • 강은미
    • The Korean Journal of Applied Statistics
    • /
    • v.6 no.1
    • /
    • pp.67-78
    • /
    • 1993
  • A new diagnostic statistic for detecting outliers and influential observations in linear models is suggested and studied in this paper. The proposed statistic is a weighted sum of two measures; one is for detecting outliers and the other is for detecting influential observations. The merit of this statistic is that it is possible to distinguish outliers from influential observations. We have done some Monte-Carlo Simulation to find the probability distribution of this statistic.

  • PDF

On an Information Theoretic Diagnostic Measure for Detecting Influential Observations in LDA

  • Kim, Hea-Jung
    • Journal of the Korean Statistical Society
    • /
    • v.25 no.2
    • /
    • pp.289-301
    • /
    • 1996
  • This paper suggests a new diagnostic measure for detecting influential observations in two group linear discriminant analysis(LDA). It is developed from an information theoretic point of view using the minimum discrimination information(MDI) methodology. MDI estimator of symmetric divergence by Kullback(l967) is taken as a measure of the power of discrimination in LDA. It is shown that the effect of an observation over the power of discrimination is fully explained by the diagnostic measure. Asymptotic distribution of the proposed measure is derived as a function of independent chi-squared and standard normal variables. By means of the distributions, a couple of methods are suggested for detecting the influential observations in LDA. Performance of the suggested methods are examined through a simulation study.

  • PDF