Search | Korea Science

Kahng, Myung-Wook;Kim, Bu-Yang
- Communications for Statistical Applications and Methods / v.16 no.1 / pp.163-168 / 2009
Given the specific mean shift outlier model, the standard approaches to obtaining test statistics for outliers are discussed. The accuracy of outlier tests is investigated using subset curvatures. These subset curvatures appear to be reliable indicators of the adequacy of the linearization-based test. We also consider obtaining graphical summaries of the uncertainty in parameter estimates through confidence curves. The results are applied to the problem of assessing the accuracy of outlier tests.
https://doi.org/10.5351/CKSS.2009.16.1.163
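For readers unfamiliar with the term used throughout these abstracts, the mean shift outlier model in nonlinear regression is commonly written as below. This is a sketch of the standard textbook form only; the symbols ($f$, $\theta$, $\delta$, $D$, $I$) are assumed notation and are not taken from the papers listed here.

```latex
% Assumed notation: I is the index set of the m suspected outliers,
% D is the n x m indicator matrix whose columns pick out the cases in I.
\begin{align*}
  y_i &= f(x_i, \theta) + \varepsilon_i,            && i \notin I, \\
  y_i &= f(x_i, \theta) + \delta_i + \varepsilon_i, && i \in I,
\end{align*}
% Equivalent vector form and the outlier hypothesis being tested:
\[
  y = f(X, \theta) + D\delta + \varepsilon,
  \qquad H_0 : \delta = 0 \quad \text{versus} \quad H_1 : \delta \neq 0 .
\]
```

Under this formulation, testing for outliers amounts to testing whether the shift parameters $\delta$ are zero, which is why the abstracts refer to likelihood ratio and score statistics.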

Kahng, Myung-Wook
- Journal of the Korean Statistical Society / v.22 no.2 / pp.201-208 / 1993
Given the specific mean shift outlier model, the score test for multiple outliers in nonlinear regression is discussed as an alternative to the likelihood ratio test. The geometric interpretation of the score statistic is also presented.

Kahng, Myung-Wook
- Journal of the Korean Statistical Society / v.24 no.2 / pp.419-437 / 1995
Given the specific mean shift outlier model, several standard approaches to obtaining test statistics for outliers are discussed. Each of these is developed in detail for the nonlinear regression model, and each leads to an equivalent distribution. The geometric interpretations of the statistics and the accuracy of the linear approximation are also presented.

Namkyung, Pyong;Lee, Joon Suk
- Communications for Statistical Applications and Methods / v.7 no.2 / pp.447-456 / 2000
In this paper, we considered three methods for identifying outliers in sample surveys. First, we studied methods for handling and adjusting outliers in a normal population. Second, we studied existing methods based on the mean, maximum, and minimum, and proposed a test based on the median, which reflects the characteristics of the data well regardless of the sampling distribution. Finally, we showed through simulation that our median-based test works better than the Dixon test and the mean-based test.

Kahng, Myung-Wook
- Journal of the Korean Data and Information Science Society / v.17 no.1 / pp.205-211 / 2006
For a linear regression model, the necessary and sufficient condition for the asymptotic consistency of the outlier test statistic is known. An analogous condition for the nonlinear regression model is considered in this paper.

Kahng, Myung-Wook
- Communications for Statistical Applications and Methods / v.18 no.1 / pp.131-136 / 2011
The original Bates-Watts framework applies only to the complete parameter vector. Thus, guidelines developed in that framework can be misleading when the adequacy of the linear approximation is very different for different subsets. The subset curvature measures appear to be reliable indicators of the adequacy of linear approximation for an arbitrary subset of parameters in nonlinear models. Given the specific mean shift outlier model, the standard approaches to obtaining test statistics for outliers are discussed. The accuracy of outlier tests is investigated using subset curvatures.
https://doi.org/10.5351/CKSS.2011.18.1.131

Seo, Han Son;Yoon, Min
- The Korean Journal of Applied Statistics / v.27 no.2 / pp.307-315 / 2014
An exact distribution of the test statistic for multiple outlier candidates does not generally exist; therefore, tests of individual outliers (or tests using simulated critical values) are usually conducted instead of tests for groups of outliers. This article concerns procedures for testing outlying observations. We suggest a method that can be applied to arbitrary observations or to multiple outlier candidates detected by an outlier detection method. A Monte Carlo study is used to compare the performance of the proposed method with that of others.
https://doi.org/10.5351/KJAS.2014.27.2.307

Kim, Myung-Geun
- Communications for Statistical Applications and Methods / v.9 no.2 / pp.473-478 / 2002
A test for a single outlier in multivariate regression with linear constraints on the regression coefficients is derived using a mean shift model. It is shown that influential observations based on case deletions in testing linear hypotheses are determined by two types of outliers: mean shift outliers with linear constraints and mean shift outliers without them. An illustrative example is given.
https://doi.org/10.5351/CKSS.2002.9.2.473

Seo, Han Son;Yoon, Min
- The Korean Journal of Applied Statistics / v.29 no.4 / pp.699-706 / 2016
Outlier detection methods that do not perform a test often fail to detect multiple outliers because they are structurally vulnerable to a masking effect or a swamping effect. This paper considers testing procedures that supplement a clustering-based method, which identifies the group containing a minority of the observations as outliers. One of the general steps is performing a variety of t-tests on individual outlier candidates. This paper proposes a sequential procedure that searches for outliers by changing cutoff values on a cluster tree and performing a test on a set of outlier candidates. The proposed method is illustrated and compared to existing methods through an example and Monte Carlo studies.
https://doi.org/10.5351/KJAS.2016.29.4.699

Park, Cheong Hee
- Journal of Korea Multimedia Society / v.24 no.9 / pp.1242-1250 / 2021
Detecting outliers that deviate from the normal data distribution in high dimensional data is an important technique in many application areas. In this paper, a distance-based outlier detection method using landmarks in high dimensional data is proposed. Given normal training data, the k-means clustering method is applied to the training data in order to extract the cluster centers as landmarks that represent the normal data distribution. For a test data sample, the distance to the nearest landmark gives the outlier score. Experiments using high dimensional data such as images and documents showed that the proposed method, with landmarks amounting to one-tenth of the training data, can give comparable outlier detection performance while greatly reducing the time complexity of the testing stage.
https://doi.org/10.9717/kmms.2021.24.9.1242
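The landmark idea in the abstract above can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the author's implementation: the function names (`kmeans_landmarks`, `outlier_score`), the plain Lloyd's k-means loop, the Euclidean distance, the toy two-dimensional data, and the choice of `k=2` are all hypothetical. The abstract reports using landmarks amounting to one-tenth of the training data on high dimensional inputs.

```python
import math
import random


def kmeans_landmarks(points, k, iterations=20, seed=0):
    """Run plain Lloyd's k-means and return the k cluster centers (landmarks)."""
    rng = random.Random(seed)
    # Initialize the centers from k distinct training points.
    centers = [list(p) for p in rng.sample(points, k)]
    for _ in range(iterations):
        # Assign every training point to its nearest center.
        clusters = [[] for _ in range(k)]
        for p in points:
            nearest = min(range(k), key=lambda j: math.dist(p, centers[j]))
            clusters[nearest].append(p)
        # Move each center to the mean of its cluster; keep it if the cluster is empty.
        for j, members in enumerate(clusters):
            if members:
                centers[j] = [sum(col) / len(members) for col in zip(*members)]
    return centers


def outlier_score(sample, landmarks):
    """Outlier score: distance from the sample to its nearest landmark."""
    return min(math.dist(sample, c) for c in landmarks)


# Hypothetical toy data: two tight groups of "normal" training points.
rng = random.Random(1)
train = [(rng.gauss(0, 0.1), rng.gauss(0, 0.1)) for _ in range(50)]
train += [(rng.gauss(5, 0.1), rng.gauss(5, 0.1)) for _ in range(50)]

landmarks = kmeans_landmarks(train, k=2)
score_normal = outlier_score((0.05, -0.02), landmarks)   # close to a landmark
score_outlier = outlier_score((2.5, 2.5), landmarks)     # far from both landmarks
```

At test time each sample is compared only against the landmarks rather than against every training point, which is the source of the reduced testing cost described in the abstract.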