Korean Journal of Applied Statistics
The Korean Statistical Society
An Analysis for the Structural Variation in the Unemployment Rate and the Test for the Turning Point
Kim, Tae-Ho ; Hwang, Sung-Hye ; Lee, Young-Hoon ;
Korean Journal of Applied Statistics, volume 18, issue 2, 2005, Pages 253~269
DOI : 10.5351/KJAS.2005.18.2.253
One of the basic assumptions of regression models is that the parameter vector does not vary across sample observations. If the parameter vector is not constant for all observations in the sample, the statistical model changes and the usual least squares estimators no longer yield unbiased, consistent, and efficient estimates. This study investigates regression models in which some or all parameters vary across partitions of the whole sample, so that the model permits different response coefficients during unusual time periods. Since the usual test for overall homogeneity of regressions across partitions of the sample does not explicitly identify the break points between the partitions, the test of equality between subsets of coefficients in two or more linear regressions is generalized and combined with a procedure that searches for the break point. The method is applied to assess whether a structural change occurred in the long-run unemployment rate, and to locate its turning point, in the usual static framework using the regression model. The relationships between the variables included in the model are then reexamined in a dynamic framework using vector autoregression.
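The break-point search described in this abstract can be illustrated with a minimal sketch. It assumes the classical Chow-type F statistic for a simple regression with one regressor, evaluated at every admissible split. This is not the authors' generalized procedure, and the names `ols_ssr`, `chow_f`, `search_break` and the `min_seg` parameter are hypothetical.

```python
def ols_ssr(xs, ys):
    """Residual sum of squares from a simple OLS fit y = a + b*x."""
    n = len(xs)
    mx = sum(xs) / n
    my = sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    b = sxy / sxx
    a = my - b * mx
    return sum((y - a - b * x) ** 2 for x, y in zip(xs, ys))


def chow_f(xs, ys, split):
    """Chow-type F statistic for a break between index split-1 and split."""
    k = 2  # parameters per regime: intercept and slope
    n = len(xs)
    ssr_pooled = ols_ssr(xs, ys)
    ssr_split = ols_ssr(xs[:split], ys[:split]) + ols_ssr(xs[split:], ys[split:])
    return ((ssr_pooled - ssr_split) / k) / (ssr_split / (n - 2 * k))


def search_break(xs, ys, min_seg=3):
    """Return the split index with the largest F statistic.

    min_seg is an assumed minimum number of observations per regime.
    """
    candidates = range(min_seg, len(xs) - min_seg + 1)
    return max(candidates, key=lambda s: chow_f(xs, ys, s))
```

Because the maximum is taken over many candidate splits, the largest F value does not follow the ordinary F distribution, so a formal test needs critical values appropriate to the search.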
Comparison between Kriging and GWR for the Spatial Data
Kim Sun-Woo ; Jeong Ae-Ran ; Lee Sung-Duck ;
Korean Journal of Applied Statistics, volume 18, issue 2, 2005, Pages 271~280
DOI : 10.5351/KJAS.2005.18.2.271
Kriging methods, as traditional spatial data analysis methods, and geographically weighted regression (GWR) models, as statistical analysis methods, are compared. In this paper, we apply both to data from the Ministry of Environment as a practical study. We compare the performance of these methods on monthly carbon monoxide observations taken at 116 air pollution monitoring sites in 1999.
Analysis of Periodicity of Meteorological Measures and Their Effects on Precipitation Observed with Surface Meteorological Instruments at Eight Southwestern Areas, Korea during 2004KEOP
Kim Hea-Jung ; Yum Joonkeun ; Lee Yung-Seop ; Kim Young-Ah ; Chung Hyo-Sang ; Cho Chun-Ho ;
Korean Journal of Applied Statistics, volume 18, issue 2, 2005, Pages 281~296
DOI : 10.5351/KJAS.2005.18.2.281
This article summarizes our research on the estimation of area-specific and time-adjusted rainfall rates during 2004KEOP (Korea Enhanced Observation Period: June 1 to August 31, 2004). The rainfall rate is defined as the proportion of rainfall days per week, and the areas are Haenam, Yeosu, Janghung, Heuksando, Gwangju, Mokpo, Jindo, and Wando. Our objectives are to analyze periodicity in area-specific precipitation and the meteorological measures, and to investigate the relationships between the geographic pattern of the rainfall rates and the corresponding pattern in potential explanatory covariates such as temperature, wind, wind direction, pressure, and humidity. A generalized linear model is introduced to meet these objectives, and the patterns are estimated by considering a set of rainfall rates produced using samples from the posterior distribution of the population rainfall rates.
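The rainfall rate defined in this abstract, the proportion of rainfall days per week, can be computed as in the sketch below. The input format (one 0/1 flag per day) and the function name `weekly_rainfall_rates` are assumptions made for illustration. Trailing days that do not fill a whole week are dropped here, which may differ from the authors' handling.

```python
def weekly_rainfall_rates(rain_days):
    """Proportion of rainy days in each complete 7-day week.

    rain_days: list of 0/1 flags, one per day, in calendar order.
    """
    rates = []
    for start in range(0, len(rain_days) - 6, 7):
        week = rain_days[start:start + 7]
        rates.append(sum(week) / 7)
    return rates
```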
Haplotype-Based Association and Linkage Analysis of Angiotensin-I Converting Enzyme(ACE) Gene with a Hypertension
Kim Jinheum ; Nam Chung Mo ; Kang Dae Ryong ; Suh Il ;
Korean Journal of Applied Statistics, volume 18, issue 2, 2005, Pages 297~310
DOI : 10.5351/KJAS.2005.18.2.297
In this study we investigate the association between the haplotype block of 4 SNPs in the ACE gene and hypertension, using a case-control dataset of size 277 and data on 40 families collected from the Kangwha studies. To this end we perform a haplotype-based case-control association study and a haplotype-based TDT study. We repeat the analysis with tag-SNPs that can identify the haplotype block. Through a cladogram analysis we construct the evolutionary tree of haplotypes and then classify the haplotypes into a few clades by grouping haplotypes that are exposed to the disease to the same extent. We also discuss the association between these clades and hypertension.
Weighted Hot-Deck Imputation in Farm and Fishery Household Economy Surveys
Kim Kyu-Seong ; Lee Kee-Jae ; Kim Jin ;
Korean Journal of Applied Statistics, volume 18, issue 2, 2005, Pages 311~328
DOI : 10.5351/KJAS.2005.18.2.311
This paper deals with the treatment of nonresponse in the farm and fishery household economy surveys in Korea. Since the samples in the two surveys were selected by stratified multi-stage sampling and weighted sample means have been used to estimate the population means, we choose weighted hot-deck imputation as an appropriate method for the two surveys. We investigate the weighted hot-deck procedure as well as an adjusted jackknife method for variance estimation. Through an empirical study we found that the method worked very well for both mean and variance estimation in the two surveys. In addition, we present a procedure for forming imputation classes, form four imputation classes for each survey, and compare them empirically. As a result, we present the two most efficient imputation classes for the two surveys.
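A minimal sketch of weighted hot-deck imputation follows, assuming the common variant in which a donor is drawn within the same imputation class with probability proportional to its survey weight. The record layout (keys `cls`, `weight`, `y`) and the function names are hypothetical, and the adjusted jackknife variance estimator mentioned in the abstract is not shown.

```python
import random


def weighted_hot_deck(records, rng):
    """Fill missing 'y' values from donors in the same imputation class.

    records: list of dicts with keys 'cls', 'weight', 'y' (None if missing).
    rng: a random.Random instance, passed in so results are reproducible.
    Raises KeyError if a class contains no respondent to act as donor.
    """
    donors = {}
    for r in records:
        if r["y"] is not None:
            donors.setdefault(r["cls"], []).append(r)

    completed = []
    for r in records:
        if r["y"] is None:
            pool = donors[r["cls"]]
            # Donor selection probability is proportional to the survey weight.
            donor = rng.choices(pool, weights=[d["weight"] for d in pool], k=1)[0]
            completed.append({**r, "y": donor["y"], "imputed": True})
        else:
            completed.append({**r, "imputed": False})
    return completed


def weighted_mean(records):
    """Weighted sample mean of 'y' over a completed data set."""
    total_w = sum(r["weight"] for r in records)
    return sum(r["weight"] * r["y"] for r in records) / total_w
```

Passing a seeded generator, for example `weighted_hot_deck(records, random.Random(0))`, keeps the imputed data set reproducible across runs.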
A Comparative Study of Determining the Number of Clusters with a Method Proposed
Chae, Seong-San ; Lim, Nam-Kyoo ;
Korean Journal of Applied Statistics, volume 18, issue 2, 2005, Pages 329~341
DOI : 10.5351/KJAS.2005.18.2.329
A method of determining the number of clusters is proposed, based on some asymptotic results on Rand's (1971) statistic indexed by k = 2, 3, ..., N - 1. Simulation is conducted to compare the proposed method with those of Chae and Warde (1991) and Huh and Lee (2004).
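For orientation, the sketch below computes the Rand (1971) index between two partitions of the same objects, that is, the proportion of object pairs on which the partitions agree. How the paper evaluates the statistic for each k and derives a stopping rule from it is not stated in the abstract, so this shows only the basic index. The function name `rand_index` is an assumption.

```python
from itertools import combinations


def rand_index(labels_a, labels_b):
    """Proportion of object pairs treated the same way by both partitions.

    A pair agrees when it is together in both partitions or apart in both.
    """
    agree = 0
    total = 0
    for i, j in combinations(range(len(labels_a)), 2):
        same_a = labels_a[i] == labels_a[j]
        same_b = labels_b[i] == labels_b[j]
        agree += (same_a == same_b)
        total += 1
    return agree / total
```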
An Empirical Comparison of Bagging, Boosting and Support Vector Machine Classifiers in Data Mining
Lee Yung-Seop ; Oh Hyun-Joung ; Kim Mee-Kyung ;
Korean Journal of Applied Statistics, volume 18, issue 2, 2005, Pages 343~354
DOI : 10.5351/KJAS.2005.18.2.343
The goal of this paper is to compare classification performance and to find the better classifier based on the characteristics of the data. The compared methods are CART with two ensemble algorithms, bagging and boosting, and SVM. In an empirical study of twenty-eight data sets, we found that SVM has a smaller error rate than the other methods on most of the data sets. When comparing bagging, boosting, and SVM based on the characteristics of the data, the SVM algorithm is suited to data with a small number of observations and no missing values. On the other hand, the boosting algorithm is suited to data with a large number of observations, and the bagging algorithm is suited to data with missing values.
Exploratory Analysis of Gene Expression Data Using Biplot
Park, Mi-Ra ;
Korean Journal of Applied Statistics, volume 18, issue 2, 2005, Pages 355~369
DOI : 10.5351/KJAS.2005.18.2.355
Genome sequencing and microarray technology produce ever-increasing amounts of complex data that need statistical analysis. Visualization is an effective analytic technique that exploits the ability of the human brain to process large amounts of data. In this study, the biplot approach is applied to microarray data to examine the relationship between genes and samples. A supplementary data method for classifying a new sample into a known category is suggested. The methods are validated by applying them to well-known microarray data sets such as Golub et al. (1999), Alizadeh et al. (2000), and Ross et al. (2000). The results are compared with those of several clustering methods. A modified graph that combines a partitioning method and the biplot is also suggested.
A Pattern Consistency Index for Detecting Heterogeneous Time Series in Clustering Time Course Gene Expression Data
Son, Young-Sook ; Baek, Jang-Sun ;
Korean Journal of Applied Statistics, volume 18, issue 2, 2005, Pages 371~379
DOI : 10.5351/KJAS.2005.18.2.371
In this paper, we propose a pattern consistency index for detecting heterogeneous time series that deviate from the representative pattern of each cluster in clustering time course gene expression data using the Pearson correlation coefficient. We examine its usefulness by applying this index to serum time course gene expression data from microarrays.
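The abstract does not give the definition of the pattern consistency index, so the sketch below is only an illustration of the general idea under a stated assumption: each series is compared with its cluster's mean profile using the Pearson correlation coefficient, and series below a threshold are flagged. The function names and the `threshold` value are hypothetical and are not taken from the paper.

```python
from math import sqrt


def pearson(u, v):
    """Pearson correlation coefficient of two equal-length sequences.

    Raises ZeroDivisionError if either sequence is constant.
    """
    n = len(u)
    mu = sum(u) / n
    mv = sum(v) / n
    cov = sum((a - mu) * (b - mv) for a, b in zip(u, v))
    su = sqrt(sum((a - mu) ** 2 for a in u))
    sv = sqrt(sum((b - mv) ** 2 for b in v))
    return cov / (su * sv)


def flag_heterogeneous(cluster, threshold=0.5):
    """Indices of series weakly correlated with the cluster mean profile.

    cluster: list of time series (lists of equal length) in one cluster.
    threshold: assumed cutoff on the correlation, chosen for illustration.
    """
    centroid = [sum(col) / len(col) for col in zip(*cluster)]
    return [i for i, s in enumerate(cluster) if pearson(s, centroid) < threshold]
```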
Asymptotic Test for Dimensionality in Sliced Inverse Regression
Park, Chang-Sun ; Kwak, Jae-Guen ;
Korean Journal of Applied Statistics, volume 18, issue 2, 2005, Pages 381~393
DOI : 10.5351/KJAS.2005.18.2.381
As a promising technique for dimension reduction in regression analysis, Sliced Inverse Regression (SIR) and an associated chi-square test for dimensionality were introduced by Li (1991). However, Li's test requires the assumption of normality for the predictors and has been found to depend heavily on the number of slices. We provide a unified asymptotic test for determining the dimensionality of the SIR model, which is based on probabilistic principal component analysis and is free of the normality assumption on the predictors. Illustrative results with simulated and real examples are also provided.
Analysis of Large Tables
Choi, Hyun-Jip ;
Korean Journal of Applied Statistics, volume 18, issue 2, 2005, Pages 395~410
DOI : 10.5351/KJAS.2005.18.2.395
For the analysis of large tables formed by many categorical variables, we suggest a method to group the variables into several disjoint groups in which the variables are completely associated within the groups. We use a simple function of Kullback-Leibler divergence as a similarity measure to find the groups. Since the groups are complete hierarchical sets, we can identify the association structure of the large tables by the marginal log-linear models. Examples are introduced to illustrate the suggested method.
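The abstract does not specify which function of the Kullback-Leibler divergence is used. One natural candidate, shown here purely as an assumption, is the divergence between the observed joint distribution of two categorical variables and the product of their marginals (the mutual information), which is zero under independence and grows with the strength of association. The function name `mutual_information` is hypothetical.

```python
from math import log


def mutual_information(table):
    """KL divergence between the joint distribution and the product of marginals.

    table: two-way contingency table as a list of rows of non-negative counts.
    Empty cells contribute zero, following the convention 0 * log(0) = 0.
    """
    n = sum(sum(row) for row in table)
    row_tot = [sum(row) for row in table]
    col_tot = [sum(col) for col in zip(*table)]
    mi = 0.0
    for i, row in enumerate(table):
        for j, count in enumerate(row):
            if count > 0:
                # count * n / (row total * column total) equals p_ij / (p_i. * p_.j)
                mi += (count / n) * log(count * n / (row_tot[i] * col_tot[j]))
    return mi
```

Computing this measure for every pair of variables gives a similarity matrix that a grouping procedure could then work from.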
Comparison of MIVQUE Estimators Using EQDGs for the One-way Random Model with Unbalanced Data
Jung, Byoung-Cheol ;
Korean Journal of Applied Statistics, volume 18, issue 2, 2005, Pages 411~420
DOI : 10.5351/KJAS.2005.18.2.411
In this study, the MIVQUE estimators of variance components for the one-way random model with unbalanced data are investigated. In order to compare the efficiency of MIVQUE estimators obtained by using three a priori estimates, Empirical Quantile Dispersion Graphs (EQDGs) are used. From the results of a Monte Carlo study, the MIVQUE estimator using [...] as the a priori estimate performs well relative to the other estimators.
Study on Support Vector Machines Using Mathematical Programming
Yoon, Min ; Lee, Hak-Bae ;
Korean Journal of Applied Statistics, volume 18, issue 2, 2005, Pages 421~434
DOI : 10.5351/KJAS.2005.18.2.421
Machine learning has been extensively studied in recent years as an effective tool for pattern classification problems. Although there are several approaches to machine learning, we focus on the mathematical programming approaches (in particular, multi-objective and goal programming; MOP/GP) in this paper. Among them, the Support Vector Machine (SVM) has recently been gaining popularity. In a pattern classification problem with two classes, the idea is to find a maximal-margin separating hyperplane, which gives the greatest separation between the classes in a high-dimensional feature space. However, the idea of maximal margin separation is not entirely new: in the 1960s the multi-surface method (MSM) was suggested by Mangasarian, and in the 1980s linear classifiers using goal programming were developed extensively. This paper proposes a new family of SVMs using MOP/GP techniques and discusses its effectiveness through several numerical experiments.
Variable Selection for Logistic Regression Model Using Adjusted Coefficients of Determination
Hong C. S. ; Ham J. H. ; Kim H. I. ;
Korean Journal of Applied Statistics, volume 18, issue 2, 2005, Pages 435~443
DOI : 10.5351/KJAS.2005.18.2.435
Coefficients of determination in logistic regression analysis are defined by various statistics, and their values are relatively smaller than those for the linear regression model. These coefficients of determination are not generally used to evaluate and diagnose logistic regression models. Liao and McGee (2003) proposed two adjusted coefficients of determination that are robust to the addition of inappropriate predictors and to variation in sample size. In this work, these adjusted coefficients of determination are applied to variable selection for the logistic regression model and compared with the results of other methods such as forward selection, backward elimination, stepwise selection, and the AIC statistic.
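As background for the "various statistics" mentioned in this abstract, the sketch below computes one widely used definition, McFadden's likelihood-ratio index, from observed binary outcomes and fitted probabilities. It is not one of the adjusted coefficients of Liao and McGee (2003), whose formulas are not given in the abstract. The function names are assumptions.

```python
from math import log


def log_likelihood(y, p):
    """Bernoulli log-likelihood of 0/1 outcomes y under probabilities p."""
    return sum(log(pi) if yi == 1 else log(1 - pi) for yi, pi in zip(y, p))


def mcfadden_r2(y, p_model):
    """McFadden's index: 1 - logL(model) / logL(intercept-only model).

    The intercept-only model predicts the sample proportion of ones for
    every observation, so y must contain both zeros and ones.
    """
    p_null = sum(y) / len(y)
    ll_null = log_likelihood(y, [p_null] * len(y))
    ll_model = log_likelihood(y, p_model)
    return 1 - ll_model / ll_null
```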
Re-consideration of Cuboctahedral Die
Jang Dae-Heung ;
Korean Journal of Applied Statistics, volume 18, issue 2, 2005, Pages 445~452
DOI : 10.5351/KJAS.2005.18.2.445
We consider the frequency and classical probabilities for a cuboctahedral die with 8 hexagons and 6 squares and compare them with the frequency and logical probabilities for a cuboctahedral die with 8 triangles and 6 squares.
A Study on Probability and Statistics Education in Practical Mathematics and Mathematics I Textbooks According to the 7th National Mathematics Curriculum in Korea
Jang Dae-Heung ; Lee Hyo-Jeong ;
Korean Journal of Applied Statistics, volume 18, issue 2, 2005, Pages 453~469
DOI : 10.5351/KJAS.2005.18.2.453
In Korea, mathematics education for 11th and 12th grade students has followed the 7th national mathematics curriculum, which was revised through an announcement by the Ministry of Education and Human Resources Development in 1997. Probability and statistics are taught as part of this curriculum. We analyze the mathematics textbooks Practical Mathematics and Mathematics I and compare the 7th national mathematics curriculum with the 6th.
Collapsibility Using Raindrop Plot
Hong C. S. ; Kim B. J. ; Park J. Y. ;
Korean Journal of Applied Statistics, volume 18, issue 2, 2005, Pages 471~485
DOI : 10.5351/KJAS.2005.18.2.471
For categorical data analysis, collapsibility has been explained in terms of the odds ratio (cross-product ratio). When the theory based on these odds ratios is applied to real contingency tables, it is impossible to decide whether the data are collapsible. Among graphical methods for representing odds ratios, the contour plot developed by Doi, Nakamura and Yamamoto (2001) can explain the structure of the data but cannot decide on collapsibility. In this paper, using the raindrop plot proposed by Barrowman and Myers (2003), we suggest an alternative method that can both explain the structure of the data and decide on collapsibility.
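The role of the odds ratio in collapsibility can be seen in a small sketch that compares the conditional odds ratios within each stratum of a 2 x 2 x K table with the marginal odds ratio of the collapsed 2 x 2 table. The function names and the example counts in the usage note are hypothetical, and the raindrop plot itself is not reproduced here.

```python
def odds_ratio(table):
    """Cross-product ratio (a*d) / (b*c) of a 2 x 2 table [[a, b], [c, d]].

    Raises ZeroDivisionError if b or c is zero.
    """
    (a, b), (c, d) = table
    return (a * d) / (b * c)


def collapse(strata):
    """Sum a list of 2 x 2 tables cell by cell over the stratifying variable."""
    return [[sum(t[i][j] for t in strata) for j in range(2)] for i in range(2)]
```

For example, the strata `[[80, 20], [40, 10]]` and `[[10, 40], [20, 80]]` each have a conditional odds ratio of 1, while the collapsed table `[[90, 60], [60, 90]]` has a marginal odds ratio of 2.25, so collapsing over the third variable would distort the association.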
A Suggestion on Instruction Service Quality Assessment Tool
Choi Kyoung-Ho ; Lee Seung-Joo ;
Korean Journal of Applied Statistics, volume 18, issue 2, 2005, Pages 487~497
DOI : 10.5351/KJAS.2005.18.2.487
Instruction assessment, which can be a tool for achieving high-quality classes, evaluates the instruction system and process. The resulting data are analyzed and used to reorganize lecture content or to improve teaching methods, which raises the quality of university education. The goal of instruction assessment is to serve as a tool for providing high-quality education and raising students' satisfaction. It is important to improve education service quality through instruction assessment as a useful tool for measuring service quality. This paper suggests a valid tool for improving instruction service quality.