REFERENCE LINKING PLATFORM OF KOREA S&T JOURNALS
Journal of the Korean Data and Information Science Society
Journal Basic Information
Korean Data and Information Science Society
Volume & Issues
Volume 23, Issue 6 - Nov 2012
Volume 23, Issue 5 - Sep 2012
Volume 23, Issue 4 - Jul 2012
Volume 23, Issue 3 - May 2012
Volume 23, Issue 2 - Mar 2012
Volume 23, Issue 1 - Jan 2012
Smoothing Kaplan-Meier estimate using monotone support vector regression
Hwang, Changha ; Shim, Jooyong ;
Journal of the Korean Data and Information Science Society, volume 23, issue 6, 2012, Pages 1045~1054
DOI : 10.7465/jkdi.2012.23.6.1045
The support vector machine is known to be a very useful statistical method for classification and nonlinear function estimation. In this paper we propose a monotone support vector regression (SVR) for estimating a monotonically decreasing function. The proposed monotone SVR is applied to smooth the Kaplan-Meier estimate of the survival function. Experimental results on survival functions generated from the exponential distribution are presented to illustrate the performance of the proposed monotone SVR.
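As a rough illustration of the step function that such a smoother starts from, the following minimal sketch computes a plain Kaplan-Meier estimate. It is not the monotone SVR proposed in the paper; the function name `kaplan_meier` and the toy data are assumptions made for illustration only.

```python
def kaplan_meier(times, events):
    """Plain Kaplan-Meier estimate (illustrative sketch, not the paper's method).

    times  -- observed times (event or censoring)
    events -- 1 if the event was observed at that time, 0 if censored
    Returns a list of (time, survival probability) at each distinct event time.
    """
    data = sorted(zip(times, events))
    n_at_risk = len(data)
    surv = 1.0
    curve = []
    i = 0
    while i < len(data):
        t = data[i][0]
        deaths = 0
        removed = 0
        # Group all observations that share the same time point.
        while i < len(data) and data[i][0] == t:
            deaths += data[i][1]
            removed += 1
            i += 1
        if deaths > 0:
            surv *= 1.0 - deaths / n_at_risk
            curve.append((t, surv))
        # Both events and censored observations leave the risk set.
        n_at_risk -= removed
    return curve


# Hypothetical toy data: the observation at time 2 is censored.
print(kaplan_meier([1, 2, 3, 4], [1, 0, 1, 1]))
```

The resulting curve is a non-increasing step function, which is why a monotonicity constraint is a natural requirement for any smoothed version of it.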
The change of relative importance in athletes self management test through AHP
Yu, Kwang-Kil ; Chung, Sung-Suk ;
Journal of the Korean Data and Information Science Society, volume 23, issue 6, 2012, Pages 1055~1064
DOI : 10.7465/jkdi.2012.23.6.1055
This research used the analytic hierarchy process (AHP) to quantify the relative importance of the checkup items, in order to address the problems raised by the uniform numerical scoring in the established athletes' self-management test. The results are as follows. The relative importance of the top-level factors of the test follows the order training management, mental care, body care, and interpersonal management. It was also confirmed that the relative importance varied across the checkup items within the lower-level factors. Consequently, as an initial investigation of relative importance in the athletes' self-management test, this study shows that the opinions of sports-field experts can be reflected in the test. The study is expected to be helpful for enhancing athletes' competence in the sports field.
Effects of on-base and slugging ability on run productivity in Korean professional baseball
Kim, Hyuk Joo ;
Journal of the Korean Data and Information Science Society, volume 23, issue 6, 2012, Pages 1065~1074
DOI : 10.7465/jkdi.2012.23.6.1065
The purpose of this paper is to statistically analyze the effects of on-base and slugging ability on run productivity in Korean professional baseball. In Section 2, we investigate the OPS (On-base percentage Plus Slugging average) and introduce new indices of batting ability by modifying the OPS. In Section 3, we examine the correlation of the batting average, on-base percentage, slugging average, IsoP (Isolated Power), OPS, and the indices introduced in Section 2 with the average runs per game, using data from all regular-season games in 2007~2011. In addition, by generalizing the OPS and the indices introduced in Section 2, we analyze the correlation between indices with various weights and the average runs per game. As a result, the weighted OPS consisting of on-base percentage (with weight 57%) and slugging average (with weight 43%) has been found to give the best explanation of run productivity.
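The weighting reported in the abstract can be sketched as a simple convex combination. The function names and the sample batting figures below are hypothetical; only the 57%/43% weights come from the abstract, and the exact form of the paper's index may differ.

```python
def ops(obp, slg):
    """Conventional OPS: on-base percentage plus slugging average."""
    return obp + slg


def weighted_ops(obp, slg, w_obp=0.57):
    """Weighted OPS with weight w_obp on OBP and (1 - w_obp) on SLG.

    The default 0.57 / 0.43 split is the one reported in the abstract;
    treating the index as a plain weighted sum is an assumption.
    """
    return w_obp * obp + (1.0 - w_obp) * slg


# Hypothetical team figures, for illustration only.
print(ops(0.350, 0.450))
print(weighted_ops(0.350, 0.450))
```

With equal weights the index is simply half of the conventional OPS, so the weighted form generalizes it rather than replacing it.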
A study on the information transfer effect among the China stock markets
Lee, Sang-Woo ; Lee, Eui-Kyung ;
Journal of the Korean Data and Information Science Society, volume 23, issue 6, 2012, Pages 1075~1084
DOI : 10.7465/jkdi.2012.23.6.1075
This study examines stock market co-movement among three Chinese stock markets: the Shanghai, Shenzhen, and Hong Kong stock markets. The US stock market leads the three Chinese stock markets, and the Hong Kong stock market leads the Shanghai and Shenzhen stock markets. However, there are no lead-lag effects among the Chinese stock markets after controlling for the US stock market effect. These results could be important for investors and firms that are interested in the Chinese stock markets.
Adjusted maximum tolerated dose estimation by stopping rule in phase I clinical trial
Park, Ju Hee ; Kim, Dongjae ;
Journal of the Korean Data and Information Science Society, volume 23, issue 6, 2012, Pages 1085~1091
DOI : 10.7465/jkdi.2012.23.6.1085
Phase I clinical trials are designed to identify an appropriate dose, the maximum tolerated dose (MTD), which assures the safety of a new drug by evaluating the toxicity at each dose level. In this research, an adjusted MTD estimation method based on a stopping rule is presented for phase I clinical trials. The suggested MTD estimation method is compared with the standard method3 and the NM method using a Monte Carlo simulation study.
Prediction of the interest spread using VAR model
Kim, Junhong ; Jin, Dalae ; Lee, Jisun ; Kim, Suji ; Son, Young Sook ;
Journal of the Korean Data and Information Science Society, volume 23, issue 6, 2012, Pages 1093~1102
DOI : 10.7465/jkdi.2012.23.6.1093
In this paper, we predict the interest spread using the VAR (vector autoregressive) model. The variables used in the VAR model were selected from 56 domestic and foreign macroeconomic time series through cross-correlation analysis and Granger causality tests. The performance of the VAR model is compared with that of a univariate time series model, the AR (autoregressive) model, in terms of the MAPE (mean absolute percentage error) and RMSE (root mean square error) of forecasts for the last twelve months.
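The two forecast accuracy measures named in the abstract can be sketched as follows. The function names and the numbers are hypothetical, and MAPE is expressed here in percent, which is one common convention.

```python
import math


def mape(actual, forecast):
    """Mean absolute percentage error, in percent (actual values must be nonzero)."""
    return 100.0 * sum(abs((a - f) / a) for a, f in zip(actual, forecast)) / len(actual)


def rmse(actual, forecast):
    """Root mean square error, in the same units as the data."""
    return math.sqrt(sum((a - f) ** 2 for a, f in zip(actual, forecast)) / len(actual))


# Hypothetical values, for illustration only.
actual = [100.0, 200.0]
forecast = [110.0, 180.0]
print(mape(actual, forecast), rmse(actual, forecast))
```

MAPE is scale-free, while RMSE penalizes large errors more heavily, so the two measures can rank competing models differently.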
The effect of onion on hyperlipidemia: Meta-analysis
Choi, Kiheon ;
Journal of the Korean Data and Information Science Society, volume 23, issue 6, 2012, Pages 1103~1115
DOI : 10.7465/jkdi.2012.23.6.1103
In this study, we examine the effect of onion on hyperlipidemia in terms of factors such as body weight, liver weight, kidney weight, heart weight, blood glucose, total cholesterol, triglycerides, HDL-cholesterol, and LDL-cholesterol. Under the fixed-effect model, the hyperlipidemia supplement was significantly effective on liver weight, kidney weight, blood glucose, total cholesterol, triglycerides, and LDL-cholesterol. However, under the random-effect model applied to the heterogeneous factors selected by the Galbraith plot, liver weight, blood glucose, total cholesterol, and triglycerides were significantly decreased. The existence of publication bias was checked using a funnel plot.
Models for forecasting food poisoning occurrences
Yeo, In-Kwon ;
Journal of the Korean Data and Information Science Society, volume 23, issue 6, 2012, Pages 1117~1125
DOI : 10.7465/jkdi.2012.23.6.1117
The occurrence of food poisoning is usually modeled by meteorological variables such as temperature and humidity. In this paper, we investigate the relationship between food poisoning occurrences and climate variables in Korea and compare the Poisson regression model and the autoregressive moving average model in order to select a forecast model. We confirm that lagged climate variables affect food poisoning occurrences. However, from the viewpoint of prediction, it turns out that the number of previous occurrences has more influence on the current occurrence than the meteorological variables do, and that the Poisson regression model is less reliable.
Exploration of PIM based similarity measures as association rule thresholds
Park, Hee Chang ;
Journal of the Korean Data and Information Science Society, volume 23, issue 6, 2012, Pages 1127~1135
DOI : 10.7465/jkdi.2012.23.6.1127
Association rule mining is a method for quantifying the relationship between sets of items in a large database. The exploration of association rules is one of the well-studied problems in data mining. There are three primary quality measures for association rules: support, confidence, and lift. Association rules are usually generated using confidence. Confidence is the most important of these measures, but it is asymmetric and takes only positive values, which can cause difficulties in generating association rules. In this paper we apply similarity measures based on the probabilistic interestingness measure (PIM) to address this problem. A numerical example compares support, two confidences, lift, and several PIM-based similarity measures. As a result, we find that the PIM-based similarity measures reflect the degree of association just as confidence does. We can also confirm the direction of association, because their values carry a sign.
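The three standard measures named in the abstract can be sketched as follows. This illustrates only support, confidence, and lift, not the PIM-based similarity measures studied in the paper; the function names and the toy baskets are hypothetical.

```python
def support(transactions, items):
    """Fraction of transactions that contain every item in `items`."""
    items = set(items)
    return sum(1 for t in transactions if items <= set(t)) / len(transactions)


def confidence(transactions, lhs, rhs):
    """Estimated P(rhs | lhs) for the rule lhs -> rhs."""
    return support(transactions, set(lhs) | set(rhs)) / support(transactions, lhs)


def lift(transactions, lhs, rhs):
    """Confidence of lhs -> rhs divided by the support of rhs."""
    return confidence(transactions, lhs, rhs) / support(transactions, rhs)


# Hypothetical baskets, for illustration only.
baskets = [{"a", "b"}, {"a", "b"}, {"a"}, {"a"}]
print(confidence(baskets, {"a"}, {"b"}))  # rule a -> b
print(confidence(baskets, {"b"}, {"a"}))  # rule b -> a
print(lift(baskets, {"a"}, {"b"}))
```

The two confidence values differ for the same pair of items, which shows the asymmetry mentioned in the abstract, and neither value can be negative.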
Study on the effectiveness of english-medium class
Cho, Jang Sik ;
Journal of the Korean Data and Information Science Society, volume 23, issue 6, 2012, Pages 1137~1144
DOI : 10.7465/jkdi.2012.23.6.1137
Many universities increasingly stress the importance of English-medium classes in order to improve their international competitiveness and internationalization. In this paper, we compare English-medium classes with Korean classes using course evaluation scores. We also analyze the factors that affect the effectiveness of English-medium classes as measured by the course evaluation score. First, logistic regression analysis is used to examine the main effects of subject and individual characteristics. Then, decision tree analysis is used to examine the interaction effects of subject and individual characteristics. The results are as follows. Grade, department category, class size, GPA, and screening method affect the effectiveness of English-medium classes. The group with the highest effectiveness consists of freshmen in the humanities department category. The group with the second highest effectiveness consists of freshmen with high GPAs in the nature and art department category.
Analysis on characteristics of person with disability using ratio of the lengths of the second to fourth digits
Kim, Sung-Jin ; Shin, Hyun-Uk ; Lee, Byong-Sun ; Choi, Ji-Won ; Choi, Kyounh-Ho ;
Journal of the Korean Data and Information Science Society, volume 23, issue 6, 2012, Pages 1145~1154
DOI : 10.7465/jkdi.2012.23.6.1145
Recent research has mostly focused on how differences in prenatal sex hormones could affect the ratio of the lengths of the second to fourth digits (2D:4D) as well as several physiological characteristics. It has been suggested that digit ratios are associated with higher prenatal testosterone levels, greater sensitivity to androgens, or both. However, studies of persons with disabilities are rare in Korea. The purpose of this study was to compare patterns of the 2D:4D ratio in persons with disabilities and students without disabilities. We also empirically analyze the characteristics of persons with disabilities. We found that the 2D:4D ratios of men, both those with disabilities and students without disabilities, were lower than those of women in the corresponding groups. Therefore, it could be confirmed that the 2D:4D ratio is a sexually dimorphic trait. There was no correlation between the 2D:4D ratio and age group in persons with disabilities. Therefore, as with persons without disabilities, the 2D:4D ratio appears to be unrelated to the growth process in persons with disabilities. Finally, there was no correlation between the 2D:4D ratio and the level of disability in persons with intellectual disabilities. One possible implication is that, if the 2D:4D ratio is related to disability, the current classification of the level of disability deserves further investigation.
Type II analysis by projections
Choi, Jaesung ;
Journal of the Korean Data and Information Science Society, volume 23, issue 6, 2012, Pages 1155~1163
DOI : 10.7465/jkdi.2012.23.6.1155
This paper suggests a method for obtaining the sums of squares due to sources of variation under a two-way fixed-effects model. The method used for calculating the quantities due to fixed effects is based on projections of the observation vector y onto the column space generated by the model matrix X under the assumed model. The suggested method shows that calculating Type II sums of squares by projections is much easier than the classical Type II analysis.
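The projection idea can be sketched in the following notation, which is assumed here for illustration and is not taken from the paper: P_M denotes the orthogonal projection onto the column space of a matrix M, 1 is the vector of ones, and X_A and X_B are the indicator columns of the two main effects.

```latex
P_M = M \left( M^{\top} M \right)^{-} M^{\top}, \qquad
SS(A \mid \mu, B) = y^{\top} \left( P_{[\mathbf{1}, X_A, X_B]} - P_{[\mathbf{1}, X_B]} \right) y, \qquad
SS(B \mid \mu, A) = y^{\top} \left( P_{[\mathbf{1}, X_A, X_B]} - P_{[\mathbf{1}, X_A]} \right) y .
```

Each Type II sum of squares for a main effect is thus the squared length of the part of y gained by adding that effect to a model that already contains the other main effect.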
The reality of benefits for retirement and the measures for annuitization of the occupational pension
Jung, Se Chang ;
Journal of the Korean Data and Information Science Society, volume 23, issue 6, 2012, Pages 1165~1172
DOI : 10.7465/jkdi.2012.23.6.1165
The purpose of this paper is to investigate the reality of the occupational pension and to suggest lifetime annuitization of the occupational pension as a means of securing income after retirement. A survey and empirical analyses such as regression and crosstabulation analysis are employed. Case studies of advanced countries are also examined. The crosstabulation analysis shows that the post-retirement situation of self-employed persons and of the lower income bracket is serious. An individual retirement pension is suggested for self-employed persons, and the Riester Pension in Germany is recommended for the lower income bracket. The cases of Australia and the UK are useful for lifetime annuitization of the occupational pension.
Mathematics education attitude of the students in the specialized high school
Kim, Minsuk ; Oh, Kwangsik ;
Journal of the Korean Data and Information Science Society, volume 23, issue 6, 2012, Pages 1173~1181
DOI : 10.7465/jkdi.2012.23.6.1173
In order to provide basic resources for mathematics education in specialized high schools, we investigate students' attitudes toward mathematics education. A questionnaire survey was carried out on 654 students, and we used statistical analyses such as the chi-square test, the gamma statistic, generalized linear models, ANOVA, and regression. Several results can be derived from the questionnaire analysis. There are differences between general and specialized high school students in interest, pre-learning ability, and other respects. The specialized school students place more importance on the usefulness of mathematics, while the general school students regard mathematics as more closely related to their course.
Handling the nonresponse in sample survey
Lee, Hwa-Jung ; Kang, Suk-Bok ;
Journal of the Korean Data and Information Science Society, volume 23, issue 6, 2012, Pages 1183~1194
DOI : 10.7465/jkdi.2012.23.6.1183
Nonresponse occurs frequently in surveys. Therefore, various methods for handling nonresponse have been applied in survey analysis. In this paper, the rates of occurrence of two types of nonresponse (unit nonresponse and item nonresponse) are presented using previous real survey data, and we compare complete data with data containing nonresponse. We suggest reasons for the occurrence of nonresponse and report the nonresponse rate using data collected through group interviews.
Extended warranty policy when minimal repair cost is a function of failure time
Jung, Ki Mun ;
Journal of the Korean Data and Information Science Society, volume 23, issue 6, 2012, Pages 1195~1202
DOI : 10.7465/jkdi.2012.23.6.1195
In this paper, we determine the expected total cost from the user's perspective for the replacement model with an extended warranty when the minimal repair cost is a function of the failure time. To do so, we define the extended warranty and assume a replacement model following the expiration of the extended warranty from the user's perspective. In particular, we propose a criterion for purchasing the extended warranty, and numerical examples are presented to illustrate it when the failure time of the system has a Weibull distribution.
On the development of DES round key generator based on Excel Macro
Kim, Daehak ;
Journal of the Korean Data and Information Science Society, volume 23, issue 6, 2012, Pages 1203~1212
DOI : 10.7465/jkdi.2012.23.6.1203
In this paper, we consider the development, based on Microsoft Excel macros, of a round key generator for DES (Data Encryption Standard), which was adopted as a FIPS (Federal Information Processing Standard) of the USA in 1977. A brief introduction to DES is given. Algorithms for the round key generator are adapted to Excel macros. By repeating the 16 rounds, each consisting of diffusion (which hides the relation between plain text and cipher text) and confusion (which hides the relation between cipher key and cipher text), with Microsoft Excel macros, we can easily obtain the desired DES round keys.
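The round key generation that the abstract describes can be sketched outside Excel as follows. This is a minimal Python sketch of the standard DES key schedule, not the paper's Excel macro: the PC-1 and PC-2 permutation tables and the shift schedule are those of the DES standard, while the function names and the sample key are assumptions made for illustration.

```python
# Permuted choice 1: selects 56 of the 64 key bits (parity bits are dropped).
PC1 = [57, 49, 41, 33, 25, 17, 9, 1, 58, 50, 42, 34, 26, 18,
       10, 2, 59, 51, 43, 35, 27, 19, 11, 3, 60, 52, 44, 36,
       63, 55, 47, 39, 31, 23, 15, 7, 62, 54, 46, 38, 30, 22,
       14, 6, 61, 53, 45, 37, 29, 21, 13, 5, 28, 20, 12, 4]

# Permuted choice 2: selects 48 of the 56 bits of the C and D halves.
PC2 = [14, 17, 11, 24, 1, 5, 3, 28, 15, 6, 21, 10,
       23, 19, 12, 4, 26, 8, 16, 7, 27, 20, 13, 2,
       41, 52, 31, 37, 47, 55, 30, 40, 51, 45, 33, 48,
       44, 49, 39, 56, 34, 53, 46, 42, 50, 36, 29, 32]

# Number of left rotations applied to each half in rounds 1..16.
SHIFTS = [1, 1, 2, 2, 2, 2, 2, 2, 1, 2, 2, 2, 2, 2, 2, 1]


def permute(value, table, in_bits):
    """Build an integer from the bits of `value` listed in `table` (1 = leftmost bit)."""
    out = 0
    for pos in table:
        out = (out << 1) | ((value >> (in_bits - pos)) & 1)
    return out


def rotl28(x, n):
    """Rotate a 28-bit value left by n bits."""
    return ((x << n) | (x >> (28 - n))) & 0xFFFFFFF


def des_round_keys(key64):
    """Return the sixteen 48-bit round keys for a 64-bit DES key given as an integer."""
    cd = permute(key64, PC1, 64)
    c, d = cd >> 28, cd & 0xFFFFFFF
    keys = []
    for shift in SHIFTS:
        c, d = rotl28(c, shift), rotl28(d, shift)
        keys.append(permute((c << 28) | d, PC2, 56))
    return keys


# Hypothetical sample key, for illustration only.
for i, k in enumerate(des_round_keys(0x133457799BBCDFF1), start=1):
    print(f"K{i:02d} = {k:012X}")
```

Because the shift amounts sum to 28, the two halves return to their initial positions after the sixteenth round, so the last round key is obtained from the unrotated halves.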
Moment of the ratio and approximate MLEs of parameters in a bivariate Pareto distribution
Kim, Jungdae ;
Journal of the Korean Data and Information Science Society, volume 23, issue 6, 2012, Pages 1213~1222
DOI : 10.7465/jkdi.2012.23.6.1213
We derive the moment of the ratio Y/(X + Y) and the reliability P(X < Y), and then observe the skewness of the ratio in a bivariate Pareto density function of (X, Y). We also consider approximate MLEs of the parameters in the bivariate Pareto density function.
Generalized Kullback-Leibler information and its extensions to censored and discrete cases
Park, Sangun ;
Journal of the Korean Data and Information Science Society, volume 23, issue 6, 2012, Pages 1223~1229
DOI : 10.7465/jkdi.2012.23.6.1223
In this paper, we propose a generalized Kullback-Leibler (KL) information for measuring the distance between two distribution functions where the extension to the censored case is immediate. The generalized KL information has the nonnegativity and characterization properties, and its censored version has the additional property of monotonic increase. We also extend the discussion to the discrete case and propose a generalized censored measure which is comparable to Pearson's chi-square statistic.
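For orientation, the ordinary discrete KL information that such generalizations start from can be sketched as follows. This is the classical definition, not the generalized or censored measure proposed in the paper; the function name and the example distributions are hypothetical.

```python
import math


def kl_information(p, q):
    """Classical discrete Kullback-Leibler information sum_i p_i * log(p_i / q_i).

    Assumes p and q are probability vectors of equal length and that
    q_i > 0 wherever p_i > 0; terms with p_i == 0 contribute zero.
    """
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)


# Hypothetical distributions, for illustration only.
p = [0.5, 0.5]
q = [0.25, 0.75]
print(kl_information(p, q))
print(kl_information(p, p))
```

The second call illustrates the two properties mentioned in the abstract for the classical case: the measure is nonnegative, and it equals zero when the two distributions coincide.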
Kernel Poisson regression for mixed input variables
Shim, Jooyong ;
Journal of the Korean Data and Information Science Society, volume 23, issue 6, 2012, Pages 1231~1239
DOI : 10.7465/jkdi.2012.23.6.1231
An estimating procedure is introduced for kernel Poisson regression when the input variables consist of numerical and categorical variables, which is based on the penalized negative log-likelihood and the component-wise product of two different types of kernel functions. The proposed procedure provides the estimates of the mean function of the response variables, where the canonical parameter is linearly and/or nonlinearly related to the input variables. Experimental results are then presented which indicate the performance of the proposed kernel Poisson regression.
Bayesian inference in finite population sampling under measurement error model
Goo, You Mee ; Kim, Dal Ho ;
Journal of the Korean Data and Information Science Society, volume 23, issue 6, 2012, Pages 1241~1247
DOI : 10.7465/jkdi.2012.23.6.1241
The paper considers empirical Bayes (EB) and hierarchical Bayes (HB) predictors of the finite population mean under a linear regression model with measurement errors. We discuss how to calculate the mean squared prediction errors of the EB predictors using jackknife methods and the posterior standard deviations of the HB predictors based on Markov chain Monte Carlo methods. A simulation study is provided to illustrate the results of the preceding sections and compare the performances of the proposed procedures.
Estimation for generalized half logistic distribution based on records
Seo, Jung-In ; Lee, Hwa-Jung ; Kan, Suk-Bok ;
Journal of the Korean Data and Information Science Society, volume 23, issue 6, 2012, Pages 1249~1257
DOI : 10.7465/jkdi.2012.23.6.1249
In this paper, we derive maximum likelihood estimators (MLEs) and approximate MLEs (AMLEs) of the unknown parameters in a generalized half logistic distribution when the data are upper record values. As an illustration, we examine the validity of our estimation using real data and simulated data. Finally, we compare the proposed estimators in terms of the mean squared error (MSE) through a Monte Carlo simulation for various numbers of record values.
A GGQS-based hybrid algorithm for inter-cloud time-critical event dissemination
Bae, Ihn-Han ;
Journal of the Korean Data and Information Science Society, volume 23, issue 6, 2012, Pages 1259~1269
DOI : 10.7465/jkdi.2012.23.6.1259
Cloud computing has rapidly become a new infrastructure for organizations to reduce their capital cost in IT investment and to develop planetary-scale distributed applications. One of the fundamental challenges in geographically distributed clouds is to provide efficient algorithms for supporting inter-cloud data management and dissemination. In this paper, we propose a geographic group quorum system (GGQS)-based hybrid algorithm for improving inter-cloud interoperability in time-critical event dissemination services, such as computing policy updating, message sharing, event notification and so forth. The proposed algorithm first organizes the distributed clouds into a geographic group quorum overlay to support a constant event dissemination latency. Then it uses a hybrid protocol that combines geographic group-based broadcast with quorum-based multicast. Our numerical results show that the GGQS-based hybrid algorithm improves efficiency compared with the Chord-based, Plume and GQS-based algorithms.
Estimation in the exponential distribution under progressive Type I interval censoring with semi-missing data
Shin, Hyejung ; Lee, Kwangho ;
Journal of the Korean Data and Information Science Society, volume 23, issue 6, 2012, Pages 1271~1277
DOI : 10.7465/jkdi.2012.23.6.1271
In this paper, we propose a method for estimating the parameter of an exponential distribution based on a progressive Type I interval censored sample with semi-missing observations. The maximum likelihood estimator (MLE) of the parameter cannot be obtained explicitly because the intervals are not equal in length under the progressive Type I interval censored sample with semi-missing data. To obtain the MLE of the parameter for this sampling scheme, we propose a method by which the progressive Type I interval censored sample with semi-missing data is converted to a progressive Type II interval censored sample. Consequently, the estimation procedures for the progressive Type II interval censored sample can be applied, and we obtain the MLEs of the parameter and the survival function. It is shown that the obtained estimators perform well in terms of the mean square error (MSE) and the mean integrated square error (MISE).
A note on the sample size determination of sequential and multistage procedures
Choi, Kiheon ;
Journal of the Korean Data and Information Science Society, volume 23, issue 6, 2012, Pages 1279~1287
DOI : 10.7465/jkdi.2012.23.6.1279
We place particular emphasis on how to determine the number of replications with sequential and multistage procedures. The t-test is used to achieve a predetermined level of accuracy efficiently with a loss function in the case of the normal, chi-squared, and exponential distributions. The procedures considered are the sequential procedure, the two-stage procedure, the modified two-stage procedure, the three-stage procedure, and the accelerated sequential procedure. A Monte Carlo simulation is carried out to obtain the stopping sample size that minimizes the risk.
A compound Poisson risk model with variable premium rate
Song, Mi Jung ; Kim, Jongwoo ; Lee, Jiyeon ;
Journal of the Korean Data and Information Science Society, volume 23, issue 6, 2012, Pages 1289~1297
DOI : 10.7465/jkdi.2012.23.6.1289
We consider a general compound Poisson risk model in which the premium rate is surplus dependent. We analyze the joint distribution of the surplus immediately before ruin, the deficit at ruin and the time of ruin by solving the integro-differential equation for the Gerber-Shiu discounted penalty function.
Default Bayesian hypothesis testing for the scale parameters in nonregular Pareto distributions
Kang, Sang Gil ;
Journal of the Korean Data and Information Science Society, volume 23, issue 6, 2012, Pages 1299~1308
DOI : 10.7465/jkdi.2012.23.6.1299
This article deals with the problem of testing the equality of the scale parameters in nonregular Pareto distributions. We propose Bayesian hypothesis testing procedures for the equality of the scale parameters under the noninformative prior. The noninformative prior is usually improper, which yields a calibration problem in which the Bayes factor is defined only up to a multiplicative constant. We therefore propose default Bayesian hypothesis testing procedures based on the fractional Bayes factor and the intrinsic Bayes factors under the reference priors. A simulation study and a real data example are provided.
Stochastic precipitation modeling based on Korean historical data
Kim, Yongku ; Kim, Hyeonjeong ;
Journal of the Korean Data and Information Science Society, volume 23, issue 6, 2012, Pages 1309~1317
DOI : 10.7465/jkdi.2012.23.6.1309
Stochastic weather generators are commonly used to simulate time series of daily weather, especially precipitation amount. Recently, a generalized linear model (GLM) has been proposed as a convenient approach to fitting these weather generators. In this paper, a stochastic weather generator is considered to model the time series of daily precipitation in Seoul, South Korea. As a covariate, global temperature is introduced to relate a long-term temporal scale predictor to short-term temporal predictands. One of the limitations of stochastic weather generators is a marked tendency to underestimate the observed interannual variance of monthly, seasonal, or annual total precipitation. To reduce this phenomenon, we incorporate time series of seasonal total precipitation in the GLM weather generator as covariates. It is verified that the addition of these covariates does not distort the performance of the weather generator in other respects.
Estimating the determinants of victory and defeat through analyzing records of Korean pro-basketball [Author's correction]
Kim, Sae Hyung ;
Journal of the Korean Data and Information Science Society, volume 23, issue 6, 2012, Pages 1319~1319
DOI : 10.7465/jkdi.2012.23.6.1319