Go to the main menu
Skip to content
Go to bottom
REFERENCE LINKING PLATFORM OF KOREA S&T JOURNALS
> Journal Vol & Issue
Journal of the Korean Data and Information Science Society
Journal Basic Information
Journal DOI :
Korean Data and Information Science Society
Editor in Chief :
Volume & Issues
Volume 27, Issue 4 - Jul 2016
Volume 27, Issue 3 - May 2016
Volume 27, Issue 2 - Mar 2016
Volume 27, Issue 1 - Jan 2016
Selecting the target year
Study on abnormal behavior prediction models using flexible multi-level regression
Jung, Yu Jin ; Yoon, Yong Ik ;
Journal of the Korean Data and Information Science Society, volume 27, issue 1, 2016, Pages 1~8
DOI : 10.7465/jkdi.2016.27.1.1
In the recently, violent crime and accidental crime has been generated continuously. Consequently, people anxiety has been heightened. The Closed Circuit Television (CCTV) has been used to ensure the security and evidence for the crimes. However, the video captured from CCTV has being used in the post-processing to apply to the evidence. In this paper, we propose a flexible multi-level models for estimating whether dangerous behavior and the environment and context for pedestrians. The situation analysis builds the knowledge for the pedestrians tracking. Finally, the decision step decides and notifies the threat situation when the behavior observed object is determined to abnormal behavior. Thereby, tracking the behavior of objects in a multi-region, it can be seen that the risk of the object behavior. It can be predicted by the behavior prediction of crime.
Bayesian analysis of insurance risk model with parameter uncertainty
Cho, Jaerin ; Ji, Hyesu ; Lee, Hangsuck ;
Journal of the Korean Data and Information Science Society, volume 27, issue 1, 2016, Pages 9~18
DOI : 10.7465/jkdi.2016.27.1.9
In the Heckman-Meyers model, which is frequently referred by IAA, Swiss Solvency Test, EU Solvency II, the assumption of parameter distribution is key factor. While in theory Bayesian analysis somewhat reflects parameter uncertainty using prior distribution, it is often the case where both Heckman-Meyers and Bayesian are necessary to better manage the parameter uncertainty. Therefore, this paper proposes the use of Bayesian H-M CRM, a combination of Heckman-Meyers model and Bayesian, and analyzes its efficiency.
Analysis of the 2015 reform plan of government employees pension system (GEPS) through monte carlo simulations
Lee, Jieun ; Song, Seongjoo ;
Journal of the Korean Data and Information Science Society, volume 27, issue 1, 2016, Pages 19~32
DOI : 10.7465/jkdi.2016.27.1.19
Due to the increasing fiscal burden and structural unbalanced premium/benefit costs, the new reform on the government employees pension system (GEPS) was considered even after the recent reform in 2009. This article examines the various effects of recent amendment in 2015 on GEPS using a simple probabilistic model. We consider effects on both sides, the pensioners and the government. First of all, the expected net value of pension payment for an individual employee was calculated based on the supposed survival distribution. The fairness of individual pension holders was compared using the benefit-cost ratio. Secondly, from pension system users` point of view, the default probability and the government subsidy were examined by Monte-carlo simulation. From the simulation experiment, we could see that the 2015 reform plan indeed reduces the default probability and the size of the fiscal burden of government by increasing the premium and decreasing the benefit. However, the size of the effect is not very standout at this moment because the number of new employees who are fully subject to the reform will be much smaller than the number of previous employees for a while. Thus, the effect of the reform is expected to appear in a slow manner.
Standardized polytomous discrimination index using concordance
Choi, Jin Soo ; Hong, Chong Sun ;
Journal of the Korean Data and Information Science Society, volume 27, issue 1, 2016, Pages 33~44
DOI : 10.7465/jkdi.2016.27.1.33
There are many situations that the outcome for clinical decision and credit assessment should be predicted more than two categories. Five kinds of statistics which are used the concordance are proposed and used for these polytomous problems. However, these statistics are defined without exact distinction of categories, so that we have difficulty to use both the pair and set approaches and it is hard to understand the meanings of these statistics. Hence, it is not possible to compare and analyze them. In this paper, the polytomous confusion matrix is standardized and the concordance statistic can be represented based on the confusion matrix. The five kinds of statistics by using the concordance are defined. With the methods proposed in this paper, we could not only explain their meanings but also compare and analyze these statistics. Based on various data sets, properties of these five statistics are explored and explained.
Comparative study of working conditions of Korea and Europe
Kim, Youngsun ; Yang, Wanyoun ; Baik, Jaiwook ;
Journal of the Korean Data and Information Science Society, volume 27, issue 1, 2016, Pages 45~65
DOI : 10.7465/jkdi.2016.27.1.45
Global competition and unstable economic situation in recent years have brought in adverse working conditions such as proliferation of temporary or part-time workers and new shift system. In this study we use exploratory data analysis method to find the relationship between some of the important variables which are related to working hours in KWCS (Korean Working Conditions Survey) and EWCS (European Working Conditions Survey). Next, working hours are explored in terms of sex, employment type, occupation and industry using 2006, 2010, 2011, 2014 KWCS data as well as 2010 EWCS data. Lastly, respondents are divided into two groups of healthy and nonhealthy workers and their working hours are explored in terms of the same categories as above. Finally, simple regression analysis is used to find the effect of health on weekly working hours and adjustment of the effect of health on weekly working hours is made using multiple regression analysis with some other independent variable such as sex, age and employment type included in the model.
The statistical factors affecting the freezing of the road pavement
Kim, Hyun-Ji ; Lee, Jea-Young ; Kim, Byung-Doo ; Cho, Gyu-Tae ;
Journal of the Korean Data and Information Science Society, volume 27, issue 1, 2016, Pages 67~74
DOI : 10.7465/jkdi.2016.27.1.67
Due to the character of the climate of Korea, the pavement of a road is Influenced by freezing in winter season and thawing in thawing season. In the last few years, several articles have been devoted to the study to minimize the damage of freezing and thawing action. The purpose of this paper is to identify appropriacy of factors that influence road pavement thickness. We conduct the decision tree analysis on the field data of road pavement. The target variable is `Frost penetration`. This value was calculated from the temperature data. The input variables are `Region`, `Type of road pavement`, `Anti-frost layer`, `Month` and `Air temperature`. The region was divided into 9 regions by freezing index
. The type of road pavement has three-section such as area of cutting, boundary area of cutting and bankin, lower area of banking. As the result, the variables that influence `Frost penetration` are Month, followed by anti-frost layer, air temperature and region.
A proposal and analysis of finance evaluation indicators for actuarial review of the national pension
Lee, Hangsuck ; Shin, Seunghee ;
Journal of the Korean Data and Information Science Society, volume 27, issue 1, 2016, Pages 75~89
DOI : 10.7465/jkdi.2016.27.1.75
This study suggests both modification reserve ratio and cover rate for expenditure as new finance evaluation indicators. Firstly, modification reserve ratio is an evaluation indicator which shows how long can the accumulated reserves at the beginning of the year afford future benefit expenditure. Modification reserve ratio has an advantage both to present what the scale of annual accumulated reserves means and to know the exhaustion speed of accumulated fund through analyzing the trend of modification fund ratio. Secondly, this research classifies resources for expenditures as premium income and reserves, thereafter, presents cover rate for expenditure as finance evaluation indicator. We can know how premium income and reserves can cover future expenditure during the evaluation period, and how deficient are resources through these indices. The researcher anticipates this research to contribute to policy researches for financial stabilization of the national pension scheme.
FUZZY matching using propensity score: IBM SPSS 22 Ver.
Kim, So Youn ; Baek, Jong Il ;
Journal of the Korean Data and Information Science Society, volume 27, issue 1, 2016, Pages 91~100
DOI : 10.7465/jkdi.2016.27.1.91
Fuzzy matching is proposed to make propensities of two groups similar with their propensity scores and a way to select control variable to make propensity scores with a process that shows how to acquire propensity scores using logic regression analysis, is presented. With such scores, it was a method to obtain an experiment group and a control group that had similar propensity employing the Fuzzy Matching. In the study, it was proven that the two groups were the same but with a different distribution chart and standardization which made edge tolerance different and we realized that the number of chosen cases decreased when the edge tolerance score became smaller. So with the idea, we were able to determine that it is possible to merge groups using fuzzy matching without a precontrol and use them when data (big data) are used while to check the pros and cons of Fuzzy Matching were made possible.
How depression affects girls who experienced violence in home or at school: Using mixed model
Min, Dae Kee ; Choi, Mi Kyung ;
Journal of the Korean Data and Information Science Society, volume 27, issue 1, 2016, Pages 101~110
DOI : 10.7465/jkdi.2016.27.1.101
Depression is a mental symptoms that accompany psychological symptoms such as anxiety, gloom, a sense of failure, helplessness as well as a physical symptoms that shows in the adaption process of stress. Depression may result in cognitive, emotional, and behavioral disorders in accordance with individual experience. In particular, the depression when growing lowers the concentration on studies of children and adolescents and also increases the violence and interpersonal problems causes a serious loss to individuals and society. This study analyzed the longitudinal data using mixed models to investigate how depression affects girls who experienced violence in home or at school.
Nonparametric estimation of the discontinuous variance function using adjusted residuals
Huh, Jib ;
Journal of the Korean Data and Information Science Society, volume 27, issue 1, 2016, Pages 111~120
DOI : 10.7465/jkdi.2016.27.1.111
In usual, the discontinuous variance function was estimated nonparametrically using a kernel type estimator with data sets split by an estimated location of the change point. Kang et al. (2000) proposed the Gasser-
type kernel estimator of the discontinuous regression function using the adjusted observations of response variable by the estimated jump size of the change point in
(1992). The adjusted observations might be a random sample coming from a continuous regression function. In this paper, we estimate the variance function using the Nadaraya-Watson kernel type estimator using the adjusted squared residuals by the estimated location of the change point in the discontinuous variance function like Kang et al. (2000) did. The rate of convergence of integrated squared error of the proposed variance estimator is derived and numerical work demonstrates the improved performance of the method over the exist one with simulated examples.
A redistribution model for spatially dependent Parrondo games
Lee, Jiyeon ;
Journal of the Korean Data and Information Science Society, volume 27, issue 1, 2016, Pages 121~130
DOI : 10.7465/jkdi.2016.27.1.121
An ansemble of N players arranged in a circle play a spatially dependent Parrondo game B. One player is randomly selected to play game B, which is based on the toss of a biased coin, with the amount of the bias depending on states of the selected player`s two nearest neighbors. The player wins one unit with heads and loses one unit with tails. In game A` the randomly chosen player transfers one unit of capital to another player who is randomly chosen among N - 1 players. Game A` is fair with respect to the ensemble`s total profit. The games are said to exhibit the Parrondo effect if game B is losing and the random mixture game C is winning and the reverse-Parrondo effect if game B is winning and the random mixture game C is losing. We compute the exact mean profits for games B and C by applying a state space reduction method with lumped Markov chains and we sketch the Parrondo and reverse-Parrondo regions for
Outlier detection and treatment in industrial sampling survey
Joo, Young Sun ; Cho, Gyo-Young ;
Journal of the Korean Data and Information Science Society, volume 27, issue 1, 2016, Pages 131~142
DOI : 10.7465/jkdi.2016.27.1.131
Outliers in surveys can have a large effect on estimates of totals. This is especially true in business surveys where the populations are drawn are typically skewed. In this paper, we discussed the practical development and implementation of methods to identify and deal with outliers. A detection method is based on quartile method and detected outlier is processed in various ways. The study examines two versions of winsorised estimators with three different cut-off thresholds for each one. For the simulation study, four types of weight transformation function have been considered.
Structural equation modeling on nurses` emotional labor including antecedents and consequences
Kim, Miyeon ; Choi, Heejung ;
Journal of the Korean Data and Information Science Society, volume 27, issue 1, 2016, Pages 143~154
DOI : 10.7465/jkdi.2016.27.1.143
This study was designed to construct and test structural equation modeling on nurses` emotional labor including antecedents and consequences based on the model of Morris and Feldman (1997). A total of 247 nurses working at secondary and tertiary hospitals in three provinces responded the questionnaires. Emotional labor was measured by frequency of emotional labor, attentiveness of required display rules and emotional dissonance. Routineness of job, job autotomy and organizational support were measured as antecedents; and emotional exhaustion, job satisfaction and organizational commitment were included as the consequences of nurses` emotional labor. Data were analyzed by SPSS 21.0 and AMOS 21.0. The model fitness indices for the hypothetical model showed NFI
Prevalence of allergic diseases and its related factors in Korean adolescents-Using data from the 2013 Korea youth risk behavior web-based survey
Han, Ji-Young ; Park, Hyeon-Sook ;
Journal of the Korean Data and Information Science Society, volume 27, issue 1, 2016, Pages 155~168
DOI : 10.7465/jkdi.2016.27.1.155
The purpose of this study was to investigate the prevalence of allergic diseases and its related factorsin Korean adolescents. The subjects were 72,435 middle and high school students. The data were analyzed by
test and multiple logistic regression analysis. The prevalence of asthma, allergic rhinitis, and atopic dermatitis was 9.3%, 30.6%, and 23.4%, respectively. According to multiple logistic regression analysis, asthma was associated with gender, age, economic state, obesity, stress, sleep satisfaction, secondhand smoking, drug use, other allergic diseases and physical activity. Allergic rhinitis was associated with age, economic state, residential area, type of residence, obesity, stress, sleep satisfaction, smoking experience, secondhand smoking, alcohol drinking, other allergic diseases and physical activity. Finally, atopic dermatitis was associated with gender, age, economic state, residential area, obesity, stress, other allergic diseases and physical activity. Conclusion: These results will provide database for prevention and management of allergic diseases.
A meta-analysis on flipped learning: Conditions for successful application and future research direction
Park, Esther O. ; Park, Jihyun ;
Journal of the Korean Data and Information Science Society, volume 27, issue 1, 2016, Pages 169~178
DOI : 10.7465/jkdi.2016.27.1.169
The purpose of this study is to rigorously review extant research on flipped learning that has been given a rising interests as a measure of securing the quality of higher education apart from the improvement of the curriculum. Colleges and universities are criticized for failing in equipping their customers with critical skills, which triggered them to review both curriculum and teaching learning method. In the middle of the teaching and learning method lies the flipped learning. For its relatively short history of research tackling the issues of flipped learning, now is the best time to review the extant studies on the topic to overview and suggest future directions in the field. With the goal in mind, this study restricted the journal articles reviewed to 20 and presented the results based on a conceptual framework derived from a rigorous literature review. Followings are policy suggestions and conditions for successful implementation of flipped learning.
Life satisfaction and self-esteem of children from low-income class : Testing mediation model of depression
Hong, Yeonran ; Jang, Gunja ; Choi, Cheungsook ;
Journal of the Korean Data and Information Science Society, volume 27, issue 1, 2016, Pages 179~189
DOI : 10.7465/jkdi.2016.27.1.179
The purpose of this study was to investigate the mediating effect of depression on the relationship between self-esteem and life satisfaction of children from low-income class. The subjects were 385 children from low-income class in two municipalities. As research methods, frequency, descriptive, correlation statistical analysis using SPSS 20.0 program was conducted. The hypothesized model was tested using structural equation model to identify that model fits best to the collected data. The analysis indicates that depression had direct and negative effects on the life satisfaction. Depression mediates partially the relationship between self-esteem and life satisfaction. This study provides theoretical and practical implications for increasing self-esteem had positive effects on decreasing depression and pressing for improvement of life satisfaction level of children from low-income class.
Effects of professional self-concept, self efficacy on the job satisfaction in general hospital nurses
Park, Eun Jin ; Han, Ji Young ; Jo, Na Young ;
Journal of the Korean Data and Information Science Society, volume 27, issue 1, 2016, Pages 191~201
DOI : 10.7465/jkdi.2016.27.1.191
The purpose of this study was to explore the correlation among professional self-concept, self-efficacy and job satisfaction and to identify the effect of professional self-concept and self-efficacy on job satisfaction in general hospital nurses. Questionnaires were completed by 156 nurses who were working for one general hospital in Busan. The mean score of professional self-concept and self-efficacy were
respectively. The mean score of job satisfaction was
. With respect to job satisfaction, the correlations were .325 (p<.001) for professional self-concept and .212 (p
The effect of oral breast milk on pain response of the neonates during heel lancing
Kim, Kyunghwa ; Park, Youngim ; Kim, Taeim ;
Journal of the Korean Data and Information Science Society, volume 27, issue 1, 2016, Pages 203~215
DOI : 10.7465/jkdi.2016.27.1.203
This study was performed in order to investigate the effect of breast milk on pain relief in newborn during heel lancing. Nonequivalent control group pretest-posttest design by double blinded experiment was used. 50 neonates were randomized to receive breast milk (experimental group, n
Estimation for misclassified data with ultra-high levels
Kang, Moonsu ;
Journal of the Korean Data and Information Science Society, volume 27, issue 1, 2016, Pages 217~223
DOI : 10.7465/jkdi.2016.27.1.217
Outcome misclassification is widespread in classification problems, but methods to account for it are rarely used. In this paper, the problem of inference with misclassified multinomial logit data with a large number of multinomial parameters is addressed. We have had a significant swell of interest in the development of novel methods to infer misclassified data. One simulation study is shown regarding how seriously misclassification issue occurs if the number of categories increase. Then, using the group lasso regression, we will show how the best model should be fitted for that kind of multinomial regression problems comprehensively.
Beyond gene expression level: How are Bayesian methods doing a great job in quantification of isoform diversity and allelic imbalance?
Oh, Sunghee ; Kim, Chul Soo ;
Journal of the Korean Data and Information Science Society, volume 27, issue 1, 2016, Pages 225~243
DOI : 10.7465/jkdi.2016.27.1.225
Thanks to recent advance of next generation sequencing techniques, RNA-seq enabled to have an unprecedented opportunity to identify transcript variants with isoform diversity and allelic imbalance (Anders et al., 2012) by different transcriptional rates. To date, it is well known that those features might be associated with the aberrant patterns of disease complexity such as tissue (Anders and Huber, 2010; Anders et al., 2012; Nariai et al., 2014) specific differential expression at isoform levels or tissue specific allelic imbalance in mal-functionality of disease processes, etc. Nevertheless, the knowledge of post-transcriptional modification and AI in transcriptomic and genomic areas has been little known in the traditional platforms due to the limitation of technology and insufficient resolution. We here stress the potential of isoform variability and allelic specific expression that are relevant to the abnormality of disease mechanisms in transcriptional genetic regulatory networks. In addition, we systematically review how robust Bayesian approaches in RNA-seq have been developed and utilized in this regard in the field.
A Bayesian model for two-way contingency tables with nonignorable nonresponse from small areas
Woo, Namkyo ; Kim, Dal Ho ;
Journal of the Korean Data and Information Science Society, volume 27, issue 1, 2016, Pages 245~254
DOI : 10.7465/jkdi.2016.27.1.245
Many surveys provide categorical data and there may be one or more missing categories. We describe a nonignorable nonresponse model for the analysis of two-way contingency tables from small areas. There are both item and unit nonresponse. One approach to analyze these data is to construct several tables corresponding to missing categories. We describe a hierarchical Bayesian model to analyze two-way categorical data from different areas. This allows a "borrowing of strength" of the data from larger areas to improve the reliability in the estimates of the model parameters corresponding to the small areas. Also we use a nonignorable nonresponse model with Bayesian uncertainty analysis by placing priors in nonidentifiable parameters instead of a sensitivity analysis for nonidentifiable parameters. We use the griddy Gibbs sampler to fit our models and compute DIC and BPP for model diagnostics. We illustrate our method using data from NHANES III data on thirteen states to obtain the finite population proportions.
Tree size determination for classification ensemble
Choi, Sung Hoon ; Kim, Hyunjoong ;
Journal of the Korean Data and Information Science Society, volume 27, issue 1, 2016, Pages 255~264
DOI : 10.7465/jkdi.2016.27.1.255
Classification is a predictive modeling for a categorical target variable. Various classification ensemble methods, which predict with better accuracy by combining multiple classifiers, became a powerful machine learning and data mining paradigm. Well-known methodologies of classification ensemble are boosting, bagging and random forest. In this article, we assume that decision trees are used as classifiers in the ensemble. Further, we hypothesized that tree size affects classification accuracy. To study how the tree size in uences accuracy, we performed experiments using twenty-eight data sets. Then we compare the performances of ensemble algorithms; bagging, double-bagging, boosting and random forest, with different tree sizes in the experiment.
Moments of the ruin time and the total amount of claims until ruin in a diffusion risk process
Kim, Jihoon ; Ahn, Soohan ;
Journal of the Korean Data and Information Science Society, volume 27, issue 1, 2016, Pages 265~274
DOI : 10.7465/jkdi.2016.27.1.265
In this paper, we consider a diffusion risk process, in which, its surplus process behaves like a Brownian motion in-between adjacent epochs of claims. We assume that the claims occur following a Poisson process and their sizes are independent and exponentially distributed with the same intensity. Our main goal is to derive the exact formula of the joint moment generating function of the ruin time and the total amount of aggregated claim sizes until ruin in the diffusion risk process. We also provide a method for computing the related first and second moments using the joint moment generating function and the augmented matrix exponential function.