REFERENCE LINKING PLATFORM OF KOREA S&T JOURNALS
Journal of the Korean Data and Information Science Society
Journal Basic Information
Korean Data and Information Science Society
Volume 26, Issue 4 - Jul 2015
Stock investment with a redistribution model of the history-dependent Parrondo game
Jin, Geonjoo ; Lee, Jiyeon ;
Journal of the Korean Data and Information Science Society, volume 26, issue 4, 2015, Pages 781~790
DOI : 10.7465/jkdi.2015.26.4.781
The Parrondo paradox is a counter-intuitive phenomenon: combining two losing games can yield a winning game, and combining two winning games can yield a losing one. In this paper, we assume that an investor adopts the rule of the history-dependent Parrondo game for investment in the stock market. Using KRX (Korea Exchange) data from 2012 to 2014, we found the Parrondo paradox in stock trading: the redistribution of profits among accounts can turn a decrease in the expected cumulative profit into an increase. We also found that the opposite case, namely the reverse Parrondo effect, can happen in stock trading.
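The paradox itself can be checked numerically. The sketch below does not reproduce the paper's history-dependent redistribution model; it swaps in the original capital-dependent Parrondo games, whose win probabilities (1/2, 1/10, 3/4, each reduced by a small bias) are the textbook values and are an assumption here. It computes the long-run expected gain per round from the stationary distribution of the capital modulo 3.

```python
def stationary(win_probs, iterations=10_000):
    """Stationary distribution of (capital mod 3) by power iteration."""
    n = len(win_probs)
    dist = [1.0 / n] * n
    for _ in range(iterations):
        new = [0.0] * n
        for state, p in enumerate(win_probs):
            new[(state + 1) % n] += dist[state] * p          # win: capital + 1
            new[(state - 1) % n] += dist[state] * (1.0 - p)  # loss: capital - 1
        dist = new
    return dist


def drift(win_probs):
    """Long-run expected gain per round (+1 for a win, -1 for a loss)."""
    dist = stationary(win_probs)
    return sum(pi * (2.0 * p - 1.0) for pi, p in zip(dist, win_probs))


EPS = 0.005  # small bias that makes each game losing on its own (assumed value)

# Game A: a slightly unfair coin, the same in every state.
game_a = [0.5 - EPS] * 3
# Game B: a bad coin when capital is a multiple of 3, a good coin otherwise.
game_b = [0.1 - EPS, 0.75 - EPS, 0.75 - EPS]
# Choosing A or B with probability 1/2 each round averages the win probabilities.
mixed = [(a + b) / 2.0 for a, b in zip(game_a, game_b)]

drift_a = drift(game_a)
drift_b = drift(game_b)
drift_mixed = drift(mixed)

print(f"game A alone: {drift_a:+.4f} per round")
print(f"game B alone: {drift_b:+.4f} per round")
print(f"random mix  : {drift_mixed:+.4f} per round")
```

Both games lose when played alone, while the random mixture has a positive expected gain, which is the effect the abstract describes for redistributed trading accounts.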
A study on the slope sign test for explosive autoregressive models
Ha, Jeongcheol ; Jung, Jong Mun ;
Journal of the Korean Data and Information Science Society, volume 26, issue 4, 2015, Pages 791~799
DOI : 10.7465/jkdi.2015.26.4.791
Under the random walk hypothesis, we assume that the current change of a financial time series is independent of past values. This is interpreted as the existence of a unit root in ARMA models, and much research has focused on whether [...] < 1 or not. If some financial data are generated from an explosive autoregressive model, the chance of a bubble economy increases, and we have to find its symptoms in advance. Since some well-known parameter estimators contain the parameter itself and other statistics are constructed under a specific parameter structure assumption, they are difficult to adopt. In this paper we investigate a test for explosive autoregressive models using slope signs. We examined the properties of the slope sign test statistic under both independent error and correlated error conditions, mainly by simulations.
An improvement plan of mathematics academic achievement in the H/W curriculum
Lee, Seung-Woo ;
Journal of the Korean Data and Information Science Society, volume 26, issue 4, 2015, Pages 801~812
DOI : 10.7465/jkdi.2015.26.4.801
The purpose of this study is to increase interest in the mathematics-related curriculum and improve the mathematics academic achievement of H/W majors. First, this paper proposes an educational case study that develops the learner's ability, increases interest, and raises the employment rate in mathematics-related fields that are unfavorable for H/W majors. Second, this paper presents a future teaching method for the mathematics-related curriculum that applies computer programming to actual teaching and learning in the H/W field. Lastly, this paper suggests a promising pedagogical method for mathematics, using statistical analyses of academic achievements and surveys, in order to present an effective way of conducting mathematics education.
An estimation method for stochastic reaction model
Choi, Boseung ;
Journal of the Korean Data and Information Science Society, volume 26, issue 4, 2015, Pages 813~826
DOI : 10.7465/jkdi.2015.26.4.813
This research deals with an estimation method for the kinetic reaction model. The kinetic reaction model explains a spreading or changing process based on interactions between species in biochemistry. It can be applied to disease spreading as well as to systems biology. In this research, we assumed that the spread of species is stochastic and constructed the reaction model based on stochastic movement. We utilized the Gillespie algorithm to construct the likelihood function. We introduced a Bayesian estimation method using Markov chain Monte Carlo methods that produces more stable results. We applied the Bayesian estimation method to the Lotka-Volterra model and a gene transcription model and obtained more stable estimation results.
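As an illustration of the stochastic reaction model, the following is a minimal sketch of Gillespie's direct method for a Lotka-Volterra system. It only simulates forward in time and does not implement the likelihood construction or the Bayesian estimation of the paper. The three reactions, the rate constants c1, c2, c3, the initial counts, and the function name are assumptions chosen for illustration.

```python
import random


def gillespie_lotka_volterra(prey, predators, c1, c2, c3, t_max, seed=0):
    """Simulate one path of a stochastic Lotka-Volterra model.

    Assumed reactions (hazard in parentheses):
      prey birth      X -> 2X      (c1 * X)
      predation       X + Y -> 2Y  (c2 * X * Y)
      predator death  Y -> 0       (c3 * Y)
    Returns a list of (time, prey, predators) tuples.
    """
    rng = random.Random(seed)
    t = 0.0
    path = [(t, prey, predators)]
    while True:
        rates = [c1 * prey, c2 * prey * predators, c3 * predators]
        total = sum(rates)
        if total == 0.0:
            break  # both species are extinct, nothing can happen
        wait = rng.expovariate(total)  # exponential time to the next reaction
        if t + wait > t_max:
            break
        t += wait
        u = rng.random() * total  # pick a reaction with probability rate/total
        if u < rates[0]:
            prey += 1
        elif u < rates[0] + rates[1]:
            prey -= 1
            predators += 1
        else:
            predators -= 1
        path.append((t, prey, predators))
    return path


path = gillespie_lotka_volterra(50, 100, c1=1.0, c2=0.005, c3=0.6, t_max=5.0, seed=1)
print(f"{len(path) - 1} reactions simulated; final state: {path[-1]}")
```

Each waiting time is exponential with rate equal to the total hazard, and the reaction that fires is chosen in proportion to its own hazard. A likelihood for observed paths can be built from these same two ingredients.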
Comparison of journal clustering methods based on citation structure
Kim, Jinkwang ; Kim, Sohyung ; Oh, Changhyuck ;
Journal of the Korean Data and Information Science Society, volume 26, issue 4, 2015, Pages 827~839
DOI : 10.7465/jkdi.2015.26.4.827
Extraction of communities from a journal citation database by the citation structure is a useful tool to see closely related groups of journals. SCI of Thomson Reuters and SCOPUS of Elsevier have tried to grasp the community structure of the journals in their indices according to citation relationships, but no such attempt has yet been made with the Korean Citation Index, KCI. Therefore, in this study, we extracted communities of the journals in the natural science area of KCI, using various clustering algorithms for a social network based on citations among the journals, and compared the groups obtained with the KCI classification. The infomap algorithm, one of the clustering methods applied in this article, showed the best grouping result in the sense that the groups it obtained are closer to the KCI classification than those of the other algorithms considered and reflect the citation structure of the journals well. The classification results obtained in this study might be taken into consideration when the KCI journals are reclassified in the future.
A study on cluster and positioning of domestic electronic commerce based on purchasing motivation
Jeong, Dong Bin ;
Journal of the Korean Data and Information Science Society, volume 26, issue 4, 2015, Pages 841~856
DOI : 10.7465/jkdi.2015.26.4.841
Thirteen types of business and sixteen administrative districts in Korea are categorized and segmented based on their similarities and visually plotted on multidimensional space. The similarities are determined by five characteristics of quantitative evaluation (simplified process of trading, reduced price, direct contact with supplier, faster process of trading, et cetera). Hence, domestic types of business and administrative districts can be categorized into certain clusters. Also, forms and characteristics of types of business and administrative districts can be evaluated between and within the clusters.
Bounds of PIM-based similarity measures with partially marginal proportion
Park, Hee Chang ;
Journal of the Korean Data and Information Science Society, volume 26, issue 4, 2015, Pages 857~864
DOI : 10.7465/jkdi.2015.26.4.857
According to Wikipedia, data mining is the computational process of discovering patterns in huge data sets, involving methods at the intersection of association rules, decision trees, clustering, artificial intelligence, and machine learning. Clustering or cluster analysis is the task of grouping a set of objects in such a way that objects in the same group are more similar to each other than to those in other groups. The similarity measures used in clustering may be classified into various types depending on the characteristics of the data. In this paper, we computed bounds for similarity measures based on the probabilistic interestingness measure with partially marginal probability, such as the Peirce I, Peirce II, Cole I, Cole II, Loevinger, Park I, and Park II measures. We confirmed that the absolute value of the Loevinger measure was the upper limit of the absolute value of any of the other existing measures. The ordering of the other measures is determined by the size of the concurrence proportion, the non-simultaneous occurrence proportion, and the mismatch proportion.
Comprehensive evaluation of baseball player's offensive ability by use of simulation
Kim, Nam Ki ; Kim, Sun Ho ;
Journal of the Korean Data and Information Science Society, volume 26, issue 4, 2015, Pages 865~874
DOI : 10.7465/jkdi.2015.26.4.865
This research is to comprehensively evaluate offensive abilities of baseball players who are expected to produce as many runs as possible by their hitting and running. To this end, we establish a simulation program to obtain the so-called scoring index of an individual player. The scoring index of a player is defined as an expected number of runs scored by an imaginary team that is composed of nine copies of the player. As a simulation input, we use 2014 season data of Korean pro-baseball. As a result, we present the scoring indices of top 10 players, 9 Korean pro-baseball teams, and overall 2014 season. The scoring index can serve as a comprehensive evaluation of offensive ability of a player or a team, selection of players for a (national) team or for a starting line-up, estimation of player's worth, and so on.
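The definition of the scoring index can be made concrete with a small Monte Carlo sketch. The code below is not the authors' simulation program: the per-plate-appearance probabilities are hypothetical, and the base-running rules (runners advance exactly as many bases as the batter, walks only force runners) are simplifying assumptions.

```python
import random

BASES_GAINED = {"single": 1, "double": 2, "triple": 3, "homer": 4}


def simulate_game(probs, rng, innings=9):
    """Runs scored in one game by a line-up of nine copies of one player."""
    outcomes = list(probs)
    weights = [probs[o] for o in outcomes]
    runs = 0
    for _ in range(innings):
        outs = 0
        bases = [False, False, False]  # first, second, third
        while outs < 3:
            event = rng.choices(outcomes, weights)[0]
            if event == "out":
                outs += 1
            elif event == "walk":
                # Only forced runners advance on a walk.
                if bases[0] and bases[1] and bases[2]:
                    runs += 1
                elif bases[0] and bases[1]:
                    bases[2] = True
                elif bases[0]:
                    bases[1] = True
                bases[0] = True
            else:
                n = BASES_GAINED[event]
                # Move the lead runner first so runners never collide.
                for base in (2, 1, 0):
                    if bases[base]:
                        bases[base] = False
                        if base + n >= 3:
                            runs += 1
                        else:
                            bases[base + n] = True
                if n == 4:
                    runs += 1  # the batter scores on a home run
                else:
                    bases[n - 1] = True
    return runs


def scoring_index(probs, games=200, seed=0):
    """Average runs per game over many simulated games."""
    rng = random.Random(seed)
    return sum(simulate_game(probs, rng) for _ in range(games)) / games


# Hypothetical player profile; the probabilities sum to 1.
example_player = {"out": 0.68, "walk": 0.08, "single": 0.16,
                  "double": 0.05, "triple": 0.005, "homer": 0.025}
print(f"scoring index: {scoring_index(example_player):.2f} runs per game")
```

Replacing the hypothetical profile with a player's observed season frequencies gives an index that is comparable across players, because every simulated team faces the same rules.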
A study on differences by gender in influence of recognition, usage, convenience and preference on purchase intention for traditional Korean rice cake
Lee, Mi Ock ;
Journal of the Korean Data and Information Science Society, volume 26, issue 4, 2015, Pages 875~883
DOI : 10.7465/jkdi.2015.26.4.875
The purpose of this study was to examine the differences by gender in the influence of recognition, usage, convenience and preference on purchase intention for traditional Korean rice cake. For this study, the effect of gender-based consumption patterns (degree of recognition, usage, convenience, and preference towards traditional Korean rice cake) on purchase intention was investigated among university students in the Busan area. Of the students surveyed, 47.5% were male and 52.5% female; 51.0% had a monthly household income of less than 5,000,000 won and 49.0% had 5,000,000 won or more; and 46.0% lived with their parent(s) whereas 54.0% lived by themselves. The findings showed that consumption patterns lie between 4.0 and 6.0 on a 7-point Likert-type scale. Differences by gender were found in all of the degree of recognition, usage, convenience, and preference.
Functional clustering for electricity demand data: A case study
Yoon, Sanghoo ; Choi, Youngjean ;
Journal of the Korean Data and Information Science Society, volume 26, issue 4, 2015, Pages 885~894
DOI : 10.7465/jkdi.2015.26.4.885
It is necessary to forecast electricity demand for reliable and effective operation of the power system. In this study, we try to categorize functional data, namely the mean curve of the daily power demand pattern over time. The data were collected between January 1, 2009 and December 31, 2011. They were converted to time series data consisting of seasonal components and an error component through log transformation and trend removal. The functional clustering of Ma et al. (2006) is applied, and parameters are estimated using the EM algorithm and generalized cross validation. The number of clusters is determined by classifying holidays or weekdays. The mean curves of the daily power demand pattern are described by day type (Monday, weekday (Tuesday to Friday), Saturday, Sunday or holiday) and by season.
Statistical analysis on a specific student group and their responses to sub-health questionnaire
Shin, Jae-Kyoung ;
Journal of the Korean Data and Information Science Society, volume 26, issue 4, 2015, Pages 895~906
DOI : 10.7465/jkdi.2015.26.4.895
Nowadays, many people have much interest in health and quality of life. They can get information on foods known to have remedial value for certain illnesses from many healthcare experts on TV health shows. Moreover, many studies on these foods and their remedial value are consistently published. On the grounds that knowing the prognosis of a certain disease in advance lets us lead a better life without suffering from it, this study statistically analyzes specific students' responses to a sub-health questionnaire. The analysis reveals that tests of equality by gender across the 10 diagnosis indexes show significant differences at the 5% significance level in 9 indexes, the exception being the respiratory system. Tests of equality by age across the 10 diagnosis indexes show no significant difference at the 5% significance level. In the study, students are divided into two age categories. However, people in their 20s could be regarded as one category, which could be the reason why there is no significant difference.
Projections of the high-school graduate in Daegu·Gyoungbook
Kim, Jongtae ;
Journal of the Korean Data and Information Science Society, volume 26, issue 4, 2015, Pages 907~914
DOI : 10.7465/jkdi.2015.26.4.907
The reduction in the number of students due to the low birth rate has brought about many changes in national education policies. The purpose of this study is to propose a method for estimating the number of students (the population), based on promotion or progression rates by age or grade, in order to estimate the number of students (the population) through 2032. We suggest the nth moving average proportional method and the weighted proportional moving average method as methods of population projection. We present the means and standard deviations of the measurement errors of the suggested methods obtained by Monte Carlo simulation. The predictions obtained in this study tended to be lower than the actual values.
(Co)heritability of acetone and β-hydroxybutyrate concentrations in raw milk related to ketosis in Holsteins
Cho, Kwang-Hyun ; Cho, Chung-Il ; Lee, Joon-Ho ; Park, Kyung-Do ;
Journal of the Korean Data and Information Science Society, volume 26, issue 4, 2015, Pages 915~921
DOI : 10.7465/jkdi.2015.26.4.915
This experiment was conducted to estimate the heritability and coheritability of daily milk yield, acetone and β-hydroxybutyrate (BHBA) concentrations in raw milk. The average concentrations of acetone and BHBA were [...], respectively, and the differences between high group and low group cows were highly significant (p < 0.01). The estimates of heritability of daily milk yield, acetone and BHBA concentrations were in the range of 0.18~0.21, 0.11~0.13 and 0.01~0.02, respectively. The estimate of heritability of [...] acetone did not change much, while that of [...] increased to 0.03~0.04. The estimates of phenotypic and genetic correlation coefficients between acetone and BHBA were 0.44 and 0.48, respectively. In the low milk yield group, the coheritability estimates of BHBA and [...] when selection was for daily milk yield were 0.26 and 0.32, respectively. These were higher than the coheritability estimate of acetone when selection was for daily milk yield. The same trend was noted in the coheritability estimates from the whole records using both high and low milk yield groups together. BHBA concentration seemed to respond more effectively than acetone concentration when selection was for daily milk yield.
A study on a multi-stage random tournament competition system and its fairness
Lee, Kee-Won ; Lee, Jung Soon ; Sim, Songyong ;
Journal of the Korean Data and Information Science Society, volume 26, issue 4, 2015, Pages 923~930
DOI : 10.7465/jkdi.2015.26.4.923
There exist many competition systems to determine a winner. Many sports games use the 1-in-2 tournament or its modified version to determine a winner. In this paper, we propose a competition system that can be used when there are many candidates and many random referees to evaluate the candidates. These competitions can be found in the cyber space where many users score many competing apps. We study fairness of our proposed competing system called a multi-stage random tournament in terms of equal probabilities. We also formulate the influence factor of a specific referee under some specific conditions.
Analysis of mortality after death of spouse in relation to duration of bereavement and dependence relation between married couple -using married couples data from survivor's pension of National Pension Service-
Baek, HyeYoun ; Han, Jeonglim ; Lee, Hangsuck ;
Journal of the Korean Data and Information Science Society, volume 26, issue 4, 2015, Pages 931~946
DOI : 10.7465/jkdi.2015.26.4.931
Many multiple life insurance products consider benefits that are contingent on the combined survival status of two lives. To value premiums of these insurance products accurately, we need to consider the impact of the survivorship of one life on another. To show the dependence relation within married couples, we calculate correlation coefficients using married couples data from the National Pension Service, and the results show some positive dependence between spouses. Moreover, by analyzing deaths after bereavement, we find evidence that mortality rates increase after the death of a spouse and that this phenomenon, the broken-heart syndrome, diminishes over time. The results of this study can support a method of calculating the premium of multiple life insurance that reflects more realistic joint mortality rates.
Reliability of Delphi survey for traditional knowledge on agricultural resources
Lee, Ki Hoon ; Song, Mi-Jang ; Kim, Hyun ;
Journal of the Korean Data and Information Science Society, volume 26, issue 4, 2015, Pages 947~956
DOI : 10.7465/jkdi.2015.26.4.947
In the knowledge and information age, discovering and protecting intellectual property is very important because of its economic value as a major growth engine. This study evaluated the reliability of a Delphi survey conducted by experts to assess the value of agricultural resources knowledge obtained from literature reviews and field interviews. The Delphi method collects the opinions of experts repeatedly over several rounds; in each subsequent round the experts have a chance to modify their opinions. Scores between the two rounds are highly correlated, and standard deviations declined in the second round, which implies that some corrections to the evaluations were made. To check the reliability of the two-round Delphi survey, Cronbach's reliability coefficient and the Generalizability coefficient are derived. Cronbach's alpha supported the reliability of the method, but the Generalizability analysis revealed some unexpected results when checking the variance components of the sources of measurement error. Despite the increased reliability coefficients, the deviations between the raters increased, which means that additional rounds are required to reach consensus, the goal of Delphi research.
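For reference, Cronbach's alpha for k raters is k/(k-1) times one minus the ratio of the summed per-rater score variances to the variance of the row totals. The sketch below computes it on a small hypothetical score table; the data layout (one row per knowledge item, one column per expert) and the numbers are assumptions, not the survey data of the paper.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a table with one row per rated item
    and one column per rater (sample variances throughout)."""
    k = len(scores[0])
    columns = list(zip(*scores))
    rater_var = sum(variance(col) for col in columns)
    total_var = variance([sum(row) for row in scores])
    return k / (k - 1) * (1.0 - rater_var / total_var)


# Hypothetical second-round scores: 5 knowledge items rated by 3 experts.
round_two = [
    [4, 5, 4],
    [2, 3, 2],
    [5, 5, 4],
    [3, 3, 3],
    [1, 2, 2],
]
alpha = cronbach_alpha(round_two)
print(f"Cronbach's alpha: {alpha:.3f}")
```

Alpha approaches 1 when raters rank the items consistently. It does not by itself show that raters agree on the level of the scores, which is one reason a variance-component analysis can tell a different story.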
Analyzing records of Korean pro-basketball using general linear model
Kim, Sae Hyung ;
Journal of the Korean Data and Information Science Society, volume 26, issue 4, 2015, Pages 957~970
DOI : 10.7465/jkdi.2015.26.4.957
The purpose of this study was to analyze records of Korean pro-basketball using the general linear model (two-way ANOVA and hierarchical multiple regression analysis). The Korea Basketball League (KBL) provided the records (2014-2015 season) used in this study. Eight variables (TA, 2PA, 3PA, 2P, 3P, Ast, TFB, CH) were selected on the basis of content validity. The SPSS program was used to fit the general linear model. The alpha level was set at 0.05 for all tests. The major results were as follows. 3PA had a significant interaction effect between the victory & defeat variable and the home & away variable. For winning teams, 3PA was higher in home games than in away games, and the opposite held for losing teams. 2PA, AS, TFB, and CH were selected as significant variables affecting victory and defeat. In the hierarchical regression, Ast had a significant moderation effect between 3PA and TS. TFB also had a significant moderation effect between AS and 2P. The other constructs (Ast between 2PA and TS; TFB between AS and 3P) had no significant moderation effect. CH also had no significant moderation effect on the effect of 2PA, 3PA and Ast on TS.
Large tests of independence in incomplete two-way contingency tables using fractional imputation
Kang, Shin-Soo ; Larsen, Michael D. ;
Journal of the Korean Data and Information Science Society, volume 26, issue 4, 2015, Pages 971~984
DOI : 10.7465/jkdi.2015.26.4.971
Imputation procedures fill in missing values, thereby enabling complete data analyses. Fully efficient fractional imputation (FEFI) and multiple imputation (MI) create multiple versions of the missing observations, thereby reflecting uncertainty about their true values. Methods have been described for hypothesis testing with multiple imputation. Fractional imputation assigns weights to the observed data to compensate for missing values. The focus of this article is the development of tests of independence using FEFI for partially classified two-way contingency tables. Wald and deviance tests of independence under FEFI are proposed. Simulations are used to compare type I error rates and power. The partially observed marginal information is useful for estimating the joint distribution of cell probabilities, but it is not useful for testing association. FEFI compares favorably to other methods in simulations.
Add-on selling strategies in an online open market
Shim, Beomsoo ; Lee, Hanjun ;
Journal of the Korean Data and Information Science Society, volume 26, issue 4, 2015, Pages 985~995
DOI : 10.7465/jkdi.2015.26.4.985
Add-on selling can provide new chances to increase sellers' profits and meet customers' needs. Although prior studies have advocated add-on selling for its business value, there is an argument that add-on selling can cause customer repulsion. Therefore, we need to understand customer purchasing pattern related to add-on selling in order to promote it and to mitigate the customer repulsion. To that end, we applied data mining techniques to the 24,925 transactions of data from an online open market in Korea. We then conducted feature selection to investigate the most influential factors that can explain the characteristics of add-on selling transactions using a classification model. We also identified association rules among add-on selling and promotions. Finally, based on the findings in our experiments, we proposed add-on selling strategies for the target online market.
Comparison of two sampling intervals and three sampling intervals VSI charts for monitoring both means and variances
Chang, Duk-Joon ;
Journal of the Korean Data and Information Science Society, volume 26, issue 4, 2015, Pages 997~1006
DOI : 10.7465/jkdi.2015.26.4.997
In industrial quality control, when engineers use a VSI control procedure, they should consider both the time required to signal and the switching behavior when the production process has changed. To date, many researchers have studied fixed sampling interval (FSI) charts and variable sampling interval (VSI) charts in terms of the average number of samples to signal (ANSS) and the average time to signal (ATS). However, ANSS and ATS do not provide any information on switching between the different sampling intervals of VSI schemes. In this study, the performances of the two sampling intervals VSI chart and the three sampling intervals VSI chart are evaluated and compared. The numerical results show that the ANSS and ATS values of the two charts are similar regardless of the amount of shift. However, in terms of switching behavior, including ANSW, the three sampling intervals VSI chart is less efficient than the two sampling intervals VSI chart.
The prevalence of hypertension in Haiti : A retrospective study
Park, Esther O. ; Jang, Gunja ;
Journal of the Korean Data and Information Science Society, volume 26, issue 4, 2015, Pages 1007~1015
DOI : 10.7465/jkdi.2015.26.4.1007
The purpose of this study was to explore the prevalence of hypertension in Haiti, to describe regional characteristics, and thus to speculate on Haitian-specific regional risk factors. This is a descriptive and comparative study. The subjects were 145 male and female Haitians aged between 30 and 59 who lived in rural areas and were screened for hypertension indicators. Data were analyzed with descriptive statistics, t-tests and ANOVA. More than four fifths of the subjects (82.8%) showed a pre-hypertensive or moderate to severe hypertensive condition. 65.5% of the total had abnormal pulse pressure ([...]) and 13.8% of the sample showed tachycardia (Heart [...]). Female Haitians tended to have higher diastolic blood pressure and faster heart rates than male Haitians. Residents of the Part 5-1 area, which has less access to drinking water, tended to have higher systolic and diastolic pressure. Hydration status was considered a strong factor affecting the distribution of hypertension, as evidenced by the different levels of access to drinking water.
An application of mutual information in mathematical statistics education
Yi, Seongbaek ; Jang, Dae-Heung ;
Journal of the Korean Data and Information Science Society, volume 26, issue 4, 2015, Pages 1017~1025
DOI : 10.7465/jkdi.2015.26.4.1017
In mathematical statistics education, we can use mutual information as a tool for evaluating the degree of dependency between two random variables. The ordinary correlation coefficient provides information only on linear dependency, not on any nonlinear relationship between two random variables. In this paper, as a measure of the degree of dependency between random variables, we suggest the use of symmetric uncertainty and [...], which are defined in terms of mutual information. They can also be considered as generalized correlation coefficients for both linear and non-linear dependence of random variables.
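A small worked example shows why mutual information picks up dependence that correlation misses. The sketch below uses the standard definition of symmetric uncertainty, 2 I(X;Y) / (H(X) + H(Y)), on a hypothetical joint distribution in which X is uniform on {-1, 0, 1} and Y = X squared; the function names are illustrative.

```python
from math import log2


def entropy(probs):
    """Shannon entropy in bits, skipping zero-probability cells."""
    return -sum(p * log2(p) for p in probs if p > 0)


def symmetric_uncertainty(joint):
    """2 * I(X;Y) / (H(X) + H(Y)) for a joint probability table
    with rows indexed by X and columns indexed by Y."""
    px = [sum(row) for row in joint]
    py = [sum(col) for col in zip(*joint)]
    mutual_info = sum(
        p * log2(p / (px[i] * py[j]))
        for i, row in enumerate(joint)
        for j, p in enumerate(row)
        if p > 0
    )
    return 2.0 * mutual_info / (entropy(px) + entropy(py))


# X uniform on {-1, 0, 1}, Y = X**2; rows are X = -1, 0, 1, columns are Y = 0, 1.
x_values, y_values = [-1, 0, 1], [0, 1]
joint = [[0.0, 1 / 3], [1 / 3, 0.0], [0.0, 1 / 3]]

# Covariance E[XY] - E[X]E[Y] is zero although Y is a function of X.
e_xy = sum(p * x * y for x, row in zip(x_values, joint) for y, p in zip(y_values, row))
e_x = sum(x * sum(row) for x, row in zip(x_values, joint))
e_y = sum(y * sum(col) for y, col in zip(y_values, zip(*joint)))
covariance = e_xy - e_x * e_y

su = symmetric_uncertainty(joint)
print(f"covariance = {covariance:.3f}, symmetric uncertainty = {su:.3f}")
```

The covariance is zero, so the correlation coefficient reports no relationship, while the symmetric uncertainty is well above zero. For an independent table it would be exactly 0, and for a one-to-one relationship it would be 1.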
A statistical quality control for the dispersion matrix
Jo, Jinnam ;
Journal of the Korean Data and Information Science Society, volume 26, issue 4, 2015, Pages 1027~1034
DOI : 10.7465/jkdi.2015.26.4.1027
A control chart is very useful in monitoring various production processes. There are many situations in which the simultaneous control of two or more related quality variables is necessary. When the joint distribution of the process variables is multivariate normal, multivariate Shewhart control charts using a function of the maximum likelihood estimator are considered for monitoring the dispersion matrix. The performances of the multivariate Shewhart control charts based on the proposed control statistic are evaluated in terms of average run length (ARL). The performance is investigated in three cases, where the variances, the covariances, and both the variances and covariances are changed, respectively. The numerical results show that, in terms of ARL, the proposed multivariate Shewhart control charts do not perform better than the control charts using the trace of the covariance matrix in Jeong and Cho (2012).