Journal of the Korean Data and Information Science Society
Korean Data and Information Science Society
Panel attrition factors in Korean Labor and Income Panel Study
Lee, Sang-Hyeop ; Park, Chan-Yong ; Chung, Sung-Suk ; Choi, Hye-Mi ;
Journal of the Korean Data and Information Science Society, volume 22, issue 1, 2011, Pages 1~8
In panel studies in which the same respondents are interviewed repeatedly over the long term, panel attrition can undermine the reliability of the results and the representativeness of the sample. In this article, we explore the risk factors for sample attrition in the first 11 waves of the Korean Labor and Income Panel Study (KLIPS) data, covering the years 1998-2008. We apply survival analysis techniques, namely the life-table method and the Cox proportional hazards model, treating each respondent's time to attrition as the survival time.
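The life-table method mentioned in this abstract can be sketched as follows. This is a minimal illustration, not the authors' analysis: the function name `life_table`, the record layout, and the toy data are assumptions, and the sketch uses the common actuarial convention of counting respondents censored in a wave as at risk for half of it.

```python
from collections import Counter

def life_table(records):
    """Actuarial life-table estimate over discrete waves.

    records: iterable of (wave, dropped) pairs. `wave` is the wave in which
    the respondent left the panel (dropped=True) or was censored, i.e. was
    still participating when observation ended (dropped=False).
    Returns a list of (wave, at_risk, hazard, survival) tuples.
    """
    records = list(records)
    events = Counter(w for w, dropped in records if dropped)
    censored = Counter(w for w, dropped in records if not dropped)
    at_risk = len(records)
    survival = 1.0
    table = []
    for wave in sorted(set(events) | set(censored)):
        d, c = events[wave], censored[wave]
        effective = at_risk - c / 2.0  # censored cases count as half at risk
        hazard = d / effective if effective > 0 else 0.0
        survival *= 1.0 - hazard
        table.append((wave, at_risk, hazard, survival))
        at_risk -= d + c
    return table

# Hypothetical toy data, not KLIPS values.
toy = [(1, True), (1, True), (2, True), (2, False), (3, True),
       (3, False), (3, False), (3, False), (3, False), (3, False)]
for row in life_table(toy):
    print(row)
```

The Cox proportional hazards model would additionally relate the hazard to respondent covariates; it is not implemented in this sketch.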
Test for substitution in telecommunication market system
Kim, Tae-Ho ;
Journal of the Korean Data and Information Science Society, volume 22, issue 1, 2011, Pages 9~18
After the launch of PCS in 1997, the mobile telephone industry achieved remarkable growth in market share and sales and substituted for the long-distance wire communication market. This study investigates the interrelationships between the two markets in transition and tests statistically whether the diffusion of mobile telephones was the direct cause of the reduction in sales in the long-distance communication industry. Evidence of a direct substitution effect between the two markets is not found. Contrary to general expectations, the test results suggest that the initial complementary relationship was sustained for a while.
A product recommendation system based on adjacency data
Kim, Jin-Hwa ; Byeon, Hyeon-Su ;
Journal of the Korean Data and Information Science Society, volume 22, issue 1, 2011, Pages 19~27
Recommendation systems are developed to overcome the problems of selection and to promote intention to use. In this study, we propose a recommendation system that uses adjacency data reflecting users' behavior over time. Product adjacencies are identified from an adjacency matrix based on graph theory. We find that there is a trend in users' behavior over time, although product adjacency fluctuates over time. The system is evaluated for usability. The tests show that implementing this recommendation system increases users' intention to purchase and reduces search time.
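One way to picture an adjacency-based recommender is sketched below. The abstract does not specify how adjacency is defined, so this sketch assumes that product b is adjacent to product a when b directly follows a in a user's session; the function names `build_adjacency` and `recommend` and the session data are hypothetical.

```python
from collections import defaultdict

def build_adjacency(sessions):
    """Count how often product b directly follows product a in a session."""
    adj = defaultdict(lambda: defaultdict(int))
    for seq in sessions:
        for a, b in zip(seq, seq[1:]):
            if a != b:
                adj[a][b] += 1
    return adj

def recommend(adj, product, k=2):
    """Return up to k products most often adjacent to `product`.

    Ties are broken alphabetically so the result is deterministic.
    """
    neighbours = adj.get(product, {})
    ranked = sorted(neighbours.items(), key=lambda kv: (-kv[1], kv[0]))
    return [name for name, _ in ranked[:k]]

# Hypothetical browsing sessions (ordered product views per user).
sessions = [["A", "B", "C"], ["A", "B", "D"], ["B", "C"], ["A", "C"]]
adj = build_adjacency(sessions)
print(recommend(adj, "A"))
```

Building one such matrix per time window would allow the fluctuation of adjacency over time to be compared across windows.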
CUSUM control chart for Katz family of distributions
Cho, Gyo-Young ;
Journal of the Korean Data and Information Science Society, volume 22, issue 1, 2011, Pages 29~35
In statistical process control, the primary method used to monitor the number of nonconformities is the c-chart. The conventional c-chart is based on the assumption that the occurrence of nonconformities in samples is well modeled by a Poisson distribution. When the Poisson assumption is not met, the X-chart is often used as an alternative charting scheme in practice. The CUSUM chart is used when out-of-control situations must be detected very quickly, because it is sensitive to small or gradual drifts in the process. In this paper, I compare the CUSUM chart with the X-chart for the Katz family, which covers equi-, under-, and over-dispersed distributions relative to the Poisson distribution.
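The sensitivity of a CUSUM chart to small drifts comes from accumulating deviations over time. A minimal sketch of a one-sided upper CUSUM for counts follows; the reference value `k`, the decision interval `h`, and the data are illustrative assumptions, not values from the paper.

```python
def upper_cusum(counts, k, h):
    """One-sided upper CUSUM: C_t = max(0, C_{t-1} + x_t - k).

    Returns (statistics, index of the first t with C_t > h, or None).
    """
    c = 0.0
    stats = []
    first_signal = None
    for t, x in enumerate(counts):
        c = max(0.0, c + x - k)  # accumulate excess over the reference value
        stats.append(c)
        if first_signal is None and c > h:
            first_signal = t
    return stats, first_signal

# Hypothetical nonconformity counts with a gradual upward drift.
counts = [3, 4, 2, 5, 6, 7, 6, 8]
stats, signal = upper_cusum(counts, k=5, h=4)
print(stats, signal)
```

In this toy series no single count is extreme on its own, yet the accumulated excess eventually crosses the decision interval.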
Using rough set to support arbitrage box spread strategies in KOSPI 200 option markets
Kim, Min-Sik ; Oh, Kyong-Joo ;
Journal of the Korean Data and Information Science Society, volume 22, issue 1, 2011, Pages 37~47
Various investment strategies have been developed for the stock price index option market. In particular, arbitrage strategies are important for the efficiency of the option market. The purpose of this study is to improve profit by applying rough sets to box spread strategies using past option trading data. The option trading data are actual exchange tick data from 2001 to 2006. For validation, the tick data were converted into one-minute intervals. Box spread arbitrage strategies carry low risk but yield low profit. Profit can be improved by back-testing the existing strategy on past data and by using rough sets, which restrict the time window for trading. The study suggests that more stable profits with lower risk can be obtained by controlling the strategy so that it produces a higher profit than the existing strategy at the same level of risk.
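For readers unfamiliar with the box spread, the following sketch shows why it is a low-risk position: with European options, a long call and short put at strike k1 combined with a short call and long put at strike k2 pays k2 - k1 at expiry regardless of the underlying price. The function names, prices, and the neglect of transaction costs are assumptions made for illustration; the rough set filtering described in the abstract is not shown.

```python
import math

def box_payoff(s_t, k1, k2):
    """Expiry payoff of long call(k1), short put(k1), short call(k2), long put(k2)."""
    def call(k):
        return max(s_t - k, 0.0)

    def put(k):
        return max(k - s_t, 0.0)

    return call(k1) - put(k1) - call(k2) + put(k2)

def box_arbitrage_profit(c1, p1, c2, p2, k1, k2, r, t):
    """Present value of the fixed payoff minus the net premium paid.

    c1, p1: call and put prices at strike k1; c2, p2: at strike k2.
    r: continuously compounded risk-free rate; t: time to expiry in years.
    Transaction costs are ignored in this sketch.
    """
    cost = c1 - p1 - c2 + p2
    return (k2 - k1) * math.exp(-r * t) - cost

# Hypothetical quotes: the payoff is 10 whatever the index level at expiry.
print([box_payoff(s, 100, 110) for s in (90, 100, 105, 120)])
print(box_arbitrage_profit(7.0, 2.0, 3.0, 7.5, 100, 110, r=0.0, t=0.25))
```

A positive result from `box_arbitrage_profit` indicates that the box is priced below the present value of its certain payoff.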
The deduction of objective linguistic information using statistical methods - The grouping of the possibility of interdisciplinary research
Choi, Kyoung-Ho ; Lee, Yong-Wook ;
Journal of the Korean Data and Information Science Society, volume 22, issue 1, 2011, Pages 49~55
Many fields are attempting to unite through consilience, and interdisciplinary research is one instance. Linguistic informatics, or quantitative linguistics, is an interdisciplinary field combining statistics and linguistics that has been studied chiefly by linguists. From a statistical standpoint, the results of this research by linguists need some supplementing. This study shows that statistical methods can supplement the insufficient objectivity of linguistic studies, and examines how to raise the degree of completeness of interdisciplinary research on statistics and linguistics. It also shows that introducing and applying statistical methods can be useful for deducing objective linguistic information in linguistic studies.
Association analysis between sports talent test scores and KOSTASS scores
Park, Cheol-Yong ; Lee, Mi-Sook ;
Journal of the Korean Data and Information Science Society, volume 22, issue 1, 2011, Pages 57~64
The Korea Sport Talent Search System (KOSTASS) can easily assess the potential of Korean youth in 23 sport events and identify sports talents with high potential in track and field, swimming, and gymnastics. The purpose of the present study was to analyze the relationship between sports talent test scores and KOSTASS scores. Physique, physical fitness, motor ability, and physiological tests, including skeletal age, were administered. Data were collected from 201 elementary school students aged 7 to 12 living in the Seoul area. For pure association analysis, the Pearson correlation coefficient and the partial correlation coefficient are used, and stepwise regression analysis is used to select important sports talent test scores. The results show that skeletal age, weight, chest circumference, 50-meter run, and half-squat jump have relatively low impact on six KOSTASS scores. The KOSTASS program would contribute to identifying talented youth in sports among the Korean youth population and to success in international sports competition.
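The two association measures named in this abstract can be sketched as follows. The first-order partial correlation removes the linear effect of a single control variable z from the correlation between x and y. The function names and the data are hypothetical, and the abstract does not state which variables the authors controlled for.

```python
import math

def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

def partial_corr(x, y, z):
    """First-order partial correlation of x and y controlling for z."""
    rxy, rxz, ryz = pearson(x, y), pearson(x, z), pearson(y, z)
    return (rxy - rxz * ryz) / math.sqrt((1 - rxz ** 2) * (1 - ryz ** 2))

# Hypothetical scores; z is constructed to be uncorrelated with x and y,
# so the partial correlation equals the plain correlation here.
x = [1, 2, 3, 4]
y = [2, 1, 4, 3]
z = [1, -1, -1, 1]
print(pearson(x, y), partial_corr(x, y, z))
```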
The study for effectiveness of golf skills to adjust average score using path analysis in 2010 PGA
Min, Dae-Kee ;
Journal of the Korean Data and Information Science Society, volume 22, issue 1, 2011, Pages 65~71
Path analysis is a useful method for finding direct and indirect effects between variables. For studying causal relationships, it has an advantage over regression analysis. In this study, I use path analysis to identify the direct and indirect relationships between golf skills and adjusted average scores. The data were collected from the 2010 PGA and analyzed with AMOS in SPSS.
An empirical study on the influence of product portfolio and interest rate on the lapse rate in the life insurance industry
Jung, Se-Chang ; Ouh, Seung-Cheol ; Kang, Jung-Chul ;
Journal of the Korean Data and Information Science Society, volume 22, issue 1, 2011, Pages 73~80
The purpose of this study is to analyse the influence of product portfolio and interest rate on the lapse rate. This issue is very important because of the recent introduction of IFRS and CFP. The fixed-effect model and the random-effect model are estimated using panel data, and the Hausman test is employed to select a model. The results of this study are summarized as follows. Firstly, the random-effect model is selected. According to the model, the lapse rate increases as the portfolio of savings plan, sickness, and death products increases and when the interest rate is high. Secondly, health insurance and variable insurance products show a negative relationship with the lapse rate.
A study on insignificant rules discovery in association rule mining
Cho, Kwang-Hyun ; Park, Hee-Chang ;
Journal of the Korean Data and Information Science Society, volume 22, issue 1, 2011, Pages 81~88
Association rule mining searches for interesting relationships among items in a given database. There are three primary quality measures for association rules: support, confidence, and lift. To improve the efficiency of existing mining algorithms, constraints have been applied during the mining process so that only those association rules that are interesting to users are generated, instead of all association rules. When association rules are generated, a large number of rules is often found. Some of these are rules for which no direct relationship exists, because the association arises through an intervening variable. In this study, we try to discover such insignificant rules among association rules by means of an intervening variable. The results of this study allow the relationships expressed by generated association rules to be understood more precisely.
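The three quality measures can be computed directly from transaction counts. The sketch below uses the standard definitions (support of A and C together, confidence as support divided by the support of A, lift as confidence divided by the support of C); the function name `rule_metrics` and the basket data are hypothetical.

```python
def rule_metrics(transactions, antecedent, consequent):
    """Return (support, confidence, lift) for the rule antecedent -> consequent."""
    a, c = set(antecedent), set(consequent)
    n = len(transactions)
    n_a = sum(1 for t in transactions if a <= set(t))
    n_c = sum(1 for t in transactions if c <= set(t))
    n_ac = sum(1 for t in transactions if (a | c) <= set(t))
    support = n_ac / n
    confidence = n_ac / n_a if n_a else 0.0
    lift = confidence / (n_c / n) if n_c else 0.0
    return support, confidence, lift

# Hypothetical market baskets.
baskets = [
    {"bread", "milk"},
    {"bread", "butter"},
    {"milk", "butter"},
    {"bread", "milk", "butter"},
]
print(rule_metrics(baskets, {"bread"}, {"milk"}))
```

A lift below 1, as in this toy example, means the antecedent makes the consequent less likely than its base rate. Detecting that a rule is driven by an intervening variable would require conditioning on that variable, which this sketch does not do.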
Estimation of conditional mean residual life function with random censored data
Lee, Won-Kee ; Song, Myung-Unn ; Jeong, Seong-Hwa ;
Journal of the Korean Data and Information Science Society, volume 22, issue 1, 2011, Pages 89~97
The aims of this study were to propose a method for estimating the mean residual life function (MRLF) from the conditional survival function using Buckley and James's (1979) pseudo random variables, and then to assess the performance of the proposed method through simulation studies. The mean squared errors (MSEs) of the proposed method were smaller than those of Cox's proportional hazards model (PHM) and Beran's nonparametric method in the non-PHM case. Furthermore, in the PHM case, the MSEs of the proposed method were similar to those of Cox's PHM. Finally, to evaluate its appropriateness for practical use, we applied the proposed method to gastric cancer data. The data set consists of 1,192 patients with gastric cancer who underwent surgery at the Department of Surgery, K-University Hospital.
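For reference, the standard definition of the conditional mean residual life function in terms of the conditional survival function is given below. The symbols T (survival time), x (covariate value), and S(t | x) are notational assumptions; the abstract does not fix a notation.

```latex
\[
m(t \mid x) \;=\; \mathrm{E}\left[\,T - t \mid T > t,\; X = x\,\right]
\;=\; \frac{\int_{t}^{\infty} S(u \mid x)\, du}{S(t \mid x)},
\qquad S(t \mid x) > 0 .
\]
```

Any estimator of S(t | x), such as one from Cox's model or Beran's nonparametric method, can be plugged into this ratio to obtain an estimate of the MRLF.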
Semi-supervised learning using similarity and dissimilarity
Seok, Kyung-Ha ;
Journal of the Korean Data and Information Science Society, volume 22, issue 1, 2011, Pages 99~105
We propose a semi-supervised learning algorithm based on a form of regularization that incorporates similarity and dissimilarity penalty terms. Our approach uses a graph-based encoding of similarity and dissimilarity. We also present a model-selection method that employs cross-validation to choose the hyperparameters that affect the performance of the proposed method. Simulations using two types of data sets demonstrate that the proposed method is promising.
A study on log-density ratio in logistic regression model for binary data
Kahng, Myung-Wook ;
Journal of the Korean Data and Information Science Society, volume 22, issue 1, 2011, Pages 107~113
We present methods for studying the log-density ratio, which allow us to select which predictors are needed, and how they should be included in the logistic regression model. Under multivariate normal distributional assumptions, we investigate the form of the log-density ratio as a function of many predictors. The linear, quadratic and crossproduct terms are required in general. If two covariance matrices are equal, then the crossproduct and quadratic terms are not needed. If the variables are uncorrelated, we do not need the crossproduct terms, but we still need the linear and quadratic terms.
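The three cases in this abstract follow from expanding the log-density ratio of two multivariate normal densities. The notation below (class-conditional densities f_1 and f_0 with means mu_1, mu_0 and covariance matrices Sigma_1, Sigma_0) is an assumption made for illustration.

```latex
\[
\begin{aligned}
\log\frac{f_1(x)}{f_0(x)}
  &= -\tfrac12\log\frac{|\Sigma_1|}{|\Sigma_0|}
     -\tfrac12 (x-\mu_1)^{\top}\Sigma_1^{-1}(x-\mu_1)
     +\tfrac12 (x-\mu_0)^{\top}\Sigma_0^{-1}(x-\mu_0) \\
  &= c + \left(\Sigma_1^{-1}\mu_1-\Sigma_0^{-1}\mu_0\right)^{\top} x
     -\tfrac12\, x^{\top}\left(\Sigma_1^{-1}-\Sigma_0^{-1}\right) x ,
\end{aligned}
\]
\[
c = -\tfrac12\log\frac{|\Sigma_1|}{|\Sigma_0|}
    -\tfrac12\,\mu_1^{\top}\Sigma_1^{-1}\mu_1
    +\tfrac12\,\mu_0^{\top}\Sigma_0^{-1}\mu_0 .
\]
```

When Sigma_1 = Sigma_0 the quadratic form vanishes and only linear terms remain. When both covariance matrices are diagonal, the difference of their inverses is diagonal, so the quadratic form contributes squared terms but no crossproduct terms.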
Noninformative priors for stress-strength reliability in the Pareto distributions
Kang, Sang-Gil ; Kim, Dal-Ho ; Lee, Woo-Dong ;
Journal of the Korean Data and Information Science Society, volume 22, issue 1, 2011, Pages 115~123
In this paper, we develop noninformative priors for stress-strength reliability in the Pareto distributions. We develop the matching priors and the reference priors. It turns out that the second order matching prior does not match the alternative coverage probabilities, and is neither a highest posterior density matching prior nor a cumulative distribution function matching prior. We also show that the one-at-a-time reference prior and Jeffreys' prior are second order matching priors. Through a simulation study, we show that the proposed reference prior matches the target coverage probabilities in a frequentist sense, and an example is given.
An analysis of the potential impact of various ozone regulatory standards on mortality
Kim, Yong-Ku ;
Journal of the Korean Data and Information Science Society, volume 22, issue 1, 2011, Pages 125~136
Ground-level ozone, an air pollutant that is monitored by the Environmental Protection Agency (EPA), damages human health by irritating the respiratory system, reducing lung function, damaging lung cells, and aggravating asthma and other chronic conditions. In March 2008, the EPA strengthened ozone standards by lowering acceptable limits from 84 parts per billion to 75 parts per billion. Here epidemiologic data are used to study the effects of ozone regulation on human health and to assess how various regulatory standards for ozone may affect nonaccidental mortality, including respiratory-related deaths, during ozone season. The assessment uses statistical methods based on hierarchical Bayesian models to predict the potential effects of the different regulatory standards. It also analyzes the variability of the results and how they are affected by different modeling assumptions. We focus on the technical and statistical approach to assessing the relationship between new ozone regulations and mortality, whereas other studies have detailed the relationship between ozone and human mortality. We show a statistical correlation between ozone regulations and mortality, with lower limits of acceptable ozone linked to a decrease in deaths, and project that mortality will decrease as ozone regulatory standards are lowered.