• Title/Summary/Keyword: Stepwise Regression

Search Result 2,420, Processing Time 0.039 seconds

A Prediction Method Combining Clustering Method and Stepwise Regression (군집분석 기법과 단계별 회귀모델을 결합한 예측 방법)

  • Chong Il-gyo;Jun Chi-Hyuck
    • Proceedings of the Korean Operations and Management Science Society Conference
    • /
    • 2002.05a
    • /
    • pp.949-952
    • /
    • 2002
  • A regression model is used in predicting the response variable given predictor variables However, in case of large number of predictor variables, a regression model has some problems such as multicollinearity, interpretation of the functional relationship between the response and predictors and prediction accuracy. A clustering method and stepwise regression could be used to reduce the amount of data by grouping predictors having similar properties and by selecting the subset of predictors. respectively. This paper proposes a prediction method combining clustering method and stepwise regression. The proposed method fits a global model and local models and predicts responses given new observations by using both models. The paper also compares the performance of proposed method with stepwise regression via a real data of ample obtained in a steel process.

  • PDF

Quantitative Analysis by Diffuse Reflectance Infrared Fourier Transform and Linear Stepwise Multiple Regression Analysis I -Simultaneous quantitation of ethenzamide, isopropylantipyrine, caffeine, and allylisopropylacetylurea in tablet by DRIFT and linear stepwise multiple regression analysis-

  • Park, Man-Ki;Yoon, Hye-Ran;Kim, Kyoung-Ho;Cho, Jung-Hwan
    • Archives of Pharmacal Research
    • /
    • v.11 no.2
    • /
    • pp.99-113
    • /
    • 1988
  • Quantitation of ethenzamide, isopropylantipyrine and caffeine takes about 41 hrs by conventional GC method. Quantitation of allylisoprorylacetylurea takes about 40 hrs by conventional UV method. But quantitation of them takes about 6 hrs by DRIFT developing method. Each standard and sample sieved, powdered and acquired DRIFT spectrum. Out of them peak of each component was selected and ratio of each peak to standard peak was acquired, and then linear stepwise multiple regression was performed with these data and concentration. Reflectance value, Kubelka-Munk equation and Inverse-Kubelka-Munk equation were modified by us. Inverse-Kubelka-Munk equation completed the deficit of Kubelka-Munk equation. Correlation coefficients acquired by conventioanl GC and UV against DRIFT were more than 0.95.

  • PDF

Alternative Derivation of Stepwise Multivariate Linear Regression (段階的 多變量 線型回歸에 관하여)

  • 申敏雄;金周成
    • Journal of the Korean Statistical Society
    • /
    • v.7 no.2
    • /
    • pp.105-108
    • /
    • 1978
  • Freund, Vail, and Ross, Goldberger and Jochems and Goldberger have given some results for the stepwise estimation of the parameters of a univariate regression model, D.G. Kabe gave similar results for a multivariate linear regression model. We give here alternative derivation of some results derived by D.G. Kabe.

  • PDF

Analysis of Client Propensity in Cyber Counseling Using Bayesian Variable Selection

  • Pi, Su-Young
    • International Journal of Fuzzy Logic and Intelligent Systems
    • /
    • v.6 no.4
    • /
    • pp.277-281
    • /
    • 2006
  • Cyber counseling, one of the most compatible type of consultation for the information society, enables people to reveal their mental agonies and private problems anonymously, since it does not require face-to-face interview between a counsellor and a client. However, there are few cyber counseling centers which provide high quality and trustworthy service, although the number of cyber counseling center has highly increased. Therefore, this paper is intended to enable an appropriate consultation for each client by analyzing client propensity using Bayesian variable selection. Bayesian variable selection is superior to stepwise regression analysis method in finding out a regression model. Stepwise regression analysis method, which has been generally used to analyze individual propensity in linear regression model, is not efficient since it is hard to select a proper model for its own defects. In this paper, based on the case database of current cyber counseling centers in the web, we will analyze clients' propensities using Bayesian variable selection to enable individually target counseling and to activate cyber counseling programs.

Effective Components on the Taste of Ordinary Korean Soy Sauce (한국재래식 간장의 맛에 영향을 미치는 성분)

  • 김종규;정영건;양성호
    • Microbiology and Biotechnology Letters
    • /
    • v.13 no.3
    • /
    • pp.285-287
    • /
    • 1985
  • To investigate effective constituents of the many taste components in ordinary Korean soy sauce, we analyzed free amino acids, organic acids, free sugars and saline as taste components in ordinary Korean soy sauce, and determined sensory score of the ordinary Korean soy sauce taste with 45 persons of the trained pannels. The relationships between original data transformed with variables and sensory score of the ordinary Korean soy sauce were analyzed by stepwise multiple regression analysis. Eighty five percents of the ordinary Korean soy sauce taste is affected by twenty one kinds (Isoleucine, Leucine, Valine, NaCl, Lactic acid, Alanine, Phenylalanine, Tartaric acid, Sugar(\ulcorner), Proline, Malic acid, Glycine, Tryptophan, Arginine, Glutaric acid, Maltose, Histidine, Glucose, Fructose and Serine) of the taste components by stepwise multiple regression analysis of original data. Eighty one percents of the ordinary Korean soy sance taste is affected by sixteen kinds (Lactic acid, NaCl, Fumaric.Succinic acid, Tyrosine, Tartaric acid, Glycine, Malonic acid, Malic acid, Tryptophan, Glutaric acid, Methionine, Histidine, Cysteine, Maltose, Fructose and (Glutamic acid) of the taste components by stepwise multiple frgression analysis of original data transformed with square root. Eighty five percents of the ordinary Korean soy sauce taste is affected by nineteen kinds (Fumaric.Succinic acid, Lactic acid, Phenylalanine, NaCl, Tyrosine, Sugar(\ulcorner), Tartaric acid, Leucine, Glutaric acid, Methionine, Glycine, Tryptophan, Histidine, Proline, Cysteine, Glutamic acid, Maltose, Threonine and Oxalic acid) of the taste components by stepwise multiple regression analysis of original data transformed with logarithm.

  • PDF

Evaluating Variable Selection Techniques for Multivariate Linear Regression (다중선형회귀모형에서의 변수선택기법 평가)

  • Ryu, Nahyeon;Kim, Hyungseok;Kang, Pilsung
    • Journal of Korean Institute of Industrial Engineers
    • /
    • v.42 no.5
    • /
    • pp.314-326
    • /
    • 2016
  • The purpose of variable selection techniques is to select a subset of relevant variables for a particular learning algorithm in order to improve the accuracy of prediction model and improve the efficiency of the model. We conduct an empirical analysis to evaluate and compare seven well-known variable selection techniques for multiple linear regression model, which is one of the most commonly used regression model in practice. The variable selection techniques we apply are forward selection, backward elimination, stepwise selection, genetic algorithm (GA), ridge regression, lasso (Least Absolute Shrinkage and Selection Operator) and elastic net. Based on the experiment with 49 regression data sets, it is found that GA resulted in the lowest error rates while lasso most significantly reduces the number of variables. In terms of computational efficiency, forward/backward elimination and lasso requires less time than the other techniques.

A Multivariate Analysis of Korean Professional Players Salary (한국 프로스포츠 선수들의 연봉에 대한 다변량적 분석)

  • Song, Jong-Woo
    • The Korean Journal of Applied Statistics
    • /
    • v.21 no.3
    • /
    • pp.441-453
    • /
    • 2008
  • We analyzed Korean professional basketball and baseball players salary under the assumption that it depends on the personal records and contribution to the team in the previous year. We extensively used data visualization tools to check the relationship among the variables, to find outliers and to do model diagnostics. We used multiple linear regression and regression tree to fit the model and used cross-validation to find an optimal model. We check the relationship between variables carefully and chose a set of variables for the stepwise regression instead of using all variables. We found that points per game, number of assists, number of free throw successes, career are important variables for the basketball players. For the baseball pitchers, career, number of strike-outs per 9 innings, ERA, number of homeruns are important variables. For the baseball hitters, career, number of hits, FA are important variables.

A Comparative Analysis of the Forecasting Performance of Coal and Iron Ore in Gwangyang Port Using Stepwise Regression and Artificial Neural Network Model (단계적 회귀분석과 인공신경망 모형을 이용한 광양항 석탄·철광석 물동량 예측력 비교 분석)

  • Cho, Sang-Ho;Nam, Hyung-Sik;Ryu, Ki-Jin;Ryoo, Dong-Keun
    • Journal of Navigation and Port Research
    • /
    • v.44 no.3
    • /
    • pp.187-194
    • /
    • 2020
  • It is very important to forecast freight volume accurately to establish major port policies and future operation plans. Thus, related studies are being conducted because of this importance. In this paper, stepwise regression analysis and artificial neural network model were analyzed to compare the predictive power of each model on Gwangyang Port, the largest domestic port for coal and iron ore transportation. Data of a total of 121 months J anuary 2009-J anuary 2019 were used. Factors affecting coal and iron ore trade volume were selected and classified into supply-related factors and market/economy-related factors. In the stepwise regression analysis, the tonnage of ships entering the port, coal price, and dollar exchange rate were selected as the final variables in case of the Gwangyang Port coal volume forecasting model. In the iron ore volume forecasting model, the tonnage of ships entering the port and the price of iron ore were selected as the final variables. In the analysis using the artificial neural network model, trial-and-error method that various Hyper-parameters affecting the performance of the model were selected to identify the most optimal model used. The analysis results showed that the artificial neural network model had better predictive performance than the stepwise regression analysis. The model which showed the most excellent performance was the Gwangyang Port Coal Volume Forecasting Artificial Neural Network Model. In comparing forecasted values by various predictive models and actually measured values, the artificial neural network model showed closer values to the actual highest point and the lowest point than the stepwise regression analysis.

Validation of Nursing Care Sensitive Outcomes related to Knowledge (지식에 관한 간호결과도구의 타당성 조사)

  • 이은주
    • Journal of Korean Academy of Nursing
    • /
    • v.33 no.5
    • /
    • pp.625-632
    • /
    • 2003
  • Purpose: The purpose of this study was to assess the importance and sensitivity to nursing interventions of four nursing sensitive nursing outcomes selected from the Nursing Outcomes Classification (NOC). Outcomes for this study were 'Knowledge: Diet', 'Knowledge: Disease Process', 'Knowledge: Energy Conservation', and 'Knowledge: Health Behaviors'. Method: Data were collected from 183 nurses working in 2 university hospitals. Fehring method was used to estimate outcome and indicators' content and sensitivity validity. Multiple and stepwise regression were used to evaluate relationships between each outcome and its indicators. Result: Results confirmed the importance and nursing sensitivity of outcomes and their indicators. Key indicators of each outcomes were found by multiple regression. 'Knowledge: Diet' was suggested for adding new indicators because the variance explained by indicators was relatively low. Not all of the indicators selected for stepwise regression model were rated for highly in Fehring method. The R² statistics of the stepwise regression models were between 18 and 63% in importance by selected indicators and between 34 and 68% in contribution by selected indicators. Conclusion: This study refined what outcomes and indicators will be useful in clinical practice. Further research will be required for the revision of outcome and indicators of NOC. However, this study refined what outcomes and indicators will be useful in clinical practice.