• Title/Summary/Keyword: Regression analysis

Search Result 23,386, Processing Time 0.047 seconds

Application and Understanding of Regression Analysis in the Quality Improvement Activities (식스시그마 품질개선 단계에서 GLM 회귀분석의 이해와 적용)

  • Choi, Sung-Woon
    • Proceedings of the Safety Management and Science Conference
    • /
    • 2010.11a
    • /
    • pp.539-550
    • /
    • 2010
  • The study presents the application strategy and understanding of regression analysis with GLM(Generalized Linear Model) unifying with other statistical techniques such as correlation analysis and design of experiment(DOE). The quidelines proposed in this paper can be used for practioners to implement GLM and ANOVA(Analysis of Variance) for the DMAIC 5 steps of six sigma breakthrough.

  • PDF

Sensitivity Analysis in Principal Component Regression with Quadratic Approximation

  • Shin, Jae-Kyoung;Chang, Duk-Joon
    • Journal of the Korean Data and Information Science Society
    • /
    • v.14 no.3
    • /
    • pp.623-630
    • /
    • 2003
  • Recently, Tanaka(1988) derived two influence functions related to an eigenvalue problem $(A-\lambda_sI)\upsilon_s=0$ of real symmetric matrix A and used them for sensitivity analysis in principal component analysis. In this paper, we deal with the perturbation expansions up to quadratic terms of the same functions and discuss the application to sensitivity analysis in principal component regression analysis(PCRA). Numerical example is given to show how the approximation improves with the quadratic term.

  • PDF

Analysis of the Effect of Wind Direction on Ozone Level

  • Na, Jong-Hwa;Sung, Su-Jin;Yu, Hye-Kyung
    • Communications for Statistical Applications and Methods
    • /
    • v.19 no.4
    • /
    • pp.527-536
    • /
    • 2012
  • In this paper we analyze the effect of circular variables such as wind direction, time and month on the ozone level. In particular, we examined the effect of wind direction by exploratory data analysis methods and provide the correlation and regression analyzes in the cases including all circular explanatory variables. In the analysis, we convert time and month variables to circular variables and analyze the effect of these variables on regression analysis; in addition, we also consider circular-circular regression. We used weather condition and air pollution data collected from Dongdaemoon district of Seoul in 2007.

A Quantitative Model for the Projection of Health Expenditure (의료비 결정요인 분석을 위한 계량적 모형 고안)

  • Kim, Han-Joong;Lee, Young-Doo;Nam, Chung-Mo
    • Journal of Preventive Medicine and Public Health
    • /
    • v.24 no.1 s.33
    • /
    • pp.29-36
    • /
    • 1991
  • A multiple regression analysis using ordinary least square (OLS) is frequently used for the projection of health expenditure as well as for the identification of factors affecting health care costs. Data for the analysis often have mixed characteristics of time series and cross section. Parameters as a result of OLS estimation, in this case, are no longer the best linear unbiased estimators (BLUE) because the data do not satisfy basic assumptions of regression analysis. The study theoretically examined statistical problems induced when OLS estimation was applied with the time series cross section data. Then both the OLS regression and time series cross section regression (TSCS regression) were applied to the same empirical da. Finally, the difference in parameters between the two estimations were explained through residual analysis.

  • PDF

A Study on Determinants of Inventory Turnover using Quantile Regression Analysis (분위회귀분석을 이용한 재고회전율 결정요인 분석)

  • Kim, Gilwhan
    • Asia-Pacific Journal of Business
    • /
    • v.13 no.1
    • /
    • pp.185-195
    • /
    • 2022
  • Purpose - This study attempts to analyze the determinants of inventory turnover by applying quantile regression analysis. Design/methodology/approach - By selecting the gross margin, capital intensity, and sale surprise as the determinants of inventory turnover, we investigate their effects on inventory turnover at the several quartiles (10%, 25%, 50%, 75%, 90%) of inventory turnover with quantile regression analysis. Findings - The effects of gross margin and capital intensity on inventory turnover are different for each quartile. But the effects of sale surprise on inventory turnover are not different for each quartile. Research implications or Originality -This study is the first attempt to examine the effects of inventory turnover determinants on inventory turnover by applying quantile regression analysis was not employed in the prior studies. Thus, this study is meaningful in that it shows the possible way to review inventory management strategies that can be applied differently to the firms with different inventory turnover levels.

Analysis on the Regional Variation of the Rate of Inpatient Medical Costs in Local-Out: Geographically Weighted Regression Approach (지리적가중회귀분석을 이용한 관외입원진료비 비율의 지역 간 차이 분석)

  • Jo, Eun-Kyung;Lee, Kwang-Soo
    • The Korean Journal of Health Service Management
    • /
    • v.8 no.2
    • /
    • pp.11-22
    • /
    • 2014
  • This study purposed to analyze the regional variation of the local-out rates of inpatient services. Multiple data sources collected from National Health Insurance Corporation and statistics Korea were merged to produce the analysis data set. The unit of analysis in this study was city, Gun, Gu, and all of them were included in analysis. The dependent variable measured the local-out rate of inpatient cost in study regions. Local environments were measured by variables in three dimensions: provider factors, socio-demographic factors, and health status. Along with the traditional ordinary least square (OLS) based regression model, geographically weighted regression (GWR) model were applied to test their effects. SPSS v21 and ArcMap v10.2 were applied for the statistical analysis. Results from OLS regression showed that most variables had significant relationships with the local-out rate of inpatient services. However, some variables had shown diverse directions in regression coefficients depending on regions in GWR. This implied that the study variables might not have consistent effects and they may varied depending the locations.

The Effects of Urban Forest on Summer Air Temperature in Seoul, Korea (도시림의 여름 대기온도 저감효과 - 서울시를 대상으로 -)

  • 조용현;신수영
    • Journal of the Korean Institute of Landscape Architecture
    • /
    • v.30 no.4
    • /
    • pp.28-36
    • /
    • 2002
  • The main purpose of this study was to estimate a new regression model to explain the relationship between urban forest and air temperature in summer, 2001. This study consists of two parts: correlation coefficient analysis and regression analysis. According to correlation coefficient analysis, thermal infra-red radiations of the major land use categories found significant difference in each category. However there were no significant relationship between the data (thermal infra-red radiation and NDVI) derived from Landsat-7 ETM+ image and air temperature at Automatic Weather Stations(AWSs). After estimating various regression models for summer air temperature, the final models were chosen. The final regression models consisted of two variables such as forest m and traffic facilities area. The regression models explained over 78% of the variability in air temperatures. The regression models with variables of forest area and traffic facilities area showed that the coefficient of the first variable was even more significant than the second one. However, the negative impact of the traffic facilities area was slightly greater than the positive impact of the forest area. Consequently, the effects of forest area and traffic facilities area were apparent to explain summer air temperature in Seoul. Therefore two policies have the most important implications to mitigate the summer air temperature in Seoul: to expand and to conserve the urban forest; and to change the Oafnc facilities'characteristics. The results from this study are expected to be useful not merely in informing the public that urban forest mitigates summer air temperahne, but in urging the necessity of budgets for trees and managing urban forests. It is recommended that field swey of summer air temperature be Performed for the vadidation of the models. The main purpose of this study was to estimate a new regression model to explain the relationship between urban forest and air temperature in summer, 2001. This study consists of two parts: correlation coefficient analysis and regression analysis. According to correlation coefficient analysis, thermal infra-red radiations of the major land use categories found significant difference in each category. However there were no significant relationship between the data (thermal infra-red radiation and NDVI) derived from Landsat-7 ETM+ image and air temperature at Automatic Weather Stations(AWSs). After estimating various regression models for summer air temperature, the final models were chosen. The final regression models consisted of two variables such as forest m and traffic facilities area. The regression models explained over 78% of the variability in air temperatures. The regression models with variables of forest area and traffic facilities area showed that the coefficient of the first variable was even more significant than the second one. However, the negative impact of the traffic facilities area was slightly greater than the positive impact of the forest area. Consequently, the effects of forest area and traffic facilities area were apparent to explain summer air temperature in Seoul. Therefore two policies have the most important implications to mitigate the summer air temperature in Seoul: to expand and to conserve the urban forest; and to change the traffic facilities'characteristics. The results from this study are expected to be useful not merely in informing the public that urban forest mitigates summer air temperature, but in urging the necessity of budgets for trees and managing urban forests. It is recommended that field survey of summer air temperature be Performed for the vadidation of the models.

Support Vector Machine for Interval Regression

  • Hong Dug Hun;Hwang Changha
    • Proceedings of the Korean Statistical Society Conference
    • /
    • 2004.11a
    • /
    • pp.67-72
    • /
    • 2004
  • Support vector machine (SVM) has been very successful in pattern recognition and function estimation problems for crisp data. This paper proposes a new method to evaluate interval linear and nonlinear regression models combining the possibility and necessity estimation formulation with the principle of SVM. For data sets with crisp inputs and interval outputs, the possibility and necessity models have been recently utilized, which are based on quadratic programming approach giving more diverse spread coefficients than a linear programming one. SVM also uses quadratic programming approach whose another advantage in interval regression analysis is to be able to integrate both the property of central tendency in least squares and the possibilistic property In fuzzy regression. However this is not a computationally expensive way. SVM allows us to perform interval nonlinear regression analysis by constructing an interval linear regression function in a high dimensional feature space. In particular, SVM is a very attractive approach to model nonlinear interval data. The proposed algorithm here is model-free method in the sense that we do not have to assume the underlying model function for interval nonlinear regression model with crisp inputs and interval output. Experimental results are then presented which indicate the performance of this algorithm.

  • PDF

Development of Strength Prediction Model for Lightweight Soil Using Polynomial Regression Analysis (다항회귀분석을 활용한 혼합경량토의 강도산정 모델 개발)

  • Lim, Byung-Gwon;Kim, Yun-Tae
    • Journal of Ocean Engineering and Technology
    • /
    • v.26 no.2
    • /
    • pp.39-47
    • /
    • 2012
  • The objective of this study was to develop a strength prediction model using a polynomial regression analysis based on the experimental results obtained from ninety samples. As the results of a correlation analysis between various mixing factors and unconfined compressive strength using SPSS (statistical package for the social sciences), the governing factors in the strength of lightweight soil were found to be the crumb rubber content, bottom ash content,and water-cement ratio. After selecting the governing factors affecting the strength through the correlation analysis, a strength prediction model, which consisted of the selected governing factors, was developed using the polynomial regression analysis. The strengths calculated from the proposed model were similar to those resulting from laboratory tests (R2=87.5%). Therefore, the proposed model can be used to predict the strength of lightweight mixtures with various mixing ratios without time-consuming experimental tests.

The Geometry Prediction of Back-bead in Arc Welding

  • Lee, Jeong-Ick;Koh, Byung-Kab
    • Transactions of the Korean Society of Machine Tool Engineers
    • /
    • v.16 no.5
    • /
    • pp.84-89
    • /
    • 2007
  • This research was done on the basis of assumption that there is a relationship between welding parameters and geometry of the back-bead being a gap in arc welding. Multiple regression analysis was used as method for predicting the geometry of the back-bead. The analysis data and the verification data were used for the formation of multiple regression analysis. The method was used to perform the prediction of the back-bead.