• Title/Summary/Keyword: principal components

Principal Component Regression by Principal Component Selection

  • Lee, Hosung;Park, Yun Mi;Lee, Seokho
    • Communications for Statistical Applications and Methods / v.22 no.2 / pp.173-180 / 2015
  • We propose a procedure for selecting principal components in principal component regression. Instead of retaining only a small subset of the leading principal components, our method selects components using variable selection procedures. The procedure consists of two steps intended to improve estimation and prediction. First, we reduce the number of principal components using conventional principal component regression to obtain a set of candidate components. Second, we select components from the candidate set using sparse regression techniques. The performance of the proposed procedure is demonstrated numerically and compared with typical dimension reduction approaches (including principal component regression and partial least squares regression) using synthetic and real datasets.
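
The two-step idea in the abstract above can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the sparse regression step is assumed to be a lasso, which reduces to closed-form soft-thresholding here because principal component scores are mutually orthogonal. The function name `pcr_with_selection` and the parameters `n_candidates` and `lam` are hypothetical.

```python
import numpy as np

def pcr_with_selection(X, y, n_candidates, lam):
    """Two-step PCR sketch: keep leading PCs, then select among them by lasso."""
    x_mean = X.mean(axis=0)
    y_mean = y.mean()
    Xc = X - x_mean
    yc = y - y_mean

    # Step 1: conventional PCR candidate set (leading components by variance).
    _, s, Vt = np.linalg.svd(Xc, full_matrices=False)
    V = Vt[:n_candidates].T        # loadings, shape (p, k)
    Z = Xc @ V                     # scores, orthogonal columns
    d = s[:n_candidates] ** 2      # squared norms of the score columns

    # Step 2: lasso on an orthogonal design, solved by soft-thresholding.
    # Minimises 0.5 * ||yc - Z g||^2 + lam * ||g||_1 coordinate by coordinate.
    zy = Z.T @ yc
    gamma = np.sign(zy) * np.maximum(np.abs(zy) - lam, 0.0) / d

    selected = np.flatnonzero(gamma)   # indices of retained components
    beta = V @ gamma                   # coefficients on the original variables
    intercept = y_mean - x_mean @ beta
    return selected, beta, intercept

# Toy data with orthogonal columns, so the PCs coincide with the columns.
X_demo = np.array([[3, 2, 1], [3, 2, -1], [3, -2, 1], [3, -2, -1],
                   [-3, 2, 1], [-3, 2, -1], [-3, -2, 1], [-3, -2, -1]],
                  dtype=float)
y_demo = 5.0 + 2.0 * X_demo[:, 2]      # depends only on the smallest-variance PC
selected_demo, beta_demo, intercept_demo = pcr_with_selection(
    X_demo, y_demo, n_candidates=3, lam=1.0)
```

In this toy case the response depends only on the component with the smallest variance, which a rule that keeps just the leading components would discard, while the selection step retains it.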

A New Deletion Criterion of Principal Components Regression with Orientations of the Parameters

  • Lee, Won-Woo
    • Journal of the Korean Statistical Society / v.16 no.2 / pp.55-70 / 1987
  • Principal components regression is one of the substitutes for the least squares method when multicollinearity exists in the multiple linear regression model. It is observed graphically that the performance of principal components regression depends strongly on the values of the parameters. Accordingly, a new deletion criterion that determines which principal components should be deleted from the analysis is developed, and its usefulness is checked by simulations.

An Efficient Algorithm for Sliding Window Based Incremental Principal Components Analysis

  • Lee, Geunseop
    • Journal of the Korean Mathematical Society / v.57 no.2 / pp.401-414 / 2020
  • It is computationally expensive to compute principal components from scratch at every update or downdate when new data arrive and existing data are frequently truncated from the data matrix. To overcome this limitation, incremental principal component analysis is considered. Specifically, we present an efficient sliding window based incremental principal component computation from a covariance matrix, which consists of two procedures: a simultaneous update and downdate of the principal components, followed by a rank-one matrix update. Additionally, we track the accurate decomposition error and the adaptive numerical rank. Experiments show that the proposed algorithm runs faster than typical incremental principal component analysis algorithms with no meaningful difference in decomposition error, thereby maintaining a good approximation of the principal components.
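
The cost argument in the abstract above can be illustrated with a simplified sketch. It is not the paper's algorithm: it only maintains the window mean and scatter matrix with low-rank corrections when one row enters and one row leaves, and then recomputes the eigendecomposition in full instead of updating the principal components directly. The function names `slide_window` and `principal_components` are hypothetical.

```python
import numpy as np

def slide_window(mean, scatter, x_old, x_new, n):
    """Replace x_old by x_new in a window of n rows without a full recompute."""
    new_mean = mean + (x_new - x_old) / n
    # scatter = sum(x x^T) - n * mean mean^T, so adjust both parts.
    new_scatter = (scatter
                   + np.outer(x_new, x_new) - np.outer(x_old, x_old)
                   - n * (np.outer(new_mean, new_mean) - np.outer(mean, mean)))
    return new_mean, new_scatter

def principal_components(scatter, n):
    """Eigenvalues (descending) and eigenvectors of the sample covariance."""
    eigvals, eigvecs = np.linalg.eigh(scatter / (n - 1))
    order = np.argsort(eigvals)[::-1]
    return eigvals[order], eigvecs[:, order]

# Slide a window of 10 rows two steps along a 12-row data matrix.
rng = np.random.default_rng(1)
data = rng.standard_normal((12, 3))
n_win = 10
win_mean = data[:n_win].mean(axis=0)
centered = data[:n_win] - win_mean
win_scatter = centered.T @ centered
for t in range(n_win, data.shape[0]):
    win_mean, win_scatter = slide_window(
        win_mean, win_scatter, data[t - n_win], data[t], n_win)
eigvals_demo, eigvecs_demo = principal_components(win_scatter, n_win)
```

Each slide costs a few outer products in the number of variables, independent of the window length.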

Numerical Investigations in Choosing the Number of Principal Components in Principal Component Regression - CASE I

  • Shin, Jae-Kyoung;Moon, Sung-Ho
    • Journal of the Korean Data and Information Science Society / v.8 no.2 / pp.127-134 / 1997
  • A method is proposed for choosing the number of principal components in principal component regression based on the predicted error sum of squares. To do this, we evaluate that statistic approximately using a linear approximation based on the perturbation expansion. In this paper, we apply the proposed method to various data sets and discuss some properties of choosing the number of principal components in principal component regression.
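
As a rough illustration of the criterion described above, the sketch below computes the predicted error sum of squares by brute-force leave-one-out refitting. This is a swapped-in technique: the paper evaluates the statistic with a perturbation-based linear approximation, which is not reproduced here. The names `pcr_fit_predict`, `press`, and `choose_n_components` are hypothetical.

```python
import numpy as np

def pcr_fit_predict(X_train, y_train, X_new, k):
    """Fit PCR with the k leading components and predict for X_new."""
    x_mean = X_train.mean(axis=0)
    y_mean = y_train.mean()
    Xc = X_train - x_mean
    _, _, Vt = np.linalg.svd(Xc, full_matrices=False)
    V = Vt[:k].T
    Z = Xc @ V
    gamma, *_ = np.linalg.lstsq(Z, y_train - y_mean, rcond=None)
    return y_mean + (X_new - x_mean) @ V @ gamma

def press(X, y, k):
    """Exact leave-one-out sum of squared prediction errors."""
    n = X.shape[0]
    total = 0.0
    for i in range(n):
        keep = np.arange(n) != i
        pred = pcr_fit_predict(X[keep], y[keep], X[i:i + 1], k)
        total += (y[i] - pred[0]) ** 2
    return total

def choose_n_components(X, y, max_k):
    """Return the k in 1..max_k with the smallest PRESS, plus all scores."""
    scores = [press(X, y, k) for k in range(1, max_k + 1)]
    return int(np.argmin(scores)) + 1, scores

# Noise-free linear response, so using all three components fits exactly.
rng = np.random.default_rng(0)
X_demo = rng.standard_normal((20, 3))
y_demo = X_demo @ np.array([1.0, 2.0, 3.0]) + 4.0
best_k, press_scores = choose_n_components(X_demo, y_demo, max_k=3)
```

The brute-force loop refits the model once per observation, which is the cost an approximation such as the one in the paper is meant to avoid.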

Numerical Investigations in Choosing the Number of Principal Components in Principal Component Regression - CASE II

  • Shin, Jae-Kyoung;Moon, Sung-Ho
    • Journal of the Korean Data and Information Science Society / v.10 no.1 / pp.163-172 / 1999
  • We propose a cross-validatory method for choosing the number of principal components in principal component regression based on the magnitudes of their correlations with y. There are two ways to order principal components for selection: by eigenvalue (Shin and Moon, 1997) and by correlation with y. We apply our method to various data sets and compare the results of the two methods.
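
The two orderings contrasted in the abstract above can be computed as in the following sketch. It covers only the orderings, not the cross-validatory choice of how many components to keep, and it assumes the centered data matrix has full column rank. The function name `component_orderings` is hypothetical.

```python
import numpy as np

def component_orderings(X, y):
    """Order principal components by eigenvalue and by |correlation with y|."""
    Xc = X - X.mean(axis=0)
    yc = y - y.mean()
    _, s, Vt = np.linalg.svd(Xc, full_matrices=False)
    Z = Xc @ Vt.T                                  # scores; column j has norm s[j]
    corr = (Z.T @ yc) / (s * np.linalg.norm(yc))   # correlation of each score with y
    by_eigenvalue = np.arange(len(s))              # SVD already sorts by variance
    by_correlation = np.argsort(-np.abs(corr))
    return by_eigenvalue, by_correlation, corr

# Orthogonal toy columns: the response follows the smallest-variance component.
X_demo = np.array([[3, 2, 1], [3, 2, -1], [3, -2, 1], [3, -2, -1],
                   [-3, 2, 1], [-3, 2, -1], [-3, -2, 1], [-3, -2, -1]],
                  dtype=float)
y_demo = 5.0 + 2.0 * X_demo[:, 2]
eig_order, corr_order, corr_demo = component_orderings(X_demo, y_demo)
```

Here the component ranked last by eigenvalue is ranked first by correlation with y, which is the situation where the two orderings lead to different models.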

Procedure for the Selection of Principal Components in Principal Components Regression (주성분회귀분석에서 주성분선정을 위한 새로운 방법)

  • Kim, Bu-Yong;Shin, Myung-Hee
    • The Korean Journal of Applied Statistics / v.23 no.5 / pp.967-975 / 2010
  • Since least squares estimation is not appropriate when multicollinearity exists among the regressors of the linear regression model, principal components regression is used to deal with the multicollinearity problem. This article suggests a new procedure for selecting suitable principal components. The procedure is based on the condition index instead of the eigenvalue. Principal components whose condition indices are larger than the upper limit of the cutoff value are removed from the model, while those whose condition indices are smaller than the lower limit are included. The forward inclusion method is employed to select among principal components whose condition indices lie between the upper and lower limits. The limits are obtained from a linear model constructed on the basis of conjoint analysis. The procedure is evaluated by Monte Carlo simulation in terms of the mean squared error of the estimator. The simulation results indicate that the proposed procedure is superior to the existing methods.
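
The three-way rule described above can be sketched as follows. The condition index is taken here as the square root of the ratio of the largest eigenvalue of the correlation matrix to each eigenvalue, which is one common definition and an assumption about the paper's usage. The cutoff values in the example are placeholders, since the paper derives its limits from a conjoint-analysis model, and the forward inclusion step itself is not implemented. The function names are hypothetical.

```python
import numpy as np

def condition_indices(X):
    """Condition index of each principal component of the correlation matrix."""
    R = np.corrcoef(X, rowvar=False)
    eigvals = np.linalg.eigvalsh(R)[::-1]       # descending order
    eigvals = np.clip(eigvals, 1e-12, None)     # guard against zero or negative round-off
    return np.sqrt(eigvals[0] / eigvals)

def classify_components(indices, lower, upper):
    """Label each component as removed, included, or left to forward inclusion."""
    labels = []
    for ci in indices:
        if ci > upper:
            labels.append("remove")
        elif ci < lower:
            labels.append("include")
        else:
            labels.append("forward-selection candidate")
    return labels

# Placeholder cutoffs (10 and 30), not the limits derived in the paper.
labels_demo = classify_components([1.0, 15.0, 40.0], lower=10.0, upper=30.0)

# Uncorrelated columns give condition indices of 1 for every component.
X_demo = np.array([[3, 2, 1], [3, 2, -1], [3, -2, 1], [3, -2, -1],
                   [-3, 2, 1], [-3, 2, -1], [-3, -2, 1], [-3, -2, -1]],
                  dtype=float)
ci_demo = condition_indices(X_demo)
```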

Genetic Diversity of Soybean Pod Shape Based on Elliptic Fourier Descriptors

  • Truong Ngon T.;Gwag Jae-Gyun;Park Yong-Jin;Lee Suk-Ha
    • Korean Journal of Crop Science / v.50 no.1 / pp.60-66 / 2005
  • Pod shape of twenty soybean (Glycine max L. Merrill) genotypes was evaluated quantitatively by image analysis using elliptic Fourier descriptors and their principal components. The closed contour of each pod projection was extracted, and 80 elliptic Fourier coefficients were calculated for each contour. The Fourier coefficients were standardized so that they were invariant to size, rotation, shift, and chain code starting point. Then, the principal components of the standardized Fourier coefficients were evaluated. The cumulative contribution at the fifth principal component was higher than 95%, indicating that the first, second, third, fourth, and fifth principal components represented the aspect ratio of the pod, the location of the pod centroid, the sharpness of the two pod tips and the roundness of the base in the pod contour, respectively. Analysis of variance revealed significant genotypic differences in these principal components and in seed number per pod. As the principal components for pod shape varied continuously, pod shape might be controlled by polygenes. It was concluded that principal component scores based on elliptic Fourier descriptors appeared to be useful quantitative parameters not only for evaluating soybean pod shape in a soybean breeding program but also for describing pod shape when evaluating soybean germplasm.

Observation on the shape of the neck -by principal component analysis of the measurements- (피복 구성을 위한 경부 형태의 관찰)

  • 이연순
    • Journal of the Ergonomics Society of Korea / v.10 no.2 / pp.31-42 / 1991
  • To understand the shape of the neck from the viewpoint of garment planning, principal component analysis was applied to measurements of the neck. The neck surface development and the cross sections of the neck were also observed. The materials consist of the body measurements, the neck surface developments, and the neck cross sections of a total of 108 Korean female students. The difference between the right side and the left side of the neck was not recognized, but differences among the heights of the front neck point, the side neck point, and the back neck point were recognized. 2. The initial 41 items were found to have variety and duplication, so two criteria were established to solve those problems, and 34 items were selected by each criterion. 3. The 43 and 34 items were compared by means of the cumulative contribution ratios and the clarity of meaning of the principal components. As a result, the 34 measurement items were used for further analysis. 4. Principal component analysis of the 34 items yielded four principal components, which were interpreted as 1) the thickness of the neck, 2) the front neck-line on the waist basic pattern, 3) the shape of the neck surface development, and 4) the back neck-line on the waist basic pattern. 5. According to the graphic information concerning these principal components, the meaning of the four principal components was grasped visually. As a result, there is a large individual difference in the shape of the neck.

Assessment and Classification of Korean Indigenous Corn Lines by Application of Principal Component Analysis (주성분분석에 의한 재래종 옥수수의 해석)

  • 이인섭;박종옥
    • Journal of Life Science / v.13 no.3 / pp.343-348 / 2003
  • This study was conducted to obtain basic information on Korean local corn lines collected from Busan City and Kyungnam Province. A total of 49 lines were selected and assessed by principal component analysis. In the principal component analysis of 7 characteristics, 67.4% and 86.3% of the total variation could be explained by the first two and first four principal components, respectively. The contribution of the characteristics was high for the leading principal components and low for the later ones. The biological meaning of each principal component and the corresponding plant types were explained clearly by the correlation coefficients between the principal components and the characteristics. The first principal component appeared to correspond to the size of the plant and ear and the duration of the vegetative growing period. The second principal component appeared to correspond to the number of ears and tillers. The meaning of the third and fourth principal components, however, was not clear.
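
Figures such as the share of total variation explained by the first two or first four components are cumulative contribution ratios, which can be computed as in the sketch below. Whether the study used the covariance or the correlation matrix is not stated in the abstract, so the `standardize` flag is an assumption, and the function names are hypothetical.

```python
import numpy as np

def cumulative_contribution(X, standardize=False):
    """Cumulative share of total variance explained by the leading components."""
    if standardize:
        M = np.corrcoef(X, rowvar=False)   # correlation-matrix PCA
    else:
        M = np.cov(X, rowvar=False)        # covariance-matrix PCA
    eigvals = np.linalg.eigvalsh(M)[::-1]  # descending order
    return np.cumsum(eigvals) / eigvals.sum()

def components_needed(X, threshold, standardize=False):
    """Smallest number of components whose cumulative share reaches threshold."""
    cum = cumulative_contribution(X, standardize)
    # Cap at the number of components in case of floating-point round-off.
    return min(int(np.searchsorted(cum, threshold)) + 1, len(cum))

# Toy data with column variances proportional to 72, 32 and 8.
X_demo = np.array([[3, 2, 1], [3, 2, -1], [3, -2, 1], [3, -2, -1],
                   [-3, 2, 1], [-3, 2, -1], [-3, -2, 1], [-3, -2, -1]],
                  dtype=float)
cum_demo = cumulative_contribution(X_demo)
k_demo = components_needed(X_demo, threshold=0.9)
```

With measurements on different scales, as is typical for plant characteristics, setting `standardize=True` keeps the variable with the largest units from dominating the first component.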

Application of varimax rotated principal component analysis in quantifying some zoometrical traits of a relict cow

  • Pares-Casanova, P.M.;Sinfreu, I.;Villalba, D.
    • Korean Journal of Veterinary Research / v.53 no.1 / pp.7-10 / 2013
  • A study was conducted to determine the interdependence among the conformation traits of 28 "Pallaresa" cows using principal component analysis. Originally, 21 linear body measurements were obtained, from which eight traits were subsequently eliminated. From the principal components analysis, with raw varimax rotation of the transformation matrix, two principal components were extracted, which accounted for 65.8% of the total variance. The first principal component alone explained 51.6% of the variation and tended to describe general size, while the second principal component loaded mainly on back-sternal diameter. The two extracted principal components, which are related to dorsal heights and back-sternal diameter, could be considered in selection programs.