• Title/Summary/Keyword: principal components analysis

Search Result 757, Processing Time 0.03 seconds

AN EFFICIENT ALGORITHM FOR SLIDING WINDOW BASED INCREMENTAL PRINCIPAL COMPONENTS ANALYSIS

  • Lee, Geunseop
    • Journal of the Korean Mathematical Society
    • /
    • v.57 no.2
    • /
    • pp.401-414
    • /
    • 2020
  • It is computationally expensive to compute principal components from scratch at every update or downdate when new data arrive and existing data are truncated from the data matrix frequently. To overcome this limitations, incremental principal component analysis is considered. Specifically, we present a sliding window based efficient incremental principal component computation from a covariance matrix which comprises of two procedures; simultaneous update and downdate of principal components, followed by the rank-one matrix update. Additionally we track the accurate decomposition error and the adaptive numerical rank. Experiments show that the proposed algorithm enables a faster execution speed and no-meaningful decomposition error differences compared to typical incremental principal component analysis algorithms, thereby maintaining a good approximation for the principal components.

A New Deletion Criterion of Principal Components Regression with Orientations of the Parameters

  • Lee, Won-Woo
    • Journal of the Korean Statistical Society
    • /
    • v.16 no.2
    • /
    • pp.55-70
    • /
    • 1987
  • The principal components regression is one of the substitues for least squares method when there exists multicollinearity in the multiple linear regression model. It is observed graphically that the performance of the principal components regression is strongly dependent upon the values of the parameters. Accordingly, a new deletion criterion which determines proper principal components to be deleted from the analysis is developed and its usefulness is checked by simulations.

  • PDF

Genetic Diversity of Soybean Pod Shape Based on Elliptic Fourier Descriptors

  • Truong Ngon T.;Gwag Jae-Gyun;Park Yong-Jin;Lee Suk-Ha
    • KOREAN JOURNAL OF CROP SCIENCE
    • /
    • v.50 no.1
    • /
    • pp.60-66
    • /
    • 2005
  • Pod shape of twenty soybean (Glycine max L. Merrill) genotypes was evaluated quantitatively by image analysis using elliptic Fourier descriptors and their principal components. The closed contour of each pod projection was extracted, and 80 elliptic Fourier coefficients were calculated for each contour. The Fourier coefficients were standardized so that they were invariant of size, rotation, shift, and chain code starting point. Then, the principal components on the standardized Fourier coefficients were evaluated. The cumulative contribution at the fifth principal component was higher than $95\%$, indicating that the first, second, third, fourth, and fifth principal components represented the aspect ratio of the pod, the location of the pod centroid, the sharpness of the two pod tips and the roundness of the base in the pod contour, respectively. Analysis of variance revealed significant genotypic differences in these principal components and seed number per pod. As the principal components for pod shape varied continuously, pod shape might be controlled by polygenes. It was concluded that principal component scores based on elliptic Fourier descriptors yield seemed to be useful in quantitative parameters not only for evaluating soybean pod shape in a soybean breeding program but also for describing pod shape for evaluating soybean germplasm.

Application of varimax rotated principal component analysis in quantifying some zoometrical traits of a relict cow

  • Pares-Casanova, P.M.;Sinfreu, I.;Villalba, D.
    • Korean Journal of Veterinary Research
    • /
    • v.53 no.1
    • /
    • pp.7-10
    • /
    • 2013
  • A study was conducted to determine the interdependence among the conformation traits of 28 "Pallaresa" cows using principal component analysis. Originally 21 body linear measurements were obtained, from which eight traits are subsequently eliminated. From the principal components analysis, with raw varimax rotation of the transformation matrix, two principal components were extracted, which accounted for 65.8% of the total variance. The first principal component alone explained 51.6% of the variation, and tended to describe general size, while the second principal component had its loadings for back-sternal diameter. The two extracted principal components, which are traits related to dorsal heights and back-sternal diameter, could be considered in selection programs.

Assessment and Classification of Korean Indigenous Corn Lines by Application of Principal Component Analysis (주성분분석에 의한 재래종 옥수수의 해석)

  • 이인섭;박종옥
    • Journal of Life Science
    • /
    • v.13 no.3
    • /
    • pp.343-348
    • /
    • 2003
  • This study was conducted to get basic information on the Korean local corn line collected from Busan City and Kyungnam Province, a total of 49 lines were selected and assessed by the principal component analysis method. In the result of principal component analysis for 7 characteristics, 67.4% and 86.3% of total variation could be appreciated by the first two and first four principal components, respectively. Contribution of characteristics to principal component was high at upper principal components and low at lower principal components. Biological meaning of principal component and plant types corresponding to the each principal component were explained clearly by the correlation coefficient between principal component and characteristics. The first principal component appeared to correspond to the size of plant and ear, and the duration of vegetative growing period. The second principal component appeared to correspond to the number of ear and tiller. But the meaning of the third and fourth principal components were not clear.

Observation on the shape of the neck -by principal component analysis of the mesurements- (피복 구성을 위한 경부 형태의 관찰)

  • 이연순
    • Journal of the Ergonomics Society of Korea
    • /
    • v.10 no.2
    • /
    • pp.31-42
    • /
    • 1991
  • To understand the shape of the neck in a view of garment planning, principal component analysis has been appliedto the measurement of the neck. The neck surface development and the cross sections of the neck have been observed. The materials consist of the body mearsurements, the neck surface developments and the cross sec- tions of the necks of a total of 108 korean woman students. The difference between the right side and the left side of the neck has not been reconginiged. But the differenece among the height of the front neck point, that of the side neck point and that of the back neck point has been recognized. 2. The initial 41 items have been found having variety and duplication. So two criteria have been made to solve those problems and the selection of 34 items have been made by each criterion. 3. 43 and 34 items have been compared by means of accumulative ratios of contribution and of clearness within the meaning of principal component. As a result, 34 measurement items have been further anylysis. 4. As a result of principal component analysis on the 34 items, the four principal components have been found obtaines and inter-preted. The four principal components are 1) the thick of the neck, 2) the front neck-line on the waist basic pattern, basic pattern, 3) the shape of the neck surface development, and 4) the back neck-line on the waist basic pattern. 5. According to the graphic informations concerning these principal components, the meaning of these four principal components has been grasped on the visual. As a result, there is a large individual difference in the shape of neck.

  • PDF

A Taxonomy of Korean Isopyroideae (Ranunculaceae)

  • Lee, Nam-Sook;Yeau, Sung-Hee
    • Animal cells and systems
    • /
    • v.2 no.4
    • /
    • pp.439-449
    • /
    • 1998
  • To discuss the taxonomic dispositions of Korean Isopyroideae (Ranunculaceae) taxa, principal components analysis and cluster analysis were performed using quantitative and qualitative morphological characters. The principal components analysis revealed that the size and number of ovule, ovary width, ratio of style length/ovary length, filament length, sepal size, style length, leaf size, and ovary length are important characters to distinguish Korean Isopyroideae taxa. The cluster and principal components analyses based on both quantitative and quantitative characters demonstrate that lsopyrum mandshuricum is more closely related to Enemion raddeanum than to Semiaquilegia adoxoides. Even though Enemion s not separated from Isopyrum by uantitative characters, they are distinguished by qualitative characters, suggesting that our taxa, Enemion, Semiaquilegia, Isopyrum and Aquilegia, should be recognized in Korean Isopyroideae. In addition, cluster analyses suggest that S. adoxoides could be separated from Aquilegia buergeriana var, oxysepala.

  • PDF

Simple principal component analysis using Lasso (라소를 이용한 간편한 주성분분석)

  • Park, Cheolyong
    • Journal of the Korean Data and Information Science Society
    • /
    • v.24 no.3
    • /
    • pp.533-541
    • /
    • 2013
  • In this study, a simple principal component analysis using Lasso is proposed. This method consists of two steps. The first step is to compute principal components by the principal component analysis. The second step is to regress each principal component on the original data matrix by Lasso regression method. Each of new principal components is computed as the linear combination of original data matrix using the scaled estimated Lasso regression coefficient as the coefficients of the combination. This method leads to easily interpretable principal components with more 0 coefficients by the properties of Lasso regression models. This is because the estimator of the regression of each principal component on the original data matrix is the corresponding eigenvector. This method is applied to real and simulated data sets with the help of an R package for Lasso regression and its usefulness is demonstrated.

New EM algorithm for Principal Component Analysis (주성분 분석을 위한 새로운 EM 알고리듬)

  • 안종훈;오종훈
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2001.04b
    • /
    • pp.529-531
    • /
    • 2001
  • We present an expectation-maximization algorithm for principal component analysis via orthogonalization. The algorithm finds actual principal components, whereas previously proposed EM algorithms can only find principal subspace. New algorithm is simple and more efficient thant probabilistic PCA specially in noiseless cases. Conventional PCA needs computation of inverse of the covariance matrices, which makes the algorithm prohibitively expensive when the dimensions of data space is large. This EM algorithm is very powerful for high dimensional data when only a few principal components are needed.

  • PDF

Procedure for the Selection of Principal Components in Principal Components Regression (주성분회귀분석에서 주성분선정을 위한 새로운 방법)

  • Kim, Bu-Yong;Shin, Myung-Hee
    • The Korean Journal of Applied Statistics
    • /
    • v.23 no.5
    • /
    • pp.967-975
    • /
    • 2010
  • Since the least squares estimation is not appropriate when multicollinearity exists among the regressors of the linear regression model, the principal components regression is used to deal with the multicollinearity problem. This article suggests a new procedure for the selection of suitable principal components. The procedure is based on the condition index instead of the eigenvalue. The principal components corresponding to the indices are removed from the model if any condition indices are larger than the upper limit of the cutoff value. On the other hand, the corresponding principal components are included if any condition indices are smaller than the lower limit. The forward inclusion method is employed to select proper principal components if any condition indices are between the upper limit and the lower limit. The limits are obtained from the linear model which is constructed on the basis of the conjoint analysis. The procedure is evaluated by Monte Carlo simulation in terms of the mean square error of estimator. The simulation results indicate that the proposed procedure is superior to the existing methods.