• Title/Summary/Keyword: Principal component analyses

Search Result 169, Processing Time 0.033 seconds

Morphological Analyses of 15 Species of Bradybaenidae in Korea (한국산 달팽이과 15종의 외부형태 분석)

  • 이준상;권오길
    • The Korean Journal of Malacology
    • /
    • v.9 no.2
    • /
    • pp.44-56
    • /
    • 1993
  • These studies were conducted to compare the external morphologies of 15 species of land snails in 8 genera of family Bradybaenidae in Korea. Principal component analyses of 15 species can be divided into five groups based on the morphological characters and the result of cluster analyses were in accord with the principal component analyses. The morphological intraspecific similarity of Nesiohelix samarangae of Mitan and Cheju population(S=0.999) proved to be highest. The degree of interspecific similarity between Aegista(Plectotropis)divesa and Aegista(plectotropis)quelpartensis(S=0.970)were more similar than the other interspecies.

  • PDF

Resistant Singular Value Decomposition and Its Statistical Applications

  • Park, Yong-Seok;Huh, Myung-Hoe
    • Journal of the Korean Statistical Society
    • /
    • v.25 no.1
    • /
    • pp.49-66
    • /
    • 1996
  • The singular value decomposition is one of the most useful methods in the area of matrix computation. It gives dimension reduction which is the centeral idea in many multivariate analyses. But this method is not resistant, i.e., it is very sensitive to small changes in the input data. In this article, we derive the resistant version of singular value decomposition for principal component analysis. And we give its statistical applications to biplot which is similar to principal component analysis in aspects of the dimension reduction of an n x p data matrix. Therefore, we derive the resistant principal component analysis and biplot based on the resistant singular value decomposition. They provide graphical multivariate data analyses relatively little influenced by outlying observations.

  • PDF

Analysis for Soil Pollution by Heavy Metals in the Area of Kyongbuk (경북지역 토양의 중금속 분석)

  • Dho, Hyon-Seung;Kim, Sung-Duk;Lee, Seung-Joo
    • Journal of the Korea Safety Management & Science
    • /
    • v.12 no.2
    • /
    • pp.231-236
    • /
    • 2010
  • The investigation was initiated with data from 27 abandoned mines along with 12 locations in Kyongbuk abandoned mines. The analyses for soil pollution by heavy metal pollutants were conducted by using correlation analysis, cluster analysis, and principal component analysis. The correlation analysis indicated that Ni and pH were highly correlated compared to those of other heavy metal ions. The principal component analyses showed that the heavy metal ions might be classified into two catagories, such as antropogenic and lithogenic components. The cluster analysis was also clearly divided by two groups. The respective two groups might be Pb-Zn-Cd-Cu and As-Hg-Ni.

Water Demand Forecasting by Characteristics of City Using Principal Component and Cluster Analyses

  • Choi, Tae-Ho;Kwon, O-Eun;Koo, Ja-Yong
    • Environmental Engineering Research
    • /
    • v.15 no.3
    • /
    • pp.135-140
    • /
    • 2010
  • With the various urban characteristics of each city, the existing water demand prediction, which uses average liter per capita day, cannot be used to achieve an accurate prediction as it fails to consider several variables. Thus, this study considered social and industrial factors of 164 local cities, in addition to population and other directly influential factors, and used main substance and cluster analyses to develop a more efficient water demand prediction model that considers unique localities of each city. After clustering, a multiple regression model was developed that proved that the $R^2$ value of the inclusive multiple regression model was 0.59; whereas, those of Clusters A and B were 0.62 and 0.74, respectively. Thus, the multiple regression model was considered more reasonable and valid than the inclusive multiple regression model. In summary, the water demand prediction model using principal component and cluster analyses as the standards to classify localities has a better modification coefficient than that of the inclusive multiple regression model, which does not consider localities.

Hierarchically penalized sparse principal component analysis (계층적 벌점함수를 이용한 주성분분석)

  • Kang, Jongkyeong;Park, Jaeshin;Bang, Sungwan
    • The Korean Journal of Applied Statistics
    • /
    • v.30 no.1
    • /
    • pp.135-145
    • /
    • 2017
  • Principal component analysis (PCA) describes the variation of multivariate data in terms of a set of uncorrelated variables. Since each principal component is a linear combination of all variables and the loadings are typically non-zero, it is difficult to interpret the derived principal components. Sparse principal component analysis (SPCA) is a specialized technique using the elastic net penalty function to produce sparse loadings in principal component analysis. When data are structured by groups of variables, it is desirable to select variables in a grouped manner. In this paper, we propose a new PCA method to improve variable selection performance when variables are grouped, which not only selects important groups but also removes unimportant variables within identified groups. To incorporate group information into model fitting, we consider a hierarchical lasso penalty instead of the elastic net penalty in SPCA. Real data analyses demonstrate the performance and usefulness of the proposed method.

Results of Discriminant Analysis with Respect to Cluster Analyses Under Dimensional Reduction

  • Chae, Seong-San
    • Communications for Statistical Applications and Methods
    • /
    • v.9 no.2
    • /
    • pp.543-553
    • /
    • 2002
  • Principal component analysis is applied to reduce p-dimensions into q-dimensions ( $q {\leq} p$). Any partition of a collection of data points with p and q variables generated by the application of six hierarchical clustering methods is re-classified by discriminant analysis. From the application of discriminant analysis through each hierarchical clustering method, correct classification ratios are obtained. The results illustrate which method is more reasonable in exploratory data analysis.

Real-time Face Detection based on PCA and LDA (PCA와 LDA를 이용한 실시간 얼굴 검출)

  • 홍은혜;고병철;변혜란
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2002.10d
    • /
    • pp.538-540
    • /
    • 2002
  • 본 논문에서는 실시간 카메라 입력 영상에 적합한 얼굴 검출을 위해 다양한 외부적 환경에 덜 민감한 새로운 알고리즘을 제안한다. 빛이나 조명의 영향에 의한 오류를 방지하기 위해 전처리 과정을 포함시키고 형판 정합방법의 단점을 개선하기 위해 얼굴 인식에서 주로 쓰이는 방법인 주성분 분석(PCA :Principal Component Analyses) 변환을 적용하고. 생성된 주성분(Principal Component)을 선형 판별 분석(LDA: Linear Discriminant Analysis)의 입력으로 사용하는 방법을 통해 얼굴을 검출하도록 하였다. 실험을 위해 실제 환경과 같은 6개 카테고리의 동영상을 중심으로 실험한 결과, 본 논문에서 제안하는 방법이 기존의 PCA만을 이용한 방법보다 좋은 성능을 보여줌을 알 수 있었다.

  • PDF

Principal Component and Multiple Regression Analysis for Steel Fiber Reinforced Concrete (SFRC) Beams

  • Islam, Mohammad S.;Alam, Shahria
    • International Journal of Concrete Structures and Materials
    • /
    • v.7 no.4
    • /
    • pp.303-317
    • /
    • 2013
  • This study evaluates the shear strength of steel fiber reinforced concrete (SFRC) beams from a database, which consists of extensive experimental results of 222 SFRC beams having no stirrups. In order to predict the analytical shear strength of the SFRC beams more precisely, the selected beams were sorted into six different groups based on their ultimate concrete strength (low strength with $f_c^{\prime}$ <50 MPa and high strength with $f_c^{\prime}$ <50 MPa), span-depth ratio (shallow beam with $a/d{\geq}2.5 $and deep beam with a/d<2.5) and steel fiber shape (plain, crimped and hooked). Principal component and multiple regression analyses were performed to determine the most feasible model in predicting the shear strength of SFRC beams. A variety of statistical analyses were conducted, and compared with those of the existing equations in estimating the shear strength of SFRC beams. The results showed that the recommended empirical equations were best suited to assess the shear strength of SFRC beams more accurately as compared to those obtained by the previously developed models.

Analyses of Power Consumption of the Heat Pump Dryer in the Automobile Drying Process by using the Principal Component Analysis and Multiple Regression (주성분 분석과 다중회귀모형을 사용한 자동차 건조 공정의 히트펌프 건조기 소모 전력 분석)

  • Lee, Chang-Yong;Song, Gensoo;Kim, Jinho
    • Journal of Korean Society of Industrial and Systems Engineering
    • /
    • v.38 no.1
    • /
    • pp.143-151
    • /
    • 2015
  • In this paper, we investigate how the power consumption of a heat pump dryer depends on various factors in the drying process by analyzing variables that affect the power consumption. Since there are in general many variables that affect the power consumption, for a feasible analysis, we utilize the principal component analysis to reduce the number of variables (or dimensionality) to two or three. We find that the first component is correlated positively to the entrance temperature of various devices such as compressor, expander, evaporator, and the second, negatively to condenser. We then model the power consumption as a multiple regression with two and/or three transformed variables of the selected principal components. We find that fitted value from the multiple regression explains 80~90% of the observed value of the power consumption. This results can be applied to a more elaborate control of the power consumption in the heat pump dryer.

Sensor array optimization techniques for exhaled breath analysis to discriminate diabetics using an electronic nose

  • Jeon, Jin-Young;Choi, Jang-Sik;Yu, Joon-Boo;Lee, Hae-Ryong;Jang, Byoung Kuk;Byun, Hyung-Gi
    • ETRI Journal
    • /
    • v.40 no.6
    • /
    • pp.802-812
    • /
    • 2018
  • Disease discrimination using an electronic nose is achieved by measuring the presence of a specific gas contained in the exhaled breath of patients. Many studies have reported the presence of acetone in the breath of diabetic patients. These studies suggest that acetone can be used as a biomarker of diabetes, enabling diagnoses to be made by measuring acetone levels in exhaled breath. In this study, we perform a chemical sensor array optimization to improve the performance of an electronic nose system using Wilks' lambda, sensor selection based on a principal component (B4), and a stepwise elimination (SE) technique to detect the presence of acetone gas in human breath. By applying five different temperatures to four sensors fabricated from different synthetic materials, a total of 20 sensing combinations are created, and three sensing combinations are selected for the sensor array using optimization techniques. The measurements and analyses of the exhaled breath using the electronic nose system together with the optimized sensor array show that diabetic patients and control groups can be easily differentiated. The results are confirmed using principal component analysis (PCA).