• Title/Summary/Keyword: Latent Variable Model

Search Result 125, Processing Time 0.033 seconds

Gene Expression Pattern Analysis via Latent Variable Models Coupled with Topographic Clustering

  • Chang, Jeong-Ho;Chi, Sung Wook;Zhang, Byoung Tak
    • Genomics & Informatics
    • /
    • v.1 no.1
    • /
    • pp.32-39
    • /
    • 2003
  • We present a latent variable model-based approach to the analysis of gene expression patterns, coupled with topographic clustering. Aspect model, a latent variable model for dyadic data, is applied to extract latent patterns underlying complex variations of gene expression levels. Then a topographic clustering is performed to find coherent groups of genes, based on the extracted latent patterns as well as individual gene expression behaviors. Applied to cell cycle­regulated genes of the yeast Saccharomyces cerevisiae, the proposed method could discover biologically meaningful patterns related with characteristic expression behavior in particular cell cycle phases. In addition, the display of the variation in the composition of these latent patterns on the cluster map provided more facilitated interpretation of the resulting cluster structure. From this, we argue that latent variable models, coupled with topographic clustering, are a promising tool for explorative analysis of gene expression data.

Asymptotic Test for Dimensionality in Probabilistic Principal Component Analysis with Missing Values

  • Park, Chong-sun
    • Communications for Statistical Applications and Methods
    • /
    • v.11 no.1
    • /
    • pp.49-58
    • /
    • 2004
  • In this talk we proposed an asymptotic test for dimensionality in the latent variable model for probabilistic principal component analysis with missing values at random. Proposed algorithm is a sequential likelihood ratio test for an appropriate Normal latent variable model for the principal component analysis. Modified EM-algorithm is used to find MLE for the model parameters. Results from simulations and real data sets give us promising evidences that the proposed method is useful in finding necessary number of components in the principal component analysis with missing values at random.

Latent class analysis with multiple latent group variables

  • Lee, Jung Wun;Chung, Hwan
    • Communications for Statistical Applications and Methods
    • /
    • v.24 no.2
    • /
    • pp.173-191
    • /
    • 2017
  • This study develops a new type of latent class analysis (LCA) in order to explain the associations between one latent variable and several other categorical latent variables. Our model postulates that the prevalence of the latent variable of interest is affected by another latent variable composed of other several latent variables. For the parameter estimation, we propose deterministic annealing EM (DAEM) to deal with local maxima problem in the proposed model. We perform simulation study to demonstrate how DAEM can find the set of parameter estimates at the global maximum of the likelihood over the repeated samples. We apply the proposed LCA model in an investigation of the effect of and joint patterns for drug-using behavior to violent behavior among US high school male students using data from the Youth Risk Behavior Surveillance System 2015. Considering the age of male adolescents as a covariate influencing violent behavior, we identified three classes of violent behavior and three classes of drug-using behavior. We also discovered that the prevalence of violent behavior is affected by the type of drug used for drug-using behavior.

A Stagewise Approach to Structural Equation Modeling (구조식 모형에 대한 단계적 접근)

  • Lee, Bora;Park, Changsoon
    • The Korean Journal of Applied Statistics
    • /
    • v.28 no.1
    • /
    • pp.61-74
    • /
    • 2015
  • Structural equation modeling (SEM) is a widely used in social sciences such as education, business administration, and psychology. In SEM, the latent variable score is the estimate of the latent variable which cannot be observed directly. This study uses stagewise structural equation modeling(stagewise SEM; SSEM) by partitioning the whole model into several stages. The traditional estimation method minimizes the discrepancy function using the variance-covariance of all observed variables. This method can lead to inappropriate situations where exogenous latent variables may be affected by endogenous latent variables. The SSEM approach can avoid such situations and reduce the complexity of the whole SEM in estimating parameters.

Bayesian Approach for Determining the Order p in Autoregressive Models

  • Kim, Chansoo;Chung, Younshik
    • Communications for Statistical Applications and Methods
    • /
    • v.8 no.3
    • /
    • pp.777-786
    • /
    • 2001
  • The autoregressive models have been used to describe a wade variety of time series. Then the problem of determining the order in the times series model is very important in data analysis. We consider the Bayesian approach for finding the order of autoregressive(AR) error models using the latent variable which is motivated by Tanner and Wong(1987). The latent variables are combined with the coefficient parameters and the sequential steps are proposed to set up the prior of the latent variables. Markov chain Monte Carlo method(Gibbs sampler and Metropolis-Hasting algorithm) is used in order to overcome the difficulties of Bayesian computations. Three examples including AR(3) error model are presented to illustrate our proposed methodology.

  • PDF

Latent Variable Fit to Interlaboratory Studies

  • Jeon, Gyeongbae
    • Communications for Statistical Applications and Methods
    • /
    • v.7 no.3
    • /
    • pp.885-897
    • /
    • 2000
  • The use of an unweighted mean and of separate tests is part of the current practice for analyzing interlaboratory studies, and we hope to improve on this method. We fit, using maximum likelihood(ML), a rather intricate, multi-parameter measurement model with the material's true value as a latent variable in a situation where quite serviceable regression and ANOVA calculations have already been developed. The model fit leads to both a weighted estimate of he overall mean, and to tests for equality of means, slopes and variances. Maximum likelihood tests for difference among variances poses a challenge in that the likelihood can easily becoem unbounded. Thus the major objective become to provide a useful test of variance equality.

  • PDF

Non-Conservatism of Bonferroni-Adjusted Test

  • Jeon, Cyeong-Bae;Lee, Sung-Duck
    • Communications for Statistical Applications and Methods
    • /
    • v.8 no.1
    • /
    • pp.219-227
    • /
    • 2001
  • Another approach (multi-parameter measurement method) of interlaboratory studies of test methods is presented. When the unrestricted normal likelihood for the fixed latent variable model is unbounded, we propose a me쇙 of restricting the parameter space by formulating realistic alternative hypothesis under which the likelihood is bounded. A simulation study verified the claim of conservatism of level of significance based on assumptions about central chi-square distributed test statistics and on Bonferroni approximations. We showed a randomization approach that furnished empirical significance levels would be better than a Bonferroni adjustment.

  • PDF

An Analysis of Traffic Accident Injury Severity for Elderly Driver on Goyang-Si using Structural Equation Model (구조방정식을 이용한 고령운전자 교통사고 인적 피해 심각도 분석 (고양시를 중심으로))

  • Kim, Soullam;Yun, Duk Geun
    • International Journal of Highway Engineering
    • /
    • v.17 no.3
    • /
    • pp.117-124
    • /
    • 2015
  • PURPOSES : The purpose of this study is to verify traffic accident injury severity factors for elderly drivers and the relative relationship of these factors. METHODS : To verify the complicated relationship among traffic accident injury severity factors, this study employed a structural equation model (SEM). To develop the SEM structure, only the severity of human injuries was considered; moreover, the observed variables were selected through confirmatory factor analysis (CFA). The number of fatalities, serious injuries, moderate injuries, and minor injuries were selected for observed variables of severity. For latent variables, the accident situation, environment, and vehicle and driver factors were respectively defined. Seven observed variables were selected among the latent variables. RESULTS : This study showed that the vehicle and driver factor was the most influential factor for accident severity among the latent factors. For the observed variable, the type of vehicle, type of accident, and status of day or night for each latent variable were the most relative observed variables for the accident severity factor. To verify the validity of the SEM, several model fitting methods, including ${\chi}^2/df$, GFI, AGFI, CFI, and others, were applied, and the model produced meaningful results. CONCLUSIONS : Based on an analysis of results of traffic accident injury severity for elderly drivers, the vehicle and driver factor was the most influential one for injury severity. Therefore, education tailored to elderly drivers is needed to improve driving behavior of elderly driver.

Combined RP/SP Model with Latent Variables (잠재변수를 이용한 RP/SP 결합모형에 관한 연구)

  • Kim, Jin-Hui;Jeong, Jin-Hyeok;Son, Gi-Min
    • Journal of Korean Society of Transportation
    • /
    • v.28 no.4
    • /
    • pp.119-128
    • /
    • 2010
  • Mode choice behavior is associated with travelers' latent behavior that is an unobservable preference to travel behavior or mode characteristics. This paper specifically addresses the problem of unobservable factors, that is latent behavior, in mode choice models. Consideration of latent behavior in mode choice models reduces the errors that come from unobservable factors. In this study, the authors defined the latent variables that mean a quantitative latent behavior factors, and developed the combined RP/SP model with latent variables using the mode choice behavior survey data. The data has traveler's revealed preference of existent modes along the Han River and stated preference of new water transit on the Han River. Also, The data has travelers' latent behavior. Latent variables were defined by factor analysis using the latent behaviour data. In conclusion, it is significant that the relationship between traveler's latent behavior and mode choice behavior. In addition, the goodness-of-fit of the mode choice models with latent variables are better than the model without latent variables.

Analysis of Virus Types by a Latent Variable Model (Latent variable model에 의한 바이러스 유형 분석)

  • Kim Soo-Jin;Joung Je-Gun;Tae Kang Soo;Zhang Byoung-Tak
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2005.11b
    • /
    • pp.262-264
    • /
    • 2005
  • 인유두종 바이러스(Human Papillomavirus: HPV)는 사마귀로부터 생식기 및 배설기의 침윤성 암에 이르기까지 여러 질병과 연관되어 있음이 알려져 있다. 현재 200종 이상이 알려져 있고, 이 중 85개는 전체 유전자가 밝혀져 있다. HPV 감염 시 만들어지는 단백질 중 E6. E7 단백질은 암 억제 유전자(p53, pRb)에 결합하여 세포의 암 억제 기능을 저하시키고 이로 인해 암을 발생시킨다. 본 논문은 암 발생과 밀접한 관련이 있는 HPV의 E6 단백질 서열과 HPV 유형(HPV Type)을 가지고, PLSA (Probabilistic Latent Semantic Analysis) 방법을 이용하여 HPV를 클러스터링(clustering) 해 보았다. 실험 결과, 특정 클러스터는 질병과 밀접하게 연관되어 있으며, 이와 관련된 주요 서열 분석이 가능함을 보여주고 있다.

  • PDF