Detection of Differentially Expressed Genes by Clustering Genes Using Class-Wise Averaged Data in Microarray Data

- Journal title : Communications for Statistical Applications and Methods
- Volume 14, Issue 3, 2007, pp.687-698
- Publisher : The Korean Statistical Society
- DOI : 10.5351/CKSS.2007.14.3.687

Title & Authors

Detection of Differentially Expressed Genes by Clustering Genes Using Class-Wise Averaged Data in Microarray Data

Kim, Seung-Gu;

Kim, Seung-Gu;

Abstract

A normal mixture model with which dependence between classes is incorporated is proposed in order to detect differentially expressed genes. Gene clustering approaches suffer from the high dimensional column of microarray expression data matrix which leads to the over-fit problem. Various methods are proposed to solve the problem. In this paper, use of simple averaging data within each class is proposed to overcome the various problems due to high dimensionality when the normal mixture model is fitted. Some experiments through simulated data set and real data set show its availability in actuality.

Keywords

Class-dependence;differentially expressed gene;microarray;normal mixture model;

Language

Korean

References

1.

김승구 (2007). Use of factor analyzer normal mixture model with mean pattern modeling on clustering genes. 한국통계학회논문집, 13, 113-123

2.

Allison, D. B. Gadbury, G. L., Heo, M., Fercndez, J. R., Lee, C.-K., Prolla, T. A. and Weindruch, R. (2002). A mixture model approach for the analysis of microarray gene expression data. Computational Statistics and Data Analysis, 39, 1-20

3.

Alon, U., Barkai, N., Notterman, D. A., Gish, K., Ybarra, S., Mack, D. and Levine, A. J. (1999). Broad patterns of gene expression revealed by clustering analysis of tomor and normal colon tissues probed by oligonucleotide arrays. In Proceedings of the National Academy of Sciences of the United States of America, 96, 6745-6750

4.

Do, K.-A, Mueller, P. and Tang, F. (2005). A nonparametric Bayesian mixture model for gene expression. Applied Statistics, 54, 1-18

5.

Efron, B. and Tibshirani, R. (2002). Empirical Bayes methods and false discovery rates for microarrays. Genetic Epidemiology, 23, 70-86

6.

Efron, B. (2004). Large-scale simultaneous hypothesis testing: the choice of a null hypothesis. Journal of the American Statistical Association, 99, 96-104

7.

He, Y., Pan, W. and Lin, J. (2006). Cluster analysis using multivariate normal mixture models to detect differential gene expression with microarray data. Computational Statistics and Data Analysis, 51, 641-658

8.

McLachlan, J. L., Peel, D. and Bean, R. W. (2003). Modeling high-dimensional data by mixtures of factor analyzers. Computational Statistics & Data Analysis, 41, 379-388

9.

McLachlan, G. J., Bean, R. W. and Jones, L. B.-T. (2006). A simple implementation of a normal mixture approach to differential gene expression in multiclass microarrays. Bioinformatics, 22, 1608-1615

10.

Tusher, V. G., Tibshirani, R. and Chu, G. (2001). Significance analysis of microarrays applied to the ionizing radiation response. In Proceedings of the National Academy of Sciences of the United States of America, 98, 5116-5121