Identification of Cluster with Composite Mean and Variance
Kim, Seung-Gu
Consider a cluster, called the 'son cluster', whose mean and variance are composed of the means and variances of two other clusters, called the 'father cluster' and the 'mother cluster'. This paper provides a method for identifying each of the three clusters by modeling the relationship of the son cluster to the father and mother clusters. Under a normal mixture model, the parameters are estimated via the EM algorithm, and the difficulties that arise in estimation are overcome by using an ECM approximation. Numerical examples show that the method can effectively identify the three clusters, together called a 'family of clusters'.
Composite cluster; strength of derivation; family of clusters; normal mixture model; EM algorithm
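The abstract does not state the exact form of the composition, so the following is only a minimal sketch under stated assumptions, not the paper's estimator. It assumes one-dimensional data and a son cluster whose mean and variance are convex combinations of the parents' values with a hypothetical weight `lam` (standing in for a 'strength of derivation'). It shows the E-step of a three-component normal mixture under that constraint, and a simple grid search over `lam` with the parent parameters held fixed, as a stand-in for a conditional maximization step. All function and variable names are illustrative.

```python
import numpy as np


def normal_pdf(x, mean, var):
    """Density of a univariate normal distribution."""
    return np.exp(-0.5 * (x - mean) ** 2 / var) / np.sqrt(2.0 * np.pi * var)


def son_params(mu_f, var_f, mu_m, var_m, lam):
    """ASSUMPTION: son mean/variance are convex combinations of the parents'."""
    mu_s = lam * mu_f + (1.0 - lam) * mu_m
    var_s = lam * var_f + (1.0 - lam) * var_m
    return mu_s, var_s


def e_step(x, weights, mu_f, var_f, mu_m, var_m, lam):
    """Return posterior memberships (father, mother, son) and the log-likelihood."""
    mu_s, var_s = son_params(mu_f, var_f, mu_m, var_m, lam)
    dens = np.column_stack([
        weights[0] * normal_pdf(x, mu_f, var_f),
        weights[1] * normal_pdf(x, mu_m, var_m),
        weights[2] * normal_pdf(x, mu_s, var_s),
    ])
    total = dens.sum(axis=1, keepdims=True)
    return dens / total, float(np.log(total).sum())


# Simulated data (illustrative values, not taken from the paper).
rng = np.random.default_rng(0)
mu_f, var_f, mu_m, var_m, lam_true = -4.0, 1.0, 4.0, 1.0, 0.5
mu_s, var_s = son_params(mu_f, var_f, mu_m, var_m, lam_true)
n = 200
x = np.concatenate([
    rng.normal(mu_f, np.sqrt(var_f), n),
    rng.normal(mu_m, np.sqrt(var_m), n),
    rng.normal(mu_s, np.sqrt(var_s), n),
])
labels = np.repeat([0, 1, 2], n)
weights = np.full(3, 1.0 / 3.0)

# E-step at the generating parameters: assign each point to its most likely cluster.
resp, loglik = e_step(x, weights, mu_f, var_f, mu_m, var_m, lam_true)
pred = resp.argmax(axis=1)
accuracy = float((pred == labels).mean())

# Grid search over lam with the parent parameters held fixed
# (a crude substitute for a conditional maximization step).
grid = np.linspace(0.05, 0.95, 19)
logliks = [e_step(x, weights, mu_f, var_f, mu_m, var_m, g)[1] for g in grid]
best_lam = float(grid[int(np.argmax(logliks))])

print("classification accuracy:", accuracy)
print("lam maximizing the log-likelihood on the grid:", best_lam)
```

Because the son's parameters are tied to the parents', a joint M-step generally has no simple closed form under this assumed constraint, which is one reason a conditional, ECM-style update is a natural choice.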
Kim, S.-G. (2007). Normal mixture model with general linear regressive restriction: Applied to microarray gene clustering, Communications of the Korean Statistical Society (한국통계학회논문집), 14, 205-213.
Dempster, A. P., Laird, N. M. and Rubin, D. B. (1977). Maximum likelihood from incomplete data via the EM algorithm (with discussion), Journal of the Royal Statistical Society B, 39, 1-38.
Green, P. J. (1990). On use of the EM algorithm for penalized likelihood estimation, Journal of the Royal Statistical Society B, 52, 443-452.
Levin, A., Lischinski, D. and Weiss, Y. (2008). A closed form solution to natural image matting, IEEE Transactions on Pattern Analysis and Machine Intelligence, 30, 228-242.
McLachlan, G. and Peel, D. (2000). Finite Mixture Models, John Wiley & Sons, Inc.
Meng, X.-L. and Rubin, D. (1993). Maximum likelihood estimation via the ECM algorithm: A general framework, Biometrika, 80, 267-278.
Ng, S. K., McLachlan, G. J., Wang, K., Ben-Tovim Jones, L. and Ng, S. W. (2006). A mixture model with random-effects components for clustering correlated gene-expression profiles, Bioinformatics, 22, 1745-1752.
Schwarz, G. (1978). Estimating the dimension of a model, Annals of Statistics, 6, 461-464.
Titterington, D. M., Smith, A. F. M. and Makov, U. E. (1994). Statistical Analysis of Finite Mixture Distributions, John Wiley & Sons.
Wang, J. and Cohen, M. F. (2005). An interactive optimization approach for unified image segmentation and matting, ICCV 2005, 936-943.
Wang, S. and Zhu, J. (2008). Variable selection for model-based high-dimensional clustering and its application to microarray data, Biometrics, 64, 440-448.