Ordering Variables and Categories on the Mosaic Plot
Lee, Moon-Joo; Huh, Myung-Hoe;
Mosaic plots, proposed by Hartigan and Kleiner (1981, 1984), are very useful for visualizing categorical data. In a mosaic plot, multi-way classified cell frequencies are represented by rectangles whose areas are proportional to the frequencies. The plot is easy to understand while preserving the information contained in the data. The plot's appearance, however, changes substantially depending on the order of the variables and the order of the categories within each variable. In this study, we propose algorithms for ordering the variables and categories of categorical data to be explored via mosaic plots. We demonstrate our methods on three well-known datasets: Titanic, Housing and PreSex.
Mosaic plot; ordering variables; ordering categories; Cramer's V; Gamma coefficient
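The abstract does not spell out the ordering algorithms, so the following is only a minimal sketch of two ingredients it names: tiles whose areas are proportional to cell frequencies, and Cramer's V as a measure of association between two categorical variables. The function names `mosaic_tiles` and `cramers_v` and the example counts are hypothetical, chosen for illustration; they are not taken from the paper, and the table is assumed to have no empty rows or columns.

```python
from math import sqrt


def mosaic_tiles(table):
    """Lay out a two-way table in the unit square.

    Each row category gets a vertical strip whose width is its marginal
    proportion; the strip is then split by the conditional proportions of
    the column categories. Returns {(i, j): (x, y, width, height)}.
    """
    n = sum(sum(row) for row in table)
    tiles = {}
    x = 0.0
    for i, row in enumerate(table):
        row_total = sum(row)
        width = row_total / n          # marginal proportion of row i
        y = 0.0
        for j, count in enumerate(row):
            height = count / row_total  # conditional proportion of column j
            tiles[(i, j)] = (x, y, width, height)
            y += height
        x += width
    return tiles


def cramers_v(table):
    """Cramer's V for a two-way table given as a list of rows of counts."""
    n = sum(sum(row) for row in table)
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed - expected) ** 2 / expected
    k = min(len(row_totals), len(col_totals))
    return sqrt(chi2 / (n * (k - 1)))


# Hypothetical counts, for illustration only.
example = [[30, 10], [20, 40]]
for cell, (x, y, w, h) in mosaic_tiles(example).items():
    print(cell, "area =", round(w * h, 3))
print("Cramer's V =", round(cramers_v(example), 3))
```

Swapping the rows of `example` changes where each tile sits but not its area, which is the sense in which the ordering affects the plot's appearance without changing the information it carries.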
References
Bickel, P. J., Hammel, E. A. and O'Connell, J. W. (1975). Sex bias in graduate admissions: Data from Berkeley, Science, 187, 398-403

Cramer, H. (1946). Mathematical Methods of Statistics, Princeton University Press, Princeton

Friendly, M. (1994). Mosaic displays for multi-way contingency tables, Journal of the American Statistical Association, 89, 190-200

Garson, G. D. (2008). Nominal association: Phi, contingency coefficient, Tschuprow's T, Cramer's V, lambda, uncertainty coefficient, Statnotes: Topics in Multivariate Analysis, Retrieved from 06/25/2008

Goodman, L. A. and Kruskal, W. H. (1979). Measures of Association for Cross Classifications, Springer-Verlag, New York

Greenacre, M. J. (1984). Theory and Applications of Correspondence Analysis, Academic Press, London

Hartigan, J. A. and Kleiner, B. (1981). Mosaics for contingency tables, In Computer Science and Statistics: Proceedings of the 13th Symposium on the Interface, (ed. by W.F. Eddy). New York: Springer-Verlag, 268-273

Hartigan, J. A. and Kleiner, B. (1984). A mosaic of television ratings, The American Statistician, 38, 32-35

Huh, M. Y. (2004). Line mosaic plot: Algorithm and implementation, COMPSTAT, 2004 Symposium, Physica-Verlag/Springer

Hurley, C. B. (2004). Clustering visualizations of multidimensional data, Journal of Computational & Graphical Statistics, 13, 788-806

Madsen, M. (1976). Statistical analysis of multiple contingency tables: Two examples, Scandinavian Journal of Statistics, 3, 97-106

Thornes, B. and Collard, J. (1979). Who Divorces?, Routledge and Kegan Paul, London

van der Heijden, P. G. M. and de Leeuw, J. (1985). Correspondence analysis used complementary to log-linear analysis, Psychometrika, 50, 429-447