Correspondence analysis and 2-way clustering
- Ciampi, Antonio 1
- González Marcos, Ana 2
- Castejón Limas, Manuel 2
- 1 McGill University, Montreal, Canada, Department of Epidemiology and Statistics, Canada
-
2
Universidad de León
info
ISSN: 1696-2281
Año de publicación: 2005
Volumen: 29
Número: 1
Páginas: 27-41
Tipo: Artículo
Otras publicaciones en: Sort: Statistics and Operations Research Transactions
Resumen
Correspondence analysis followed by clustering of both rows and columns of a data matrix is proposed as an approach to two-way clustering. The novelty of this contribution consists of: i) proposing a simple method for the selecting of the number of axes; ii) visualizing the data matrix as is done in micro-array analysis; iii) enhancing this representation by emphasizing those variables and those individuals which are 'well represented' in the subspace of the chosen axes. The approach is applied to a 'traditional' clustering problem: the classification of a group of psychiatric patients.
Referencias bibliográficas
- Bengio, Y., Vincent, P., Paiement, J-F., Delalleau, O., Ouimet, M., and Le Roux, N. (2003). ‘Spectral clustering and Kernel PCA are learning eigenfunctions’. Technical Report 1239, DeÌpartement d’Informatique et Recherche OpeÌrationnelle, UniversiteÌ de MontreÌal. http://www.iro.umontreal.ca/∼lisa/publications.html.
- Caussinus, H. & Ruiz-Gazen, A. (1995). Metrics for finding typical structures by means of principal component analysis. In Data Science and its Applications, Y. Escoufier & C. Hayashi (eds)., Tokyo: Academic Press, 177-192.
- Caussinus, H. & Ruiz-Gazen, A. (2003). Which structures do generalized principal component analysis display? The case of multiple correspondence analysis. To appear in Multiple Correspondence Analysis and Related Methods (eds. Michael Greenacre and JoÌrg Blasius), London: Chapman & Hall, 2006.
- Gordon, A.D. (1999). Classification, 2nd Edition, London: Chapman & Hall.
- Greenacre, M.J. (1984). Theory and Application of Correspondence Analysis, London: Academic Press.
- Greenacre, M. (1993). Correspondence Analysis in Practice. London: Academic Press.
- Greenacre, M. (2000). Correspondence analysis of a square symmetric matrix. Applied Statistics, 49, 297- 310.
- Ng, A. Y., Jordan, M. I., and Y. Weiss, Y. (2002). On spectral clustering: Analysis and an algorithm. In Advances in Neural Information Processing Systems (NIPS), T. Dietterich, S. Becker and Z. Ghahramani (eds.), volume 14. Cambridge MA: MIT Press.
- Tibshirani, R., Hastie T., Eisen, M., Ross, D., Botstein, D., and Brown, P. (1999). Clustering methods for the analysis of DNA microarray data. Technical report, Department of Statistics, Stanford University. http://www-stat.stanford.edu/∼tibs/lab/publications.html.