New index for clustering tendency.
- Forina, M. 2
- Lanteri, S. 2
- Esteban Díez, I. 1
-
1
Universidad de La Rioja
info
-
2
University of Genoa
info
ISSN: 0003-2670
Datum der Publikation: 2001
Ausgabe: 446
Nummer: 1-2
Seiten: 59-70
Art: Artikel
Andere Publikationen in: Analytica Chimica Acta
Zusammenfassung
A new index for clustering tendency is described. The index is based on the frequency distribution of the lengths of the edges in the minimum spanning tree connecting the objects, compared with the probability distribution of the lengths of edges of the minimum spanning tree connecting the same number of objects described by variables extracted from the uniform distribution. The here suggested index shows some advantages when compared with the Hopkins original index and with its modification suggested by Fernández Pierna and Massart. It can be used both to detect clusters, to measure the degree of non-uniformity of a data set (as required in many cases of multivariate calibration and QSAR studies), and to detect outliers. © 2001 Elsevier Science B.V. All rights reserved.