Extraction of representative subsets by potential funtion method and genetics algoritms

Pizarro Millán, C.; Forina, M.; Casolino, C.; Leardi, R.

doi:10.1016/S0169-7439(97)00080-4

Extraction of representative subsets by potential funtion method and genetics algoritms

Pizarro Millán, C. ¹
Forina, M. ²
Casolino, C. ²
Leardi, R. ²

1 Universidad de La Rioja

Universidad de La Rioja

Logroño, España

ROR https://ror.org/0553yr311
2 University of Genoa

University of Genoa

Génova, Italia

ROR https://ror.org/0107c5v14

Journal:

Chemometrics and Intelligent Laboratory Systems

ISSN: 0169-7439

Year of publication: 1998

Volume: 40

Issue: 1

Pages: 33-52

Type: Article

beta Ver similares en nube de resultados

DOI: 10.1016/S0169-7439(97)00080-4 SCOPUS: 2-s2.0-0032076969 WoS: WOS:000074360500003 GOOGLE SCHOLAR

More publications in: Chemometrics and Intelligent Laboratory Systems

Institutional repository: Open access Editor

Abstract

Two procedures are suggested to select a representative subset from a large data set. The first is based on the use of the estimate of the multivariate probability density distribution by means of the potential functions technique. The first object selected for the subset is that for which the probability density is larger. Then, the distribution is corrected, by subtraction of the contribution of the selected object multiplied by a selection factor. The second procedure uses genetic algorithms to individuate the subset that reproduces the variance-covariance matrix with the minimum error. Both methods meet the requirement to obtain a representative subset, but the results obtained with the method based on potential functions are generally more satisfactory in the case when the original set is not a random sample from an infinite population, but is the finite population itself. Several examples show how the extraction of a representative subset from a large data set can give some advantages in the use of representation techniques (i.e., eigenvector projection, non-linear maps, Kohonen maps) and in class modelling techniques.

Extraction of representative subsets by potential funtion method and genetics algoritms

Universidad de La Rioja

University of Genoa

Abstract