GA-ACE: Alternating Conditional Expectations regression with selection of significant predictors by Genetic Algorithms.

  1. Esteban-Díez, I. 1
  2. González-Sáiz, J.-M. 1
  3. Pizarro, C. 1
  4. Forina, M. 2
  1. 1 Universidad de La Rioja
    info

    Universidad de La Rioja

    Logroño, España

    ROR https://ror.org/0553yr311

  2. 2 University of Genoa
    info

    University of Genoa

    Génova, Italia

    ROR https://ror.org/0107c5v14

Revista:
Analytica Chimica Acta

ISSN: 0003-2670

Año de publicación: 2006

Volumen: 555

Número: 1

Páginas: 96-106

Tipo: Artículo

DOI: 10.1016/J.ACA.2005.08.053 SCOPUS: 2-s2.0-28844487491 WoS: WOS:000234525500013 GOOGLE SCHOLAR

Otras publicaciones en: Analytica Chimica Acta

Resumen

The non-linear regression technique known as alternating conditional expectations (ACE) method is only applicable when the number of objects available for calibration is considerably greater than the number of considered predictors. Alternating conditional expectations regression with selection of significant predictors by genetic algorithms (GA-ACE), the non-linear regression technique presented here, is based on the ACE algorithm but introducing several modifications to resolve the applicability limitations of the original ACE method, thus facilitating the practical implementation of a very interesting calibration tool. In order to overcome the lack of reliability displayed by the original ACE algorithm when working on data sets characterized by a too large number of variables and prior to the development of the non-linear regression model, GA-ACE applies genetic algorithms as a variable selection technique to select a reduced subset of significant predictors able to accurately model and predict a considered variable response. Furthermore, GA-ACE actually provides two alternative application approaches, since it allows either the performance of prior data compression computing a number of principal components to be subsequently subjected to GA-selection, or working directly on original variables. In this study, GA-ACE was applied to two real calibration problems, with a very low observation/variable ratio (NIR data), and the results were compared with those obtained by several linear regression techniques usually employed. When using the GA-ACE non-linear method, notably improved regression models were developed for the two response variables modeled, with root mean square errors of the residuals in external prediction (RMSEP) equal to 11.51 and 6.03% for moisture and lipid contents of roasted coffee samples, respectively. The improvement achieved by applying the new non-linear method introduced is even more remarkable taking into account the results obtained with the best performance linear method (IPW-PLS) applied to predict the studied responses (14.61 and 7.74% RMSEP, respectively). © 2005 Elsevier B.V. All rights reserved.