This study describes the development of a quantitative structure-property relationship (QSPR) to predict the retention index (I) of volatile and semivolatile compounds identified in Arabica coffee samples from different geographical origin. The analytical method utilized rapid headspace solid-phase microextraction (HSSPME)-gas chromatography-time-of-flight mass spectrometry (GC-TOFMS) data measured in the divinylbenzene/carboxen/polydimethylsiloxane (DVB/CAR/PDMS) fiber. A total of 102 molecules were optimized with the PM6/ZDO level of theory using Gaussian 09 Rev D.01, in order to calculate 3006 molecular descriptors in the alvaDesc software. Initially, the number of descriptors was reduced to 1237 by means of the V-WSP unsupervised variable reduction to be submitted to the model development. The ordinary least squares were coupled to the genetic algorithms supervised variable subset selection to find the best three descriptors for the QSPR model. For model validation, the dataset was split into a training set (70%) and a test set (30%). The quality of the model was evaluated by means of the coefficient of determination (R2) and the root mean square error (RMSE) for the training set (R2train=0.920 and RMSEtrain=71.8) and the test set (R2test=0.897 and RMSEtest=81.5). Other cross validation criteria were used such as leave-one-out (R2loo=0.869 and RMSEloo=91.7), leave-many-out (R2lmo=0.876 and RMSElmo=93.1) and bootstrap (R2loo=0.863 and RMSEloo=94.2). The computational model accomplished the five principles defined by the OECD to be applicable as a tool for the identification of other volatile and semivolatile constituents in other coffee varieties.
Previous Article in event
Previous Article in session
Next Article in event
Next Article in session
QUANTITATIVE STRUCTURE-PROPERTY RELATIONSHIP FOR THE RETENTION INDEX OF VOLATILE AND SEMIVOLATILE COMPOUNDS OF COFFEE
Published:
14 November 2021
by MDPI
in The 25th International Electronic Conference on Synthetic Organic Chemistry
session Computational Chemistry
Abstract:
Keywords: Coffee; GC-TOFMS; Retention index; molecular descriptors, QSPR