Please login first
In silico toxicity prediction of phenol derivatives with ISIDA descriptors using multiple linear regression and machine learning approach
* 1 , 2 , 3, 4, 5 , 6, 7
1  Universidad Regional Amazónica Ikiam, Parroquia Muyuna km 7 vía Alto Tena, 150150, Tena-Napo, Ecuador
2  Facultad de Ciencias de La Ingeniería, Universidad Técnica Estatal de Quevedo, Ecuador
3  Departamento de Química Física Aplicada, Facultad de Ciencias, Universidad Autónoma de Madrid, 28049 Madrid, España
4  Department of Dev. Mathematics, Houston Community College-West Loop Campus, Houston TX, 77081, USA.
5  Department of Mathematics, Lone Star College-CyFair Campus, Houston, TX, 77433, USA
6  Department of Systems and Computer Engineering, Carleton University, Ottawa, ON, Canada
7  Department of Coatings and Polymeric Materials, North Dakota State University, Fargo, ND, 58102, USA.

https://doi.org/10.3390/mol2net-06-09110 (registering DOI)
Abstract:

Phenolic compounds are considered as dangerous pollutants, which produces serious environmental problems by pollution of water streams because of their great water solubility and high toxicity. In this paper we present the modeling of inhibitory grown activity against Tetrahymena pyriformis with structural feature descriptors. Quantitative structure-toxicity relationship model for acute toxicity of phenol derivatives was performed using Multiple Linear Regression (MLR), Reduced Error Pruning Tree (REPTree), M5 Model Rules (M5R), Multilayer Perceptron (MLP), Instance-Based Learning algorithms using K nearest neighbor (IBk-ANN), Support vector machine (SVM), and Radial basis function network (RBF). The correlation coefficients (R2) of training sets and test sets were 0.88 and 0.86 for the best MLR model, 0.82 and 0.72 for the best machine learning model (SVM), respectively. Following to the obtained results, our proposed model may be useful to predict of toxicity and risk assessment of phenol derivatives compound.

Keywords: Phenolic compounds; acute toxicity; Tetrahymena pyriformis; structural feature descriptors; Support vector machine
Top