Metabolic Reaction Networks (MRNs) are complex networks produced by thousands of chemical reactions or transformations (links) of metabolites (nodes) in a live organism. An essential goal of chemical biology is to test the connectivity (structure) of these complex MRNs models presented for new microorganisms with promising features. In theory, we can undertake hands-on testing (Manual Curation). However, due to the large number of possible combinations of node pairs, this is a difficult operation (possible metabolic reactions). We combined Combinatorial, Perturbation Theory, and Machine Learning approaches in this study to find a CPTML model for MRNs >40 organisms compiled by Barabasis' group. First, we used a novel type of node index termed Markov linear indices fk to quantify the local structure of a very large collection of nodes in each MRN. Next, for over 150 000 MRN query and reference node combinations, we computed CPT operators. Finally, we fed these CPT operators into several ML algorithms. The CPTML linear model obtained using the LDA algorithm is capable of distinguishing nodes (metabolites) with correct reaction assignment from nodes with incorrect reaction assignment with accuracy, specificity, and sensitivity values ranging from 85 to 100 % in both the training and external validation data series. Meanwhile, the top three non-linear models with more than 97.5 % accuracy were found to be PTML models based on Bayesian networks, J48-Decision Tree, and Random Forest algorithms. The new work sets the door for the investigation of MRNs from various organisms using PTML models. Finally, the new CPTML could be a useful tool for determining the structure of MRNs in new species in biotechnology.
Previous Article in event
Previous Article in congress
Next Article in event
Next Article in congress
Combinatorial Perturbation-Theory Machine Learning (CPTML) Models for Curation of Metabolic Reaction Networks
Published:
18 October 2021
by MDPI
in MOL2NET'21, Conference on Molecular, Biomed., Comput. & Network Science and Engineering, 7th ed.
congress BIOMODE.ECO-06: Biotech., Mol. Eng., Nat. Prod. Develop. and Ecology Congress, Paris, France-Ohio, USA, 2021.
Abstract:
Keywords: Barabasis' group; Machine learning; Markov linear indices; Metabolic Networks; Perturbation-Theory