The mRNA molecules expressed in cow’s milk are important molecular biomarkers for different physiological and pathological conditions in cattle. The prediction of the quantity that a specific mRNA type could be expressed in cow’s milk is a challenging theoretical task. The current study presents for the first time several different Machine Learning models to predict the mRNA expression using the mRNA secondary structure fragments. This unique methodology is based on a dataset of experimental mRNA expression data. Each mRNA molecule has a specific secondary structure represented as a string that can be used to read all the possible mRNA secondary structure fragments. This information is used as input for the Machine Learning methods from Weka software in order to obtain classification models that can predict low, medium and high expression of new mRNA types in the cow’s milk. The mRNA expression levels have been measured with High Throughput Screening techniques. The initial features included the counting of the mRNA secondary structure fragments for each expressed mRNA. The model features were transformed in frequencies and the expression levels were converted into low and high classes. In order to reduce the high number of possible features, a feature selection method has been applied. Thus, the best classification model was obtained with BayesNet method and is based on 24 features and 4067 cases. The model has the true positive rate for the low mRNA expression class of 0.78 (average true positive rate of 0.66). Further studies are needed improve the current results, using datasets with different feature sets and more advanced Machine Learning methods.
Prediction of mRNA expression in cow’s milk using mRNA secondary structures and Machine Learning classifiers
Published: 28 May 2017 by MDPI AG in Proceedings of MOL2NET 2017, International Conference on Multidisciplinary Sciences, 3rd edition in MOL2NET 2017, International Conference on Multidisciplinary Sciences, 3rd edition session Computational Sciences, Statistics, Artificial Intelligence, Complex Networks, Machine Learning, and Big Data
MDPI AG, 10.3390/mol2net-03-03907
Keywords: classification models, mRNA expression, secondary structure, machine learning methods, Mrna Molecules, Mrna Types