Please login first
Machine Learning Approaches Reveal That the Number of Tests Doesn't Matter to the Prediction of Global Confirmed COVID-19 Cases
1  Professor of Applied Statistics and Data Science, Institute of Statistical Research and Training, University of Dhaka, Dhaka 1000, Bangladesh
Academic Editor: Wataru Takeuchi

Abstract:

The coronavirus disease 2019 (COVID-19) has developed into a global pandemic, affecting every nation and territory in the world. Machine learning-based approaches are useful when trying to understand the complexity behind the spread of the disease and how to contain its spread effectively. The unsupervised learning method could be useful to evaluate the shortcomings of health facilities in areas of increased infection as well as what strategies are necessary to prevent disease spread within or outside of the country. To contribute toward the well-being of society, this paper focuses on the implementation of machine learning techniques for identifying common prevailing public healthcare facilities and concerns related to COVID-19 as well as attitudes to infection prevention strategies held by people from different countries concerning the current pandemic situation. Regression tree, random forest, cluster analysis, and principal component machine learning techniques are used to analyze the global COVID-19 data of 133 countries obtained from the Worldometer website as of April 17, 2020. The analysis revealed that there are four major clusters among the countries. Eight countries have the highest cumulative infected cases and deaths, forming the first cluster. Seven countries, the United States, Spain, Italy, France, Germany, the United Kingdom, and Iran, play a vital role in explaining the 60% variation of the total variations by us of the first component characterized by all variables except for the rate variables. The remaining countries explain only 20% of the variation of the total variation by use of the second component characterized by only rate variables. Most strikingly, the analysis found that the variable number of tests by the country did not play a vital role in the prediction of the cumulative number of confirmed cases.

Keywords: Covid-19; Machine learning; Prediction
Comments on this paper



 
 
Top