TY - JOUR
T1 - Forecasting and classification of new cases of COVID 19 before vaccination using decision trees and Gaussian mixture model
AU - Hamdi, Monia
AU - Hilali-Jaghdam, Inès
AU - Elnaim, Bushra Elamin
AU - Elhag, Azhari A.
N1 - Publisher Copyright:
© 2022 THE AUTHORS
PY - 2023/1
Y1 - 2023/1
N2 - Regarding the pandemic taking place in the world from the spread of the Coronavirus pandemic and viral mutations, the need has arisen to analyze the epidemic data in terms of numbers of infected and deaths, different geographical regions, and the dynamics of the spread of the virus. In China, the total number of reported infections is 224,659 on June 11, 2022. In this paper, the Gaussian Mixture Model and the decision tree method were used to classify and predict new cases of the virus. Although we focus mainly on the Chinese case, the model is general and adapted to any context without loss of validity of the qualitative results. The Chi-Squared (χ2) Automatic Interaction Detection (CHAID) was applied in creating the decision tree structure, the data has been classified into five classes, according to the BIC criterion. The best mixture model is the E (Equal variance) with five components. The considered data sets of the world health organization (WHO) were used from January 5, 2020, to 12, November 2021. We provide numerical results based on the Chinese case.
AB - Regarding the pandemic taking place in the world from the spread of the Coronavirus pandemic and viral mutations, the need has arisen to analyze the epidemic data in terms of numbers of infected and deaths, different geographical regions, and the dynamics of the spread of the virus. In China, the total number of reported infections is 224,659 on June 11, 2022. In this paper, the Gaussian Mixture Model and the decision tree method were used to classify and predict new cases of the virus. Although we focus mainly on the Chinese case, the model is general and adapted to any context without loss of validity of the qualitative results. The Chi-Squared (χ2) Automatic Interaction Detection (CHAID) was applied in creating the decision tree structure, the data has been classified into five classes, according to the BIC criterion. The best mixture model is the E (Equal variance) with five components. The considered data sets of the world health organization (WHO) were used from January 5, 2020, to 12, November 2021. We provide numerical results based on the Chinese case.
KW - Chi-Squared Automatic Interaction Detection (CHAID)
KW - Decision Tree (DT)
KW - Gaussian Mixture Model (GMM)
KW - Machine Learning (ML)
UR - http://www.scopus.com/inward/record.url?scp=85135789529&partnerID=8YFLogxK
U2 - 10.1016/j.aej.2022.07.011
DO - 10.1016/j.aej.2022.07.011
M3 - Article
AN - SCOPUS:85135789529
SN - 1110-0168
VL - 62
SP - 327
EP - 333
JO - Alexandria Engineering Journal
JF - Alexandria Engineering Journal
ER -