TY - JOUR
T1 - On the implementation of a new version of the Weibull distribution and machine learning approach to model the COVID-19 data
AU - Zhou, Yinghui
AU - Ahmad, Zubair
AU - Almaspoor, Zahra
AU - Khan, Faridoon
AU - Tag-Eldin, Elsayed
AU - Iqbal, Zahoor
AU - El-Morshedy, Mahmoud
N1 - Publisher Copyright:
© 2023 the Author(s)
PY - 2023
Y1 - 2023
N2 - Statistical methodologies have broader applications in almost every sector of life including education, hydrology, reliability, management, and healthcare sciences. Among these sectors, statistical modeling and predicting data in the healthcare sector is very crucial. In this paper, we introduce a new method, namely, a new extended exponential family to update the distributional flexibility of the existing models. Based on this approach, a new version of the Weibull model, namely, a new extended exponential Weibull model is introduced. The applicability of the new extended exponential Weibull model is shown by considering two data sets taken from the health sciences. The first data set represents the mortality rate of the patients infected by the coronavirus disease 2019 (COVID-19) in Mexico. Whereas, the second set represents the mortality rate of COVID-19 patients in Holland. Utilizing the same data sets, we carry out forecasting using three machine learning (ML) methods including support vector regression (SVR), random forest (RF), and neural network autoregression (NNAR). To assess their forecasting performances, two statistical accuracy measures, namely, root mean square error (RMSE) and mean absolute error (MAE) are considered. Based on our findings, it is observed that the RF algorithm is very effective in predicting the death rate of the COVID-19 data in Mexico. Whereas, for the second data, the SVR performs better as compared to the other methods.
AB - Statistical methodologies have broader applications in almost every sector of life including education, hydrology, reliability, management, and healthcare sciences. Among these sectors, statistical modeling and predicting data in the healthcare sector is very crucial. In this paper, we introduce a new method, namely, a new extended exponential family to update the distributional flexibility of the existing models. Based on this approach, a new version of the Weibull model, namely, a new extended exponential Weibull model is introduced. The applicability of the new extended exponential Weibull model is shown by considering two data sets taken from the health sciences. The first data set represents the mortality rate of the patients infected by the coronavirus disease 2019 (COVID-19) in Mexico. Whereas, the second set represents the mortality rate of COVID-19 patients in Holland. Utilizing the same data sets, we carry out forecasting using three machine learning (ML) methods including support vector regression (SVR), random forest (RF), and neural network autoregression (NNAR). To assess their forecasting performances, two statistical accuracy measures, namely, root mean square error (RMSE) and mean absolute error (MAE) are considered. Based on our findings, it is observed that the RF algorithm is very effective in predicting the death rate of the COVID-19 data in Mexico. Whereas, for the second data, the SVR performs better as compared to the other methods.
KW - family of distributions
KW - healthcare sector
KW - machine learning algorithms
KW - mathematical properties
KW - simulation
KW - statistical modeling
UR - http://www.scopus.com/inward/record.url?scp=85140890792&partnerID=8YFLogxK
U2 - 10.3934/mbe.2023016
DO - 10.3934/mbe.2023016
M3 - Article
C2 - 36650769
AN - SCOPUS:85140890792
SN - 1547-1063
VL - 20
SP - 337
EP - 364
JO - Mathematical Biosciences and Engineering
JF - Mathematical Biosciences and Engineering
IS - 1
ER -