TY - JOUR
T1 - Optimizing HCV Disease Prediction in Egypt
T2 - The hyOPTGB Framework
AU - Elshewey, Ahmed M.
AU - Shams, Mahmoud Y.
AU - Tawfeek, Sayed M.
AU - Alharbi, Amal H.
AU - Ibrahim, Abdelhameed
AU - Abdelhamid, Abdelaziz A.
AU - Eid, Marwa M.
AU - Khodadadi, Nima
AU - Abualigah, Laith
AU - Khafaga, Doaa Sami
AU - Tarek, Zahraa
N1 - Publisher Copyright:
© 2023 by the authors.
PY - 2023/11
Y1 - 2023/11
N2 - The paper focuses on the hepatitis C virus (HCV) infection in Egypt, which has one of the highest rates of HCV in the world. The high prevalence is linked to several factors, including the use of injection drugs, poor sterilization practices in medical facilities, and low public awareness. This paper introduces a hyOPTGB model, which employs an optimized gradient boosting (GB) classifier to predict HCV disease in Egypt. The model’s accuracy is enhanced by optimizing hyperparameters with the OPTUNA framework. Min-Max normalization is used as a preprocessing step for scaling the dataset values and using the forward selection (FS) wrapped method to identify essential features. The dataset used in the study contains 1385 instances and 29 features and is available at the UCI machine learning repository. The authors compare the performance of five machine learning models, including decision tree (DT), support vector machine (SVM), dummy classifier (DC), ridge classifier (RC), and bagging classifier (BC), with the hyOPTGB model. The system’s efficacy is assessed using various metrics, including accuracy, recall, precision, and F1-score. The hyOPTGB model outperformed the other machine learning models, achieving a 95.3% accuracy rate. The authors also compared the hyOPTGB model against other models proposed by authors who used the same dataset.
AB - The paper focuses on the hepatitis C virus (HCV) infection in Egypt, which has one of the highest rates of HCV in the world. The high prevalence is linked to several factors, including the use of injection drugs, poor sterilization practices in medical facilities, and low public awareness. This paper introduces a hyOPTGB model, which employs an optimized gradient boosting (GB) classifier to predict HCV disease in Egypt. The model’s accuracy is enhanced by optimizing hyperparameters with the OPTUNA framework. Min-Max normalization is used as a preprocessing step for scaling the dataset values and using the forward selection (FS) wrapped method to identify essential features. The dataset used in the study contains 1385 instances and 29 features and is available at the UCI machine learning repository. The authors compare the performance of five machine learning models, including decision tree (DT), support vector machine (SVM), dummy classifier (DC), ridge classifier (RC), and bagging classifier (BC), with the hyOPTGB model. The system’s efficacy is assessed using various metrics, including accuracy, recall, precision, and F1-score. The hyOPTGB model outperformed the other machine learning models, achieving a 95.3% accuracy rate. The authors also compared the hyOPTGB model against other models proposed by authors who used the same dataset.
KW - gradient boosting (GB)
KW - hepatitis C virus (HCV)
KW - hyperparameters
KW - optimization
KW - OPTUNA
UR - http://www.scopus.com/inward/record.url?scp=85178346018&partnerID=8YFLogxK
U2 - 10.3390/diagnostics13223439
DO - 10.3390/diagnostics13223439
M3 - Article
AN - SCOPUS:85178346018
SN - 2075-4418
VL - 13
JO - Diagnostics
JF - Diagnostics
IS - 22
M1 - 3439
ER -