TY - JOUR
T1 - Predicting soil compaction parameters in expansive soils using advanced machine learning models
T2 - a comparative study
AU - Almuaythir, Sultan
AU - Zaini, Muhammad Syamsul Imran
AU - Lodhi, Rida Hameed
N1 - Publisher Copyright:
© The Author(s) 2025.
PY - 2025/12
Y1 - 2025/12
N2 - Expansive soils present substantial challenges for construction projects, particularly in arid regions where their swelling and shrinking behavior can result in structural damage. This research conducted an in-depth analysis of 195 soil samples, investigating six crucial factors, including clay content, liquid and plastic limit, specific gravity, plasticity index, and swell percentage, influencing soil compaction. The study used five computational models to predict optimal water content and maximum dry density. Among the models, XG-Boost exhibited exceptional performance with high predictive accuracy, scoring 0.941 and 0.912 for water content and density, respectively. Random Forest method also performed well, whereas Long Short-Term Memory Network (LSTMN) and k-Nearest Neighbors methods achieved moderate success. Detailed performance metrics, including correlation coefficients (r), mean absolute error (MAE), root mean squared error (RMSE), relative root mean squared error (RRMSE), and performance index (PI), were analyzed for each model across the training, testing, and validation datasets. XG-Boost consistently outperformed other models, demonstrating its robust and reliable predictive power. The analysis of absolute error highlighted the accuracy and consistency of XG-Boost and Random Forest in predicting the maximum dry density.
AB - Expansive soils present substantial challenges for construction projects, particularly in arid regions where their swelling and shrinking behavior can result in structural damage. This research conducted an in-depth analysis of 195 soil samples, investigating six crucial factors, including clay content, liquid and plastic limit, specific gravity, plasticity index, and swell percentage, influencing soil compaction. The study used five computational models to predict optimal water content and maximum dry density. Among the models, XG-Boost exhibited exceptional performance with high predictive accuracy, scoring 0.941 and 0.912 for water content and density, respectively. Random Forest method also performed well, whereas Long Short-Term Memory Network (LSTMN) and k-Nearest Neighbors methods achieved moderate success. Detailed performance metrics, including correlation coefficients (r), mean absolute error (MAE), root mean squared error (RMSE), relative root mean squared error (RRMSE), and performance index (PI), were analyzed for each model across the training, testing, and validation datasets. XG-Boost consistently outperformed other models, demonstrating its robust and reliable predictive power. The analysis of absolute error highlighted the accuracy and consistency of XG-Boost and Random Forest in predicting the maximum dry density.
KW - Compaction characteristics
KW - Expansive clay soil
KW - Machine learing approaches
KW - Prediction model
UR - http://www.scopus.com/inward/record.url?scp=105010092995&partnerID=8YFLogxK
U2 - 10.1038/s41598-025-09279-2
DO - 10.1038/s41598-025-09279-2
M3 - Article
C2 - 40617942
AN - SCOPUS:105010092995
SN - 2045-2322
VL - 15
JO - Scientific Reports
JF - Scientific Reports
IS - 1
M1 - 24018
ER -