TY - JOUR
T1 - Comparison between Regression Models, Support Vector Machine (SVM), and Artificial Neural Network (ANN) in River Water Quality Prediction
AU - Najwa Mohd Rizal, Nur
AU - Hayder, Gasim
AU - Mnzool, Mohammed
AU - Elnaim, Bushra M.E.
AU - Mohammed, Adil Omer Yousif
AU - Khayyat, Manal M.
N1 - Publisher Copyright:
© 2022 by the authors.
PY - 2022/8
Y1 - 2022/8
N2 - Both anthropogenic and natural sources of pollution are regionally significant. Therefore, in order to monitor and protect the quality of Langat River from deterioration, we use Artificial Intelligence (AI) to model the river water quality. This study has applied several machine learning models (two support vector machines (SVMs), six regression models, and artificial neural network (ANN)) to predict total suspended solids (TSS), total solids (TS), and dissolved solids (DS)) in Langat River, Malaysia. All of the models have been assessed using root mean square error (RMSE), mean square error (MSE) as well as the determination of coefficient (R2). Based on the model performance metrics, the ANN model outperformed all models, while the GPR and SVM models exhibited the characteristic of over-fitting. The remaining machine learning models exhibited fair to poor performances. Although there are a few researches conducted to predict TDS using ANN, however, there are less to no research conducted to predict TS and TSS in Langat River. Therefore, this is the first study to evaluate the water quality (TSS, TS, and DS) of Langat River using the aforementioned models (especially SVM and the six regression models).
AB - Both anthropogenic and natural sources of pollution are regionally significant. Therefore, in order to monitor and protect the quality of Langat River from deterioration, we use Artificial Intelligence (AI) to model the river water quality. This study has applied several machine learning models (two support vector machines (SVMs), six regression models, and artificial neural network (ANN)) to predict total suspended solids (TSS), total solids (TS), and dissolved solids (DS)) in Langat River, Malaysia. All of the models have been assessed using root mean square error (RMSE), mean square error (MSE) as well as the determination of coefficient (R2). Based on the model performance metrics, the ANN model outperformed all models, while the GPR and SVM models exhibited the characteristic of over-fitting. The remaining machine learning models exhibited fair to poor performances. Although there are a few researches conducted to predict TDS using ANN, however, there are less to no research conducted to predict TS and TSS in Langat River. Therefore, this is the first study to evaluate the water quality (TSS, TS, and DS) of Langat River using the aforementioned models (especially SVM and the six regression models).
KW - ANN
KW - regression models
KW - river
KW - SVM
KW - water quality parameters
UR - http://www.scopus.com/inward/record.url?scp=85137570511&partnerID=8YFLogxK
U2 - 10.3390/pr10081652
DO - 10.3390/pr10081652
M3 - Article
AN - SCOPUS:85137570511
SN - 2227-9717
VL - 10
JO - Processes
JF - Processes
IS - 8
M1 - 1652
ER -