Machine learning-driven surface water quality prediction: an intuitive GUI solution for forecasting TDS and DO levels

Research output: Contribution to journalArticlepeer-review

Abstract

Industrialization and human activities significantly affect water quality, introducing pollutants into aquatic ecosystems. This study employs machine learning models, specifically two hybrid models (the random forest model and the gene expression model) and standalone models, to predict total dissolved solids (TDS) and dissolved oxygen (DO) levels from 11 inputs. Particle swarm optimization (PSO) was used to enhance model performance, with 423 samples analyzed (80% training, 20% testing). k-fold cross-validation assessed model reliability using metrics, such as R2, RMSE, and MAE. The PSO-RF model outperformed others in TDS prediction (R2 = 0.99, RMSE = 0.0001) and showed positive results for DO (R2 = 0.96, RMSE = 0.32). The PSO-GEP model also performed well (R2 = 0.99 for TDS and 0.95 for DO). Shapley analysis indicated high correlations between total solids and turbidity with TDS, and water temperature with DO. New predictive equations and a graphical user interface (GUI) were developed for practical applications.

Original languageEnglish
Pages (from-to)514-546
Number of pages33
JournalWater Quality Research Journal
Volume60
Issue number4
DOIs
StatePublished - Nov 2025

Keywords

  • hybrid wavelet models
  • k-fold cross-validation
  • machine learning algorithms
  • sensitivity analysis
  • surface water quality

Fingerprint

Dive into the research topics of 'Machine learning-driven surface water quality prediction: an intuitive GUI solution for forecasting TDS and DO levels'. Together they form a unique fingerprint.

Cite this