Streamlined meteorological drought monitoring through fuzzy clustering and deep learning

  • Muhammad Ilyas
  • , Rizwan Niaz
  • , Luca Di Persio
  • , Mohammed A. Alshahrani
  • , Nafisa A. Albasheir
  • , Mohammed M.A. Almazah

Research output: Contribution to journalArticlepeer-review

6 Scopus citations

Abstract

This work presents a rigorous mathematical framework for monitoring meteorological drought by integrating advanced machine learning, feature selection, and clustering strategies. The central aim is to predict drought events based on a combination of meteorological indicators, using the Standardized Precipitation Index (SPI) as the target variable. A new Composite Drought Index (CDI) is introduced to encapsulate multivariate drought information, constructed from the Precipitation Concentration Index, Temperature Condition Index, Wind Speed Condition Index, and Soil Moisture Condition Index. The CDI demonstrates strong empirical consistency with the established drought index, SPI, based on four decades of data from thirty-two meteorological stations. To capture spatial heterogeneity, the study employs fuzzy clustering to group stations into meteorologically homogeneous classes. Within each cluster, the Boruta algorithm is used to isolate the most relevant features by assessing their relative importance, ensuring that only statistically informative variables contribute to model construction. Drought prediction is then performed using a suite of machine learning models, including Random Forest, Support Vector Regression, Extreme Gradient Boosting, and Deep Feedforward Neural Networks. A hybrid model combining deep neural networks with random forests achieves the best overall performance by extracting latent features through deep architectures and refining predictions via ensemble methods. This hybrid yields the lowest prediction errors, with Mean Absolute Error ranging from 0.1570 to 0.2664, Mean Squared Error between 0.0409 and 0.1093, and Root Mean Squared Error between 0.2022 and 0.3306. It also attains the highest Nash-Sutcliffe Efficiency, from 0.8973 to 0.9547, and Kling-Gupta Efficiency, from 0.7253 to 0.8807. The study’s main contributions include the formal definition of CDI as a multivariate index, the incorporation of fuzzy clustering to enhance spatial generalization, and the deployment of a deep-ensemble model to capture complex nonlinear and temporal dependencies in meteorological data. Empirical results demonstrate that CDI significantly outperforms univariate indices, and that the hybrid model provides better predictive performance than conventional deep learning approaches such as CNN and LSTM. The framework is adaptable for real-time drought monitoring and early warning systems, offering practical value for climate resilience in drought-prone regions.

Original languageEnglish
Article number389
JournalTheoretical and Applied Climatology
Volume156
Issue number7
DOIs
StatePublished - Jul 2025

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 11 - Sustainable Cities and Communities
    SDG 11 Sustainable Cities and Communities
  2. SDG 13 - Climate Action
    SDG 13 Climate Action

Fingerprint

Dive into the research topics of 'Streamlined meteorological drought monitoring through fuzzy clustering and deep learning'. Together they form a unique fingerprint.

Cite this