Satellite-Based Object Detection With PSNR-Driven Image Enhancement and Structural Interpretability

Ahmad Almadhor; Nejib Ghazouani; Javed Mallick; Abdullah Alqahtani; Natalia Kryvinska; Abdullah Al Hejaili; Gabriel Avelino Sampedro; Thippa Reddy Gadekallu

doi:10.1109/JSTARS.2025.3593849

Computer Sciences

Research output: Contribution to journal › Article › peer-review

Abstract

Satellite image analysis faces persistent challenges in real-time processing, classification across heterogeneous terrains, and limited model interpretability. Ensuring input image quality before inference remains a critical bottleneck in operational remote sensing. To address these issues, we propose a unified deep learning framework that combines quality-assured preprocessing, efficient training, and interpretable multilabel classification for high-resolution imagery. Central to the framework is the Swin vision transformer, integrated within a GPU-accelerated pipeline featuring Albumentations-based augmentation and enhancement techniques (distortion correction, histogram equalization, and denoising), validated by structural similarity index measure (0.9564) and peak signal-to-noise ratio (30.11 dB). Confidence-based prediction filtering improves reliability, while specialized modules for land use and land cover and coastal region analysis enhance semantic understanding. Interpretability is achieved using Shapley additive explanations (SHAP), local interpretable model-agnostic explanations (LIME), and occlusion maps to visualize spatial attention. On a curated nine-class subset of the MLRS-Net dataset, the model achieves 99.74% validation accuracy with up to 99.9% confidence in real-time inference. The proposed framework delivers a scalable, robust, and explainable solution for real-time satellite monitoring applications.
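
The abstract reports a peak signal-to-noise ratio of 30.11 dB as a quality check on the enhanced images. The following is a minimal sketch of how PSNR is conventionally computed, not the authors' implementation: the function name `psnr`, the flat-list image representation, the sample pixel values, and the 8-bit peak value of 255 are all assumptions made for illustration.

```python
import math


def psnr(reference, processed, max_value=255.0):
    """Return the peak signal-to-noise ratio (in dB) of two pixel sequences.

    Hypothetical helper: images are given as flat, equal-length sequences of
    intensities, and max_value is the largest possible intensity (255 is an
    assumption that the images are 8-bit).
    """
    if len(reference) != len(processed):
        raise ValueError("images must have the same number of pixels")
    if not reference:
        raise ValueError("images must not be empty")

    # Mean squared error between the reference and the processed image.
    mse = sum((r - p) ** 2 for r, p in zip(reference, processed)) / len(reference)

    if mse == 0:
        return math.inf  # identical images: PSNR is unbounded

    # PSNR = 10 * log10(MAX^2 / MSE)
    return 10.0 * math.log10(max_value ** 2 / mse)


# Illustrative pixel values only; these are not data from the paper.
reference = [52, 55, 61, 59, 79, 61, 76, 61]
processed = [54, 55, 60, 59, 77, 62, 76, 63]
print(f"PSNR: {psnr(reference, processed):.2f} dB")
```

Under the same 8-bit assumption, a PSNR of 30.11 dB would correspond to a mean squared error of roughly 63 between the reference and enhanced images. Higher PSNR means the enhancement step changed the pixel values less.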

Original language	English
Pages (from-to)	21228-21238
Number of pages	11
Journal	IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
Volume	18
DOIs	https://doi.org/10.1109/JSTARS.2025.3593849
State	Published - 2025

Keywords

Deep learning
multilabel classification
peak signal-to-noise ratio (PSNR)
real-time remote sensing
satellite imagery
structural similarity index measure (SSIM)
swin transformer

Cite this

Almadhor, A., Ghazouani, N., Mallick, J., Alqahtani, A., Kryvinska, N., Al Hejaili, A., Sampedro, G. A., & Gadekallu, T. R. (2025). Satellite-Based Object Detection With PSNR-Driven Image Enhancement and Structural Interpretability. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 18, 21228-21238. https://doi.org/10.1109/JSTARS.2025.3593849

@article{6c08a4bf9da1418ab468794d61b46a9e,

title = "Satellite-Based Object Detection With PSNR-Driven Image Enhancement and Structural Interpretability",

abstract = "Satellite image analysis faces persistent challenges in real-time processing, classification across heterogeneous terrains, and limited model interpretability. Ensuring input image quality before inference remains a critical bottleneck in operational remote sensing. To address these issues, we propose a unified deep learning framework that combines quality-assured preprocessing, efficient training, and interpretable multilabel classification for high-resolution imagery. Central to the framework is the swin vision transformer, integrated within a GPU-accelerated pipeline featuring Albumentations-based augmentation and enhancement techniques (distortion correction, histogram equalization, and denoising), validated by structural similarity index measure (0.9564) and peak signal-to-noise ratio (30.11 dB). Confidence-based prediction filtering improves reliability, while specialized modules for land use and land cover and coastal region analysis enhance semantic understanding. Interpretability is achieved using shapley additive explanations, local interpretable model-agnostic explanations, and occlusion maps to visualize spatial attention. On a curated nine-class subset of the MLRS-Net dataset, the model achieves 99.74\% validation accuracy with up to 99.9\% confidence in real-time inference. The proposed framework delivers a scalable, robust, and explainable solution for real-time satellite monitoring applications.",

keywords = "Deep learning, multilabel classification, peak signal-to-noise ratio (PSNR), real-time remote sensing, satellite imagery, structural similarity index measure (SSIM), swin transformer",

author = "Ahmad Almadhor and Nejib Ghazouani and Javed Mallick and Abdullah Alqahtani and Natalia Kryvinska and {Al Hejaili}, Abdullah and Sampedro, {Gabriel Avelino} and Gadekallu, {Thippa Reddy}",

note = "Publisher Copyright: {\textcopyright} IEEE. 2008-2012 IEEE.",

year = "2025",

doi = "10.1109/JSTARS.2025.3593849",

language = "English",

volume = "18",

pages = "21228--21238",

journal = "IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing",

issn = "1939-1404",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

}

Almadhor, A, Ghazouani, N, Mallick, J, Alqahtani, A, Kryvinska, N, Al Hejaili, A, Sampedro, GA & Gadekallu, TR 2025, 'Satellite-Based Object Detection With PSNR-Driven Image Enhancement and Structural Interpretability', IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, vol. 18, pp. 21228-21238. https://doi.org/10.1109/JSTARS.2025.3593849

TY - JOUR

T1 - Satellite-Based Object Detection With PSNR-Driven Image Enhancement and Structural Interpretability

AU - Almadhor, Ahmad

AU - Ghazouani, Nejib

AU - Mallick, Javed

AU - Alqahtani, Abdullah

AU - Kryvinska, Natalia

AU - Al Hejaili, Abdullah

AU - Sampedro, Gabriel Avelino

AU - Gadekallu, Thippa Reddy

N1 - Publisher Copyright: © IEEE. 2008-2012 IEEE.

PY - 2025

Y1 - 2025

N2 - Satellite image analysis faces persistent challenges in real-time processing, classification across heterogeneous terrains, and limited model interpretability. Ensuring input image quality before inference remains a critical bottleneck in operational remote sensing. To address these issues, we propose a unified deep learning framework that combines quality-assured preprocessing, efficient training, and interpretable multilabel classification for high-resolution imagery. Central to the framework is the swin vision transformer, integrated within a GPU-accelerated pipeline featuring Albumentations-based augmentation and enhancement techniques (distortion correction, histogram equalization, and denoising), validated by structural similarity index measure (0.9564) and peak signal-to-noise ratio (30.11 dB). Confidence-based prediction filtering improves reliability, while specialized modules for land use and land cover and coastal region analysis enhance semantic understanding. Interpretability is achieved using shapley additive explanations, local interpretable model-agnostic explanations, and occlusion maps to visualize spatial attention. On a curated nine-class subset of the MLRS-Net dataset, the model achieves 99.74% validation accuracy with up to 99.9% confidence in real-time inference. The proposed framework delivers a scalable, robust, and explainable solution for real-time satellite monitoring applications.

AB - Satellite image analysis faces persistent challenges in real-time processing, classification across heterogeneous terrains, and limited model interpretability. Ensuring input image quality before inference remains a critical bottleneck in operational remote sensing. To address these issues, we propose a unified deep learning framework that combines quality-assured preprocessing, efficient training, and interpretable multilabel classification for high-resolution imagery. Central to the framework is the swin vision transformer, integrated within a GPU-accelerated pipeline featuring Albumentations-based augmentation and enhancement techniques (distortion correction, histogram equalization, and denoising), validated by structural similarity index measure (0.9564) and peak signal-to-noise ratio (30.11 dB). Confidence-based prediction filtering improves reliability, while specialized modules for land use and land cover and coastal region analysis enhance semantic understanding. Interpretability is achieved using shapley additive explanations, local interpretable model-agnostic explanations, and occlusion maps to visualize spatial attention. On a curated nine-class subset of the MLRS-Net dataset, the model achieves 99.74% validation accuracy with up to 99.9% confidence in real-time inference. The proposed framework delivers a scalable, robust, and explainable solution for real-time satellite monitoring applications.

KW - Deep learning

KW - multilabel classification

KW - peak signal-to-noise ratio (PSNR)

KW - real-time remote sensing

KW - satellite imagery

KW - structural similarity index measure (SSIM)

KW - swin transformer

UR - https://www.scopus.com/pages/publications/105012268930

U2 - 10.1109/JSTARS.2025.3593849

DO - 10.1109/JSTARS.2025.3593849

M3 - Article

AN - SCOPUS:105012268930

SN - 1939-1404

VL - 18

SP - 21228

EP - 21238

JO - IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing

JF - IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing

ER -
