Abstract
Satellite image analysis faces persistent challenges in real-time processing, classification across heterogeneous terrains, and limited model interpretability. Ensuring input image quality before inference remains a critical bottleneck in operational remote sensing. To address these issues, we propose a unified deep learning framework that combines quality-assured preprocessing, efficient training, and interpretable multilabel classification for high-resolution imagery. Central to the framework is the swin vision transformer, integrated within a GPU-accelerated pipeline featuring Albumentations-based augmentation and enhancement techniques (distortion correction, histogram equalization, and denoising), validated by structural similarity index measure (0.9564) and peak signal-to-noise ratio (30.11 dB). Confidence-based prediction filtering improves reliability, while specialized modules for land use and land cover and coastal region analysis enhance semantic understanding. Interpretability is achieved using shapley additive explanations, local interpretable model-agnostic explanations, and occlusion maps to visualize spatial attention. On a curated nine-class subset of the MLRS-Net dataset, the model achieves 99.74% validation accuracy with up to 99.9% confidence in real-time inference. The proposed framework delivers a scalable, robust, and explainable solution for real-time satellite monitoring applications.
| Original language | English |
|---|---|
| Pages (from-to) | 21228-21238 |
| Number of pages | 11 |
| Journal | IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing |
| Volume | 18 |
| DOIs | |
| State | Published - 2025 |
Keywords
- Deep learning
- multilabel classification
- peak signal-to-noise ratio (PSNR)
- real-time remote sensing
- satellite imagery
- structural similarity index measure (SSIM)
- swin transformer
Fingerprint
Dive into the research topics of 'Satellite-Based Object Detection With PSNR-Driven Image Enhancement and Structural Interpretability'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver