Pyramidal attention with progressive multi-stage iterative feature refinement for salient object segmentation

Rahim Khan, Nada Alzaben, Yousef Ibrahim Daradkeh, Xianxun Zhu, Inam Ullah

Research output: Contribution to journalArticlepeer-review

Abstract

Accurate detection of salient objects in complex visual scenes remains a fundamental yet challenging task in visual intelligence, often impeded by significant scale variation, background clutter, and indistinct object boundaries. While recent approaches attempt to exploit multi-level features, they frequently encounter limitations such as semantic misalignment across feature hierarchies, spatial detail degradation, and weak cross-dataset generalization. To overcome these challenges, we propose a novel Pyramidal Attention Mechanism (PAM) with Progressive Multi-stage Iterative Feature Refinement Network (PIFRNet) designed for robust and precise Salient Object Detection (SOD). Specifically, our method begins by hierarchically aggregating features from four representative stages of a powerful backbone, ensuring rich multi-scale context and semantic diversity. To bridge semantic gaps and recover fine structures, we introduce a Progressive Bilateral Feature Refinement (PBFR) module, which enhances early-stage features through cascaded convolutions and spatial attention. Furthermore, the novel PAM, equipped with dilated convolutions, is introduced to refine high-level semantics and reinforce object completeness. The network integrates these components through a multi-stage iterative refinement process, enabling gradual enhancement of spatial precision and structural fidelity. Extensive experiments conducted on five public SOD benchmarks demonstrate that our approach achieves superior performance compared to state-of-the-art methods, both quantitatively and qualitatively. Cross-dataset evaluations further validate its strong generalization capability, making it highly applicable to real-world visual intelligence scenarios.

Original languageEnglish
Article number105670
JournalImage and Vision Computing
Volume162
DOIs
StatePublished - Oct 2025

Keywords

  • Bilateral merging
  • Hierarchical aggregation
  • Multi-scale representation
  • Pyramidal attention
  • Saliency detection
  • Visual intelligence

Fingerprint

Dive into the research topics of 'Pyramidal attention with progressive multi-stage iterative feature refinement for salient object segmentation'. Together they form a unique fingerprint.

Cite this