TY - JOUR
T1 - Multiscale attention-over-attention network for retinal disease recognition in OCT radiology images
AU - Alenezi, Abdulmajeed M.
AU - Aloqalaa, Daniyah A.
AU - Singh, Sushil Kumar
AU - Alrabiah, Raqinah
AU - Habib, Shabana
AU - Islam, Muhammad
AU - Daradkeh, Yousef Ibrahim
N1 - Publisher Copyright:
Copyright © 2024 Alenezi, Aloqalaa, Singh, Alrabiah, Habib, Islam and Daradkeh.
PY - 2024
Y1 - 2024
N2 - Retinal disease recognition using Optical Coherence Tomography (OCT) images plays a pivotal role in the early diagnosis and treatment of conditions. However, the previous attempts relied on extracting single-scale features often refined by stacked layered attentions. This paper presents a novel deep learning-based Multiscale Feature Enhancement via a Dual Attention Network specifically designed for retinal disease recognition in OCT images. Our approach leverages the EfficientNetB7 backbone to extract multiscale features from OCT images, ensuring a comprehensive representation of global and local retinal structures. To further refine feature extraction, we propose a Pyramidal Attention mechanism that integrates Multi-Head Self-Attention (MHSA) with Dense Atrous Spatial Pyramid Pooling (DASPP), effectively capturing long-range dependencies and contextual information at multiple scales. Additionally, Efficient Channel Attention (ECA) and Spatial Refinement modules are introduced to enhance channel-wise and spatial feature representations, enabling precise localization of retinal abnormalities. A comprehensive ablation study confirms the progressive impact of integrated blocks and attention mechanisms that enhance overall performance. Our findings underscore the potential of advanced attention mechanisms and multiscale processing, highlighting the effectiveness of the network. Extensive experiments on two benchmark datasets demonstrate the superiority of the proposed network over existing state-of-the-art methods.
AB - Retinal disease recognition using Optical Coherence Tomography (OCT) images plays a pivotal role in the early diagnosis and treatment of conditions. However, the previous attempts relied on extracting single-scale features often refined by stacked layered attentions. This paper presents a novel deep learning-based Multiscale Feature Enhancement via a Dual Attention Network specifically designed for retinal disease recognition in OCT images. Our approach leverages the EfficientNetB7 backbone to extract multiscale features from OCT images, ensuring a comprehensive representation of global and local retinal structures. To further refine feature extraction, we propose a Pyramidal Attention mechanism that integrates Multi-Head Self-Attention (MHSA) with Dense Atrous Spatial Pyramid Pooling (DASPP), effectively capturing long-range dependencies and contextual information at multiple scales. Additionally, Efficient Channel Attention (ECA) and Spatial Refinement modules are introduced to enhance channel-wise and spatial feature representations, enabling precise localization of retinal abnormalities. A comprehensive ablation study confirms the progressive impact of integrated blocks and attention mechanisms that enhance overall performance. Our findings underscore the potential of advanced attention mechanisms and multiscale processing, highlighting the effectiveness of the network. Extensive experiments on two benchmark datasets demonstrate the superiority of the proposed network over existing state-of-the-art methods.
KW - OCT imaging
KW - attention mechanism
KW - deep learning
KW - medical imaging
KW - multi-level features
KW - retinal recognition
UR - http://www.scopus.com/inward/record.url?scp=85210077858&partnerID=8YFLogxK
U2 - 10.3389/fmed.2024.1499393
DO - 10.3389/fmed.2024.1499393
M3 - Article
AN - SCOPUS:85210077858
SN - 2296-858X
VL - 11
JO - Frontiers in Medicine
JF - Frontiers in Medicine
M1 - 1499393
ER -