Multiscale attention-over-attention network for retinal disease recognition in OCT radiology images

Abdulmajeed M. Alenezi; Daniyah A. Aloqalaa; Sushil Kumar Singh; Raqinah Alrabiah; Shabana Habib; Muhammad Islam; Yousef Ibrahim Daradkeh

doi:10.3389/fmed.2024.1499393

Computer Engineering

Research output: Contribution to journal › Article › peer-review

Abstract

Retinal disease recognition using Optical Coherence Tomography (OCT) images plays a pivotal role in the early diagnosis and treatment of conditions. However, the previous attempts relied on extracting single-scale features often refined by stacked layered attentions. This paper presents a novel deep learning-based Multiscale Feature Enhancement via a Dual Attention Network specifically designed for retinal disease recognition in OCT images. Our approach leverages the EfficientNetB7 backbone to extract multiscale features from OCT images, ensuring a comprehensive representation of global and local retinal structures. To further refine feature extraction, we propose a Pyramidal Attention mechanism that integrates Multi-Head Self-Attention (MHSA) with Dense Atrous Spatial Pyramid Pooling (DASPP), effectively capturing long-range dependencies and contextual information at multiple scales. Additionally, Efficient Channel Attention (ECA) and Spatial Refinement modules are introduced to enhance channel-wise and spatial feature representations, enabling precise localization of retinal abnormalities. A comprehensive ablation study confirms the progressive impact of integrated blocks and attention mechanisms that enhance overall performance. Our findings underscore the potential of advanced attention mechanisms and multiscale processing, highlighting the effectiveness of the network. Extensive experiments on two benchmark datasets demonstrate the superiority of the proposed network over existing state-of-the-art methods.
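The abstract names Efficient Channel Attention (ECA) but gives no implementation details. As a rough illustration of the general ECA idea only (not the authors' code), the sketch below pools each channel to one value, runs a small 1D convolution across neighbouring channels, and rescales each channel by a sigmoid gate. The function names, the plain-list tensor layout, and the example kernel weights are all assumptions made for this sketch; the kernel-size rule follows the heuristic commonly associated with ECA-Net, with `gamma=2` and `b=1` as assumed defaults.

```python
import math


def eca_kernel_size(channels, gamma=2, b=1):
    """Pick an odd 1D kernel size that grows with the channel count (assumed heuristic)."""
    t = int(abs((math.log2(channels) + b) / gamma))
    return t if t % 2 else t + 1


def efficient_channel_attention(feature_map, weights):
    """Apply an ECA-style gate to a feature map.

    feature_map: list of C channels, each an H x W list of lists of floats.
    weights: odd-length 1D kernel shared by all channels (hypothetical values;
             in a trained network these would be learned).
    Returns (rescaled feature map, per-channel gates).
    """
    # 1) Global average pooling: one scalar descriptor per channel.
    pooled = [
        sum(sum(row) for row in channel) / (len(channel) * len(channel[0]))
        for channel in feature_map
    ]

    # 2) 1D convolution over the channel axis with zero padding,
    #    so each channel interacts only with its nearest neighbours.
    pad = len(weights) // 2
    padded = [0.0] * pad + pooled + [0.0] * pad
    logits = [
        sum(w * padded[i + j] for j, w in enumerate(weights))
        for i in range(len(pooled))
    ]

    # 3) Sigmoid turns each logit into a gate in (0, 1).
    gates = [1.0 / (1.0 + math.exp(-z)) for z in logits]

    # 4) Rescale every spatial position of a channel by that channel's gate.
    scaled = [
        [[gate * value for value in row] for row in channel]
        for gate, channel in zip(gates, feature_map)
    ]
    return scaled, gates


if __name__ == "__main__":
    # Two 2x2 channels and an identity-like kernel, chosen only for illustration.
    fm = [[[0.0, 0.0], [0.0, 0.0]], [[2.0, 2.0], [2.0, 2.0]]]
    out, gates = efficient_channel_attention(fm, [0.0, 1.0, 0.0])
    print(gates)
```

Because the gate depends only on a pooled descriptor and a short kernel, this style of attention adds very few parameters, which is presumably why it pairs well with a large backbone such as EfficientNetB7.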

Original language	English
Article number	1499393
Journal	Frontiers in Medicine
Volume	11
DOIs	https://doi.org/10.3389/fmed.2024.1499393
State	Published - 2024

Keywords

OCT imaging
attention mechanism
deep learning
medical imaging
multi-level features
retinal recognition

Cite this

@article{9de0bcd91bff45a4b9c977fbf50865ee,

title = "Multiscale attention-over-attention network for retinal disease recognition in OCT radiology images",

abstract = "Retinal disease recognition using Optical Coherence Tomography (OCT) images plays a pivotal role in the early diagnosis and treatment of conditions. However, the previous attempts relied on extracting single-scale features often refined by stacked layered attentions. This paper presents a novel deep learning-based Multiscale Feature Enhancement via a Dual Attention Network specifically designed for retinal disease recognition in OCT images. Our approach leverages the EfficientNetB7 backbone to extract multiscale features from OCT images, ensuring a comprehensive representation of global and local retinal structures. To further refine feature extraction, we propose a Pyramidal Attention mechanism that integrates Multi-Head Self-Attention (MHSA) with Dense Atrous Spatial Pyramid Pooling (DASPP), effectively capturing long-range dependencies and contextual information at multiple scales. Additionally, Efficient Channel Attention (ECA) and Spatial Refinement modules are introduced to enhance channel-wise and spatial feature representations, enabling precise localization of retinal abnormalities. A comprehensive ablation study confirms the progressive impact of integrated blocks and attention mechanisms that enhance overall performance. Our findings underscore the potential of advanced attention mechanisms and multiscale processing, highlighting the effectiveness of the network. Extensive experiments on two benchmark datasets demonstrate the superiority of the proposed network over existing state-of-the-art methods.",

keywords = "OCT imaging, attention mechanism, deep learning, medical imaging, multi-level features, retinal recognition",

author = "Alenezi, {Abdulmajeed M.} and Aloqalaa, {Daniyah A.} and Singh, {Sushil Kumar} and Alrabiah, Raqinah and Habib, Shabana and Islam, Muhammad and Daradkeh, {Yousef Ibrahim}",

note = "Publisher Copyright: Copyright {\textcopyright} 2024 Alenezi, Aloqalaa, Singh, Alrabiah, Habib, Islam and Daradkeh.",

year = "2024",

doi = "10.3389/fmed.2024.1499393",

language = "English",

volume = "11",

journal = "Frontiers in Medicine",

issn = "2296-858X",

publisher = "Frontiers Media SA",

}

TY - JOUR

T1 - Multiscale attention-over-attention network for retinal disease recognition in OCT radiology images

AU - Alenezi, Abdulmajeed M.

AU - Aloqalaa, Daniyah A.

AU - Singh, Sushil Kumar

AU - Alrabiah, Raqinah

AU - Habib, Shabana

AU - Islam, Muhammad

AU - Daradkeh, Yousef Ibrahim

PY - 2024

Y1 - 2024

AB - Retinal disease recognition using Optical Coherence Tomography (OCT) images plays a pivotal role in the early diagnosis and treatment of conditions. However, the previous attempts relied on extracting single-scale features often refined by stacked layered attentions. This paper presents a novel deep learning-based Multiscale Feature Enhancement via a Dual Attention Network specifically designed for retinal disease recognition in OCT images. Our approach leverages the EfficientNetB7 backbone to extract multiscale features from OCT images, ensuring a comprehensive representation of global and local retinal structures. To further refine feature extraction, we propose a Pyramidal Attention mechanism that integrates Multi-Head Self-Attention (MHSA) with Dense Atrous Spatial Pyramid Pooling (DASPP), effectively capturing long-range dependencies and contextual information at multiple scales. Additionally, Efficient Channel Attention (ECA) and Spatial Refinement modules are introduced to enhance channel-wise and spatial feature representations, enabling precise localization of retinal abnormalities. A comprehensive ablation study confirms the progressive impact of integrated blocks and attention mechanisms that enhance overall performance. Our findings underscore the potential of advanced attention mechanisms and multiscale processing, highlighting the effectiveness of the network. Extensive experiments on two benchmark datasets demonstrate the superiority of the proposed network over existing state-of-the-art methods.

KW - OCT imaging

KW - attention mechanism

KW - deep learning

KW - medical imaging

KW - multi-level features

KW - retinal recognition

UR - http://www.scopus.com/inward/record.url?scp=85210077858&partnerID=8YFLogxK

U2 - 10.3389/fmed.2024.1499393

DO - 10.3389/fmed.2024.1499393

M3 - Article

AN - SCOPUS:85210077858

SN - 2296-858X

VL - 11

JO - Frontiers in Medicine

JF - Frontiers in Medicine

M1 - 1499393

ER -
