Channel attention deep learning for renaissance art analysis

Olfa Mzoughi

doi:10.1007/s11042-025-20999-5

Computer Sciences

Research output: Contribution to journal › Article › peer-review

Abstract

The integration of deep learning and computer vision technologies has revolutionized the study of paintings, offering new insights into historical and cultural contexts. However, accurately identifying objects in paintings remains a challenge due to the inherent complexities of various artistic styles. This research focuses on the Renaissance period, specifically examining the Early Renaissance, High Renaissance, Tenebrism, and Academicism styles, to compare human portrayal through the object detection of faces and hands. Our contributions include a detailed analysis of visual characteristics across these styles and the development of a channel-wise attention mechanism within deep learning object detection. Our findings reveal that integrating the Squeeze-and-Excitation (SE) block with a multi-scale model like RetinaNet significantly enhances the model’s robustness by addressing issues related to low color palette and color imbalance. The results demonstrate marked improvements in detection accuracy and adaptability within the complex context of Renaissance art, underscoring the effectiveness of the SE-enhanced RetinaNet model in advancing object detection in stylized images.
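
The abstract names the Squeeze-and-Excitation (SE) block as its channel-wise attention mechanism but gives no implementation details. The following is a minimal sketch of a generic SE block in plain Python, not the author's code: the reduction ratio, the random weights, the toy feature-map size, and all function names are assumptions made for illustration, and where the block attaches inside RetinaNet is not stated in this record. It shows the three usual steps: squeeze each channel to one scalar by global average pooling, pass that descriptor through a small bottleneck network ending in a sigmoid, and rescale each channel by the resulting gate.

```python
import math
import random


def squeeze(feature_map):
    """Global average pooling: one scalar per channel.

    feature_map is a list of C channels, each an H x W list of lists.
    """
    return [
        sum(sum(row) for row in channel) / (len(channel) * len(channel[0]))
        for channel in feature_map
    ]


def dense(vector, weights, biases):
    """Fully connected layer; weights holds one row per output unit."""
    return [
        sum(w * x for w, x in zip(row, vector)) + b
        for row, b in zip(weights, biases)
    ]


def excite(descriptor, w1, b1, w2, b2):
    """Bottleneck network: reduce, ReLU, expand, sigmoid. Gates lie in (0, 1)."""
    hidden = [max(0.0, h) for h in dense(descriptor, w1, b1)]
    return [1.0 / (1.0 + math.exp(-z)) for z in dense(hidden, w2, b2)]


def se_block(feature_map, w1, b1, w2, b2):
    """Rescale every value in a channel by that channel's gate."""
    gates = excite(squeeze(feature_map), w1, b1, w2, b2)
    return [
        [[gate * value for value in row] for row in channel]
        for channel, gate in zip(feature_map, gates)
    ]


def random_weights(rows, cols, rng):
    """Hypothetical stand-in for trained weights."""
    return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]


# Toy input: 4 channels of 3 x 3 values, reduction ratio 2 (all assumed).
rng = random.Random(0)
channels, height, width, reduction = 4, 3, 3, 2
hidden_units = channels // reduction

feature_map = [
    [[rng.random() for _ in range(width)] for _ in range(height)]
    for _ in range(channels)
]
w1 = random_weights(hidden_units, channels, rng)
b1 = [0.0] * hidden_units
w2 = random_weights(channels, hidden_units, rng)
b2 = [0.0] * channels

recalibrated = se_block(feature_map, w1, b1, w2, b2)
```

The output has the same shape as the input, so a block like this can in principle be dropped between existing convolutional stages. Because each gate depends on the mean activation of every channel, channels can be amplified or suppressed relative to one another, which is one plausible reading of how the approach addresses the color imbalance the abstract mentions.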

Original language	English
Journal	Multimedia Tools and Applications
DOIs	https://doi.org/10.1007/s11042-025-20999-5
State	Accepted/In press - 2025

Keywords

Artworks
Attention mechanism
Deep learning
Face and hand detection
Renaissance

Access to Document

10.1007/s11042-025-20999-5

Cite this

@article{b1cb9addb7894d48ad64b7a06733cf8f,

title = "Channel attention deep learning for renaissance art analysis",

abstract = "The integration of deep learning and computer vision technologies has revolutionized the study of paintings, offering new insights into historical and cultural contexts. However, accurately identifying objects in paintings remains a challenge due to the inherent complexities of various artistic styles. This research focuses on the Renaissance period, specifically examining the Early Renaissance, High Renaissance, Tenebrism, and Academicism styles, to compare human portrayal through the object detection of faces and hands. Our contributions include a detailed analysis of visual characteristics across these styles and the development of a channel-wise attention mechanism within deep learning object detection. Our findings reveal that integrating the Squeeze-and-Excitation (SE) block with a multi-scale model like RetinaNet significantly enhances the model{\textquoteright}s robustness by addressing issues related to low color palette and color imbalance. The results demonstrate marked improvements in detection accuracy and adaptability within the complex context of Renaissance art, underscoring the effectiveness of the SE-enhanced RetinaNet model in advancing object detection in stylized images.",

keywords = "Artworks, Attention mechanism, Deep learning, Face and hand detection, Renaissance",

author = "Olfa Mzoughi",

note = "Publisher Copyright: {\textcopyright} The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2025.",

year = "2025",

doi = "10.1007/s11042-025-20999-5",

language = "English",

journal = "Multimedia Tools and Applications",

issn = "1380-7501",

publisher = "Springer",

}

TY - JOUR

T1 - Channel attention deep learning for renaissance art analysis

AU - Mzoughi, Olfa

N1 - Publisher Copyright: © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2025.

PY - 2025

Y1 - 2025

AB - The integration of deep learning and computer vision technologies has revolutionized the study of paintings, offering new insights into historical and cultural contexts. However, accurately identifying objects in paintings remains a challenge due to the inherent complexities of various artistic styles. This research focuses on the Renaissance period, specifically examining the Early Renaissance, High Renaissance, Tenebrism, and Academicism styles, to compare human portrayal through the object detection of faces and hands. Our contributions include a detailed analysis of visual characteristics across these styles and the development of a channel-wise attention mechanism within deep learning object detection. Our findings reveal that integrating the Squeeze-and-Excitation (SE) block with a multi-scale model like RetinaNet significantly enhances the model’s robustness by addressing issues related to low color palette and color imbalance. The results demonstrate marked improvements in detection accuracy and adaptability within the complex context of Renaissance art, underscoring the effectiveness of the SE-enhanced RetinaNet model in advancing object detection in stylized images.

KW - Artworks

KW - Attention mechanism

KW - Deep learning

KW - Face and hand detection

KW - Renaissance

UR - http://www.scopus.com/inward/record.url?scp=105009527575&partnerID=8YFLogxK

U2 - 10.1007/s11042-025-20999-5

DO - 10.1007/s11042-025-20999-5

M3 - Article

AN - SCOPUS:105009527575

SN - 1380-7501

JO - Multimedia Tools and Applications

JF - Multimedia Tools and Applications

ER -
