TY - JOUR
T1 - Channel attention deep learning for renaissance art analysis
AU - Mzoughi, Olfa
N1 - Publisher Copyright:
© The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2025.
PY - 2025
Y1 - 2025
N2 - The integration of deep learning and computer vision technologies has revolutionized the study of paintings, offering new insights into historical and cultural contexts. However, accurately identifying objects in paintings remains a challenge due to the inherent complexities of various artistic styles. This research focuses on the Renaissance period, specifically examining the Early Renaissance, High Renaissance, Tenebrism, and Academicism styles, to compare human portrayal through the object detection of faces and hands. Our contributions include a detailed analysis of visual characteristics across these styles and the development of a channel-wise attention mechanism within deep learning object detection. Our findings reveal that integrating the Squeeze-and-Excitation (SE) block with a multi-scale model like RetinaNet significantly enhances the model’s robustness by addressing issues related to low color palette and color imbalance. The results demonstrate marked improvements in detection accuracy and adaptability within the complex context of Renaissance art, underscoring the effectiveness of the SE-enhanced RetinaNet model in advancing object detection in stylized images.
AB - The integration of deep learning and computer vision technologies has revolutionized the study of paintings, offering new insights into historical and cultural contexts. However, accurately identifying objects in paintings remains a challenge due to the inherent complexities of various artistic styles. This research focuses on the Renaissance period, specifically examining the Early Renaissance, High Renaissance, Tenebrism, and Academicism styles, to compare human portrayal through the object detection of faces and hands. Our contributions include a detailed analysis of visual characteristics across these styles and the development of a channel-wise attention mechanism within deep learning object detection. Our findings reveal that integrating the Squeeze-and-Excitation (SE) block with a multi-scale model like RetinaNet significantly enhances the model’s robustness by addressing issues related to low color palette and color imbalance. The results demonstrate marked improvements in detection accuracy and adaptability within the complex context of Renaissance art, underscoring the effectiveness of the SE-enhanced RetinaNet model in advancing object detection in stylized images.
KW - Artworks
KW - Attention mechanism
KW - Deep learning
KW - Face and hand detection
KW - Renaissance
UR - http://www.scopus.com/inward/record.url?scp=105009527575&partnerID=8YFLogxK
U2 - 10.1007/s11042-025-20999-5
DO - 10.1007/s11042-025-20999-5
M3 - Article
AN - SCOPUS:105009527575
SN - 1380-7501
JO - Multimedia Tools and Applications
JF - Multimedia Tools and Applications
ER -