TY - JOUR
T1 - Optimal Deep Neural Network-Based Model for Answering Visual Medical Question
AU - Gasmi, Karim
AU - Ltaifa, Ibtihel Ben
AU - Lejeune, Gaël
AU - Alshammari, Hamoud
AU - Ammar, Lassaad Ben
AU - Mahmood, Mahmood A.
N1 - Publisher Copyright:
© 2021 Taylor & Francis Group, LLC.
PY - 2022
Y1 - 2022
N2 - Over the last few years, the amount of available information has increased exponentially in all professional fields, including the medical field. Modern-day patients have access to a wealth of medical information, whether it be from brochures, newspapers, television campaigns, or internet documents. To facilitate and accelerate the search for medical information, more precise systems have been implemented, such as visual question-and-answer systems. A visual question-and-answer system is designed to provide direct and precise answers to questions asked in natural language. In this context, we propose an optimal deep neural network model based on an adaptive optimization algorithm, which takes medical images and natural language questions as input, then provides precise answers as output. Our model starts by classifying medical questions following an embedding phase. We then use a deep learning model for visual and textual feature extraction and emergence. In this paper, we aim to maximize the accuracy rate and minimize the number of epochs in order to accelerate the process. This is a multi-objective optimization problem. The selection of deep learning model parameters, such as epoch number and batch size, is an essential step in improving the model, thus, we use an adaptive genetic algorithm to determine the optimal deep learning parameters. Finally, we propose a dense layer for answer retrieval. To evaluate our model, we used the ImageCLEF 2019 VQA data set. Our model outperforms existing visual question-and-answer systems and offers a significantly higher retrieval accuracy rate.
AB - Over the last few years, the amount of available information has increased exponentially in all professional fields, including the medical field. Modern-day patients have access to a wealth of medical information, whether it be from brochures, newspapers, television campaigns, or internet documents. To facilitate and accelerate the search for medical information, more precise systems have been implemented, such as visual question-and-answer systems. A visual question-and-answer system is designed to provide direct and precise answers to questions asked in natural language. In this context, we propose an optimal deep neural network model based on an adaptive optimization algorithm, which takes medical images and natural language questions as input, then provides precise answers as output. Our model starts by classifying medical questions following an embedding phase. We then use a deep learning model for visual and textual feature extraction and emergence. In this paper, we aim to maximize the accuracy rate and minimize the number of epochs in order to accelerate the process. This is a multi-objective optimization problem. The selection of deep learning model parameters, such as epoch number and batch size, is an essential step in improving the model, thus, we use an adaptive genetic algorithm to determine the optimal deep learning parameters. Finally, we propose a dense layer for answer retrieval. To evaluate our model, we used the ImageCLEF 2019 VQA data set. Our model outperforms existing visual question-and-answer systems and offers a significantly higher retrieval accuracy rate.
KW - Bi-LSTM
KW - deep learning
KW - EfficientNet
KW - genetic algorithm
KW - medical visual question answering
KW - optimization
UR - https://www.scopus.com/pages/publications/85122006090
U2 - 10.1080/01969722.2021.2018543
DO - 10.1080/01969722.2021.2018543
M3 - Article
AN - SCOPUS:85122006090
SN - 0196-9722
VL - 53
SP - 403
EP - 424
JO - Cybernetics and Systems
JF - Cybernetics and Systems
IS - 5
ER -