Optimal Deep Neural Network-Based Model for Answering Visual Medical Question

  • Karim Gasmi
  • , Ibtihel Ben Ltaifa
  • , Gaël Lejeune
  • , Hamoud Alshammari
  • , Lassaad Ben Ammar
  • , Mahmood A. Mahmood

Research output: Contribution to journalArticlepeer-review

24 Scopus citations

Abstract

Over the last few years, the amount of available information has increased exponentially in all professional fields, including the medical field. Modern-day patients have access to a wealth of medical information, whether it be from brochures, newspapers, television campaigns, or internet documents. To facilitate and accelerate the search for medical information, more precise systems have been implemented, such as visual question-and-answer systems. A visual question-and-answer system is designed to provide direct and precise answers to questions asked in natural language. In this context, we propose an optimal deep neural network model based on an adaptive optimization algorithm, which takes medical images and natural language questions as input, then provides precise answers as output. Our model starts by classifying medical questions following an embedding phase. We then use a deep learning model for visual and textual feature extraction and emergence. In this paper, we aim to maximize the accuracy rate and minimize the number of epochs in order to accelerate the process. This is a multi-objective optimization problem. The selection of deep learning model parameters, such as epoch number and batch size, is an essential step in improving the model, thus, we use an adaptive genetic algorithm to determine the optimal deep learning parameters. Finally, we propose a dense layer for answer retrieval. To evaluate our model, we used the ImageCLEF 2019 VQA data set. Our model outperforms existing visual question-and-answer systems and offers a significantly higher retrieval accuracy rate.

Original languageEnglish
Pages (from-to)403-424
Number of pages22
JournalCybernetics and Systems
Volume53
Issue number5
DOIs
StatePublished - 2022

Keywords

  • Bi-LSTM
  • deep learning
  • EfficientNet
  • genetic algorithm
  • medical visual question answering
  • optimization

Fingerprint

Dive into the research topics of 'Optimal Deep Neural Network-Based Model for Answering Visual Medical Question'. Together they form a unique fingerprint.

Cite this