HybridHR-Net: Action Recognition in Video Sequences Using Optimal Deep Learning Fusion Assisted Framework

Muhammad Naeem Akbar, Seemab Khan, Muhammad Umar Farooq, Majed Alhaisoni, Usman Tariq, Muhammad Usman Akram

Research output: Contribution to journalArticlepeer-review

4 Scopus citations

Abstract

The combination of spatiotemporal videos and essential features can improve the performance of human action recognition (HAR); however, the individual type of features usually degrades the performance due to similar actions and complex backgrounds. The deep convolutional neural network has improved performance in recent years for several computer vision applications due to its spatial information. This article proposes a new framework called for video surveillance human action recognition dubbed HybridHR-Net. On a few selected datasets, deep transfer learning is used to pre-trained the EfficientNet-b0 deep learning model. Bayesian optimization is employed for the tuning of hyperparameters of the fine-tuned deep model. Instead of fully connected layer features, we considered the average pooling layer features and performed two feature selection techniques-an improved artificial bee colony and an entropy-based approach. Using a serial nature technique, the features that were selected are combined into a single vector, and then the results are categorized by machine learning classifiers. Five publically accessible datasets have been utilized for the experimental approach and obtained notable accuracy of 97%, 98.7%, 100%, 99.7%, and 96.8%, respectively. Additionally, a comparison of the proposed framework with contemporary methods is done to demonstrate the increase in accuracy.

Original languageEnglish
Pages (from-to)3275-3295
Number of pages21
JournalComputers, Materials and Continua
Volume76
Issue number3
DOIs
StatePublished - 2023

Keywords

  • Action recognition
  • artificial bee colony
  • deep learning
  • entropy
  • feature fusion
  • transfer learning

Fingerprint

Dive into the research topics of 'HybridHR-Net: Action Recognition in Video Sequences Using Optimal Deep Learning Fusion Assisted Framework'. Together they form a unique fingerprint.

Cite this