A Hybrid Duo-Deep Learning and Best Features Based Framework for Action Recognition

Muhammad Naeem Akbar; Farhan Riaz; Ahmed Bilal Awan; Muhammad Attique Khan; Usman Tariq; Saad Rehman

doi:10.32604/cmc.2022.028696

Management Information Systems

Research output: Contribution to journal › Article › peer-review

12 Scopus citations

Abstract

Human Action Recognition (HAR) is an active research topic in computer vision, driven by important applications such as video surveillance. Researchers have introduced various intelligent methods based on deep learning and machine learning, but challenges remain, such as similarity between actions and redundant features. This paper proposes a framework for accurate HAR based on deep learning and an improved feature optimization algorithm. The framework comprises several critical steps, from deep learning feature extraction to feature classification. The original video frames are normalized before the fine-tuned deep learning models, MobileNet-V2 and Darknet53, are trained. Pre-trained deep models are used for feature extraction, and the extracted features are fused using the canonical correlation approach. An improved particle swarm optimization (IPSO)-based algorithm then selects the best features, which are used to classify actions with various classifiers. Experiments were performed on six publicly available datasets, KTH, UT-Interaction, UCF Sports, Hollywood, IXMAS, and UCF YouTube, attaining accuracies of 98.3%, 98.9%, 99.8%, 99.6%, 98.6%, and 100%, respectively. Compared with existing techniques, the proposed framework achieved improved accuracy.
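
The feature-selection step named in the abstract can be illustrated with a minimal sketch. This is a plain binary particle swarm optimization over 0/1 feature masks, not the authors' IPSO, whose modifications this record does not describe. The function names, the parameter values (`w`, `c1`, `c2`), the `RELEVANCE` scores, and the toy fitness are all assumptions made for illustration; in a real pipeline the fitness would more plausibly be a classifier's validation accuracy on the selected fused features.

```python
import math
import random

# Hypothetical per-feature relevance scores and per-feature cost (illustration only).
RELEVANCE = [0.9, 0.1, 0.8, 0.05, 0.7, 0.02]
PENALTY = 0.3


def toy_fitness(mask):
    """Reward relevant features and charge a fixed cost for each selected one."""
    return sum(r - PENALTY for r, bit in zip(RELEVANCE, mask) if bit)


def binary_pso_select(fitness, n_features, n_particles=12, n_iters=30, seed=0):
    """Plain binary PSO over feature masks; returns (best_mask, best_fitness)."""
    rng = random.Random(seed)
    w, c1, c2 = 0.7, 1.5, 1.5  # inertia and acceleration weights (assumed values)

    # Each particle holds a 0/1 mask; velocities are real-valued.
    pos = [[rng.randint(0, 1) for _ in range(n_features)] for _ in range(n_particles)]
    vel = [[0.0] * n_features for _ in range(n_particles)]
    best_pos = [p[:] for p in pos]
    best_fit = [fitness(p) for p in pos]
    g = max(range(n_particles), key=lambda i: best_fit[i])
    g_pos, g_fit = best_pos[g][:], best_fit[g]

    for _ in range(n_iters):
        for i in range(n_particles):
            for d in range(n_features):
                r1, r2 = rng.random(), rng.random()
                vel[i][d] = (w * vel[i][d]
                             + c1 * r1 * (best_pos[i][d] - pos[i][d])
                             + c2 * r2 * (g_pos[d] - pos[i][d]))
                # Sigmoid transfer: the velocity sets the probability of selecting feature d.
                prob = 1.0 / (1.0 + math.exp(-vel[i][d]))
                pos[i][d] = 1 if rng.random() < prob else 0
            fit = fitness(pos[i])
            if fit > best_fit[i]:
                best_fit[i], best_pos[i] = fit, pos[i][:]
                if fit > g_fit:
                    g_fit, g_pos = fit, pos[i][:]
    return g_pos, g_fit


if __name__ == "__main__":
    mask, score = binary_pso_select(toy_fitness, len(RELEVANCE))
    print(mask, score)
```

The returned mask would then be used to keep only the selected columns of the fused feature matrix before classification.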

Original language	English
Pages (from-to)	2555-2576
Number of pages	22
Journal	Computers, Materials and Continua
Volume	73
Issue number	2
DOIs	https://doi.org/10.32604/cmc.2022.028696
State	Published - 2022

Keywords

Action recognition
deep learning
features fusion
features selection
recognition

Access to Document

10.32604/cmc.2022.028696

Cite this

@article{8e386dd0cc7240e6a20bc3ac82bfddee,

title = "A Hybrid Duo-Deep Learning and Best Features Based Framework for Action Recognition",

abstract = "Human Action Recognition (HAR) is a current research topic in the field of computer vision that is based on an important application known as video surveillance. Researchers in computer vision have introduced various intelligent methods based on deep learning and machine learning, but they still face many challenges such as similarity in various actions and redundant features. We proposed a framework for accurate human action recognition (HAR) based on deep learning and an improved features optimization algorithm in this paper. From deep learning feature extraction to feature classification, the proposed framework includes several critical steps. Before training fine-tuned deep learning models – MobileNet-V2 and Darknet53 – the original video frames are normalized. For feature extraction, pre-trained deep models are used, which are fused using the canonical correlation approach. Following that, an improved particle swarm optimization (IPSO)-based algorithm is used to select the best features. Following that, the selected features were used to classify actions using various classifiers. The experimental process was performed on six publicly available datasets such as KTH, UT-Interaction, UCF Sports, Hollywood, IXMAS, and UCF YouTube, which attained an accuracy of 98.3\%, 98.9\%, 99.8\%, 99.6\%, 98.6\%, and 100\%, respectively. In comparison with existing techniques, it is observed that the proposed framework achieved improved accuracy.",

keywords = "Action recognition, deep learning, features fusion, features selection, recognition",

author = "Akbar, {Muhammad Naeem} and Farhan Riaz and Awan, {Ahmed Bilal} and Khan, {Muhammad Attique} and Usman Tariq and Saad Rehman",

year = "2022",

doi = "10.32604/cmc.2022.028696",

language = "English",

volume = "73",

pages = "2555--2576",

journal = "Computers, Materials and Continua",

issn = "1546-2218",

publisher = "Tech Science Press",

number = "2",

}

TY - JOUR

T1 - A Hybrid Duo-Deep Learning and Best Features Based Framework for Action Recognition

AU - Akbar, Muhammad Naeem

AU - Riaz, Farhan

AU - Awan, Ahmed Bilal

AU - Khan, Muhammad Attique

AU - Tariq, Usman

AU - Rehman, Saad

PY - 2022

Y1 - 2022

N2 - Human Action Recognition (HAR) is a current research topic in the field of computer vision that is based on an important application known as video surveillance. Researchers in computer vision have introduced various intelligent methods based on deep learning and machine learning, but they still face many challenges such as similarity in various actions and redundant features. We proposed a framework for accurate human action recognition (HAR) based on deep learning and an improved features optimization algorithm in this paper. From deep learning feature extraction to feature classification, the proposed framework includes several critical steps. Before training fine-tuned deep learning models – MobileNet-V2 and Darknet53 – the original video frames are normalized. For feature extraction, pre-trained deep models are used, which are fused using the canonical correlation approach. Following that, an improved particle swarm optimization (IPSO)-based algorithm is used to select the best features. Following that, the selected features were used to classify actions using various classifiers. The experimental process was performed on six publicly available datasets such as KTH, UT-Interaction, UCF Sports, Hollywood, IXMAS, and UCF YouTube, which attained an accuracy of 98.3%, 98.9%, 99.8%, 99.6%, 98.6%, and 100%, respectively. In comparison with existing techniques, it is observed that the proposed framework achieved improved accuracy.

KW - Action recognition

KW - deep learning

KW - features fusion

KW - features selection

KW - recognition

UR - https://www.scopus.com/pages/publications/85132804054

U2 - 10.32604/cmc.2022.028696

DO - 10.32604/cmc.2022.028696

M3 - Article

AN - SCOPUS:85132804054

SN - 1546-2218

VL - 73

SP - 2555

EP - 2576

JO - Computers, Materials and Continua

JF - Computers, Materials and Continua

IS - 2

ER -
