Human-Based Interaction Analysis via Automated Key Point Detection and Neural Network Model

Israr Akhtar; Naif Al Mudawi; Bayan Ibrahimm Alabdullah; Mohammed Alonazi; Jeongmin Park

doi:10.1109/ACCESS.2023.3314341

Human-Based Interaction Analysis via Automated Key Point Detection and Neural Network Model

Israr Akhtar
, Naif Al Mudawi
, Bayan Ibrahimm Alabdullah
, Mohammed Alonazi
, Jeongmin Park

Information Systems

Research output: Contribution to journal › Article › peer-review

4 Scopus citations

Abstract

The human interaction with an object is one of the most challenging domains in real-life applications, such as smart homes, surveillance, medical, education, safety-based application of computer vision, and artificial intelligence. In this research article, we have proposed a framework for human and object interaction in real-life examples such as sports and other activities. Initially, we reviewed video-based data by considering the three state-of-the-art data sets. Preprocessing steps have been followed to avoid extra costs, such as video-to-frame conversion, noise reduction and background subtraction. Human silhouette extraction has been performed via the Gaussian mixture model (GMM) and supper pixel model. Next, human body points and object location detection were performed. Finally, human and object-based features have been extracted. To minimize the features replication and to achieve optimized results, we have applied stochastic gradient descent and Restricted Boltzmann Machine; As a result, we have achieved an accuracy of 88.46%, 82.00%, and 88.30% on human body parts recognition over the MPII dataset, UCF_aerial dataset, and wild Dataset respectively. The classification accuracy for the MPII dataset is 92.71%, for the UCF_aerial dataset is 90.60%, and for sports video in the wild Dataset is 92.42%. We have achieved a high accuracy rate compared to other state-of-the-art methods and frameworks due to the complex feature extraction and optimization approach.

Original language	English
Pages (from-to)	100646-100658
Number of pages	13
Journal	IEEE Access
Volume	11
DOIs	https://doi.org/10.1109/ACCESS.2023.3314341
State	Published - 2023

Keywords

Features optimization
human body key points detection
human-object interaction analysis
restricted Boltzmann machine
skeletonization
stochastic gradient decent
trajectories

Access to Document

10.1109/ACCESS.2023.3314341

Cite this

@article{31ea5a72cddd482383ef19509b12728c,

title = "Human-Based Interaction Analysis via Automated Key Point Detection and Neural Network Model",

abstract = "The human interaction with an object is one of the most challenging domains in real-life applications, such as smart homes, surveillance, medical, education, safety-based application of computer vision, and artificial intelligence. In this research article, we have proposed a framework for human and object interaction in real-life examples such as sports and other activities. Initially, we reviewed video-based data by considering the three state-of-the-art data sets. Preprocessing steps have been followed to avoid extra costs, such as video-to-frame conversion, noise reduction and background subtraction. Human silhouette extraction has been performed via the Gaussian mixture model (GMM) and supper pixel model. Next, human body points and object location detection were performed. Finally, human and object-based features have been extracted. To minimize the features replication and to achieve optimized results, we have applied stochastic gradient descent and Restricted Boltzmann Machine; As a result, we have achieved an accuracy of 88.46\%, 82.00\%, and 88.30\% on human body parts recognition over the MPII dataset, UCF\_aerial dataset, and wild Dataset respectively. The classification accuracy for the MPII dataset is 92.71\%, for the UCF\_aerial dataset is 90.60\%, and for sports video in the wild Dataset is 92.42\%. We have achieved a high accuracy rate compared to other state-of-the-art methods and frameworks due to the complex feature extraction and optimization approach.",

keywords = "Features optimization, human body key points detection, human-object interaction analysis, restricted Boltzmann machine, skeletonization, stochastic gradient decent, trajectories",

author = "Israr Akhtar and Mudawi, \{Naif Al\} and Alabdullah, \{Bayan Ibrahimm\} and Mohammed Alonazi and Jeongmin Park",

note = "Publisher Copyright: {\textcopyright} 2013 IEEE.",

year = "2023",

doi = "10.1109/ACCESS.2023.3314341",

language = "English",

volume = "11",

pages = "100646--100658",

journal = "IEEE Access",

issn = "2169-3536",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

}

TY - JOUR

T1 - Human-Based Interaction Analysis via Automated Key Point Detection and Neural Network Model

AU - Akhtar, Israr

AU - Mudawi, Naif Al

AU - Alabdullah, Bayan Ibrahimm

AU - Alonazi, Mohammed

AU - Park, Jeongmin

PY - 2023

Y1 - 2023

N2 - The human interaction with an object is one of the most challenging domains in real-life applications, such as smart homes, surveillance, medical, education, safety-based application of computer vision, and artificial intelligence. In this research article, we have proposed a framework for human and object interaction in real-life examples such as sports and other activities. Initially, we reviewed video-based data by considering the three state-of-the-art data sets. Preprocessing steps have been followed to avoid extra costs, such as video-to-frame conversion, noise reduction and background subtraction. Human silhouette extraction has been performed via the Gaussian mixture model (GMM) and supper pixel model. Next, human body points and object location detection were performed. Finally, human and object-based features have been extracted. To minimize the features replication and to achieve optimized results, we have applied stochastic gradient descent and Restricted Boltzmann Machine; As a result, we have achieved an accuracy of 88.46%, 82.00%, and 88.30% on human body parts recognition over the MPII dataset, UCF_aerial dataset, and wild Dataset respectively. The classification accuracy for the MPII dataset is 92.71%, for the UCF_aerial dataset is 90.60%, and for sports video in the wild Dataset is 92.42%. We have achieved a high accuracy rate compared to other state-of-the-art methods and frameworks due to the complex feature extraction and optimization approach.

AB - The human interaction with an object is one of the most challenging domains in real-life applications, such as smart homes, surveillance, medical, education, safety-based application of computer vision, and artificial intelligence. In this research article, we have proposed a framework for human and object interaction in real-life examples such as sports and other activities. Initially, we reviewed video-based data by considering the three state-of-the-art data sets. Preprocessing steps have been followed to avoid extra costs, such as video-to-frame conversion, noise reduction and background subtraction. Human silhouette extraction has been performed via the Gaussian mixture model (GMM) and supper pixel model. Next, human body points and object location detection were performed. Finally, human and object-based features have been extracted. To minimize the features replication and to achieve optimized results, we have applied stochastic gradient descent and Restricted Boltzmann Machine; As a result, we have achieved an accuracy of 88.46%, 82.00%, and 88.30% on human body parts recognition over the MPII dataset, UCF_aerial dataset, and wild Dataset respectively. The classification accuracy for the MPII dataset is 92.71%, for the UCF_aerial dataset is 90.60%, and for sports video in the wild Dataset is 92.42%. We have achieved a high accuracy rate compared to other state-of-the-art methods and frameworks due to the complex feature extraction and optimization approach.

KW - Features optimization

KW - human body key points detection

KW - human-object interaction analysis

KW - restricted Boltzmann machine

KW - skeletonization

KW - stochastic gradient decent

KW - trajectories

UR - https://www.scopus.com/pages/publications/85171537597

U2 - 10.1109/ACCESS.2023.3314341

DO - 10.1109/ACCESS.2023.3314341

M3 - Article

AN - SCOPUS:85171537597

SN - 2169-3536

VL - 11

SP - 100646

EP - 100658

JO - IEEE Access

JF - IEEE Access

ER -

Human-Based Interaction Analysis via Automated Key Point Detection and Neural Network Model

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this