TY - JOUR
T1 - Robust hand gesture recognition using multiple shape-oriented visual cues
AU - Bakheet, Samy
AU - Al-Hamadi, Ayoub
N1 - Publisher Copyright:
© 2021, The Author(s).
PY - 2021/12
Y1 - 2021/12
N2 - Robust vision-based hand pose estimation is highly sought after but remains a challenging task, due in part to self-occlusion among the fingers of the hand. In this paper, an innovative framework for real-time static hand gesture recognition is introduced, based on an optimized shape representation built from multiple shape cues. The framework incorporates a dedicated module for hand pose estimation based on depth map data, in which the hand silhouette is first extracted from the detailed and accurate depth map captured by a time-of-flight (ToF) depth sensor. A hybrid multi-modal descriptor that integrates multiple affine-invariant boundary-based and region-based features is then created from the hand silhouette to obtain a reliable and representative description of individual gestures. Finally, an ensemble of one-vs.-all support vector machines (SVMs) is trained independently on each of these learned feature representations to perform gesture classification. When evaluated on a publicly available dataset comprising a relatively large and diverse collection of egocentric hand gestures, the approach yields encouraging results that compare favorably with those reported in the literature, while maintaining real-time operation.
AB - Robust vision-based hand pose estimation is highly sought after but remains a challenging task, due in part to self-occlusion among the fingers of the hand. In this paper, an innovative framework for real-time static hand gesture recognition is introduced, based on an optimized shape representation built from multiple shape cues. The framework incorporates a dedicated module for hand pose estimation based on depth map data, in which the hand silhouette is first extracted from the detailed and accurate depth map captured by a time-of-flight (ToF) depth sensor. A hybrid multi-modal descriptor that integrates multiple affine-invariant boundary-based and region-based features is then created from the hand silhouette to obtain a reliable and representative description of individual gestures. Finally, an ensemble of one-vs.-all support vector machines (SVMs) is trained independently on each of these learned feature representations to perform gesture classification. When evaluated on a publicly available dataset comprising a relatively large and diverse collection of egocentric hand gestures, the approach yields encouraging results that compare favorably with those reported in the literature, while maintaining real-time operation.
KW - Fourier descriptor
KW - Hand gesture recognition
KW - Moment invariants
KW - Shape-oriented features
KW - SVM
UR - http://www.scopus.com/inward/record.url?scp=85111339530&partnerID=8YFLogxK
U2 - 10.1186/s13640-021-00567-1
DO - 10.1186/s13640-021-00567-1
M3 - Article
AN - SCOPUS:85111339530
SN - 1687-5176
VL - 2021
JO - EURASIP Journal on Image and Video Processing
JF - EURASIP Journal on Image and Video Processing
IS - 1
M1 - 26
ER -