Toward robust action retrieval in video

Samy Sadek; Ayoub Al-Hamadi; Bernd Michaelis; Usama Sayed

doi:10.5244/C.24.44

Toward robust action retrieval in video

Samy Sadek
, Ayoub Al-Hamadi
, Bernd Michaelis
, Usama Sayed

Research output: Contribution to conference › Paper › peer-review

15 Scopus citations

Abstract

Retrieving human actions from video databases is a paramount but challenging task in computer vision. In this work, we develop such a framework for robustly recognizing human actions in video sequences. The contribution of the paper is twofold. First a reliable neural model, the Multi-level Sigmoidal Neural Network (MSNN) as a classifier for the task of action recognition is presented. Second we unfold how the temporal shape variations can be accurately captured based on both temporal self-similarities and fuzzy log-polar histograms. When the method is evaluated on the popular KTH dataset, an average recognition rate of 94.3% is obtained. Such results have the potential to compare very favorably to those of other investigators published in the literature. Further the approach is amenable for real-time applications due to its low computational requirements.

Original language	English
DOIs	https://doi.org/10.5244/C.24.44
State	Published - 2010
Externally published	Yes
Event	2010 21st British Machine Vision Conference, BMVC 2010 - Aberystwyth, United Kingdom Duration: 31 Aug 2010 → 3 Sep 2010

Conference

Conference	2010 21st British Machine Vision Conference, BMVC 2010
Country/Territory	United Kingdom
City	Aberystwyth
Period	31/08/10 → 3/09/10

Access to Document

10.5244/C.24.44

Cite this

@conference{3111cdf15cb14344a3f7a64ec03951d6,

title = "Toward robust action retrieval in video",

abstract = "Retrieving human actions from video databases is a paramount but challenging task in computer vision. In this work, we develop such a framework for robustly recognizing human actions in video sequences. The contribution of the paper is twofold. First a reliable neural model, the Multi-level Sigmoidal Neural Network (MSNN) as a classifier for the task of action recognition is presented. Second we unfold how the temporal shape variations can be accurately captured based on both temporal self-similarities and fuzzy log-polar histograms. When the method is evaluated on the popular KTH dataset, an average recognition rate of 94.3\% is obtained. Such results have the potential to compare very favorably to those of other investigators published in the literature. Further the approach is amenable for real-time applications due to its low computational requirements.",

author = "Samy Sadek and Ayoub Al-Hamadi and Bernd Michaelis and Usama Sayed",

year = "2010",

doi = "10.5244/C.24.44",

language = "English",

note = "2010 21st British Machine Vision Conference, BMVC 2010 ; Conference date: 31-08-2010 Through 03-09-2010",

}

TY - CONF

T1 - Toward robust action retrieval in video

AU - Sadek, Samy

AU - Al-Hamadi, Ayoub

AU - Michaelis, Bernd

AU - Sayed, Usama

PY - 2010

Y1 - 2010

N2 - Retrieving human actions from video databases is a paramount but challenging task in computer vision. In this work, we develop such a framework for robustly recognizing human actions in video sequences. The contribution of the paper is twofold. First a reliable neural model, the Multi-level Sigmoidal Neural Network (MSNN) as a classifier for the task of action recognition is presented. Second we unfold how the temporal shape variations can be accurately captured based on both temporal self-similarities and fuzzy log-polar histograms. When the method is evaluated on the popular KTH dataset, an average recognition rate of 94.3% is obtained. Such results have the potential to compare very favorably to those of other investigators published in the literature. Further the approach is amenable for real-time applications due to its low computational requirements.

AB - Retrieving human actions from video databases is a paramount but challenging task in computer vision. In this work, we develop such a framework for robustly recognizing human actions in video sequences. The contribution of the paper is twofold. First a reliable neural model, the Multi-level Sigmoidal Neural Network (MSNN) as a classifier for the task of action recognition is presented. Second we unfold how the temporal shape variations can be accurately captured based on both temporal self-similarities and fuzzy log-polar histograms. When the method is evaluated on the popular KTH dataset, an average recognition rate of 94.3% is obtained. Such results have the potential to compare very favorably to those of other investigators published in the literature. Further the approach is amenable for real-time applications due to its low computational requirements.

UR - https://www.scopus.com/pages/publications/84898404037

U2 - 10.5244/C.24.44

DO - 10.5244/C.24.44

M3 - Paper

AN - SCOPUS:84898404037

T2 - 2010 21st British Machine Vision Conference, BMVC 2010

Y2 - 31 August 2010 through 3 September 2010

ER -

Toward robust action retrieval in video

Abstract

Conference

Access to Document

Other files and links

Fingerprint

Cite this