Abstract
Retrieving human actions from video databases is a paramount but challenging task in computer vision. In this work, we develop such a framework for robustly recognizing human actions in video sequences. The contribution of the paper is twofold. First a reliable neural model, the Multi-level Sigmoidal Neural Network (MSNN) as a classifier for the task of action recognition is presented. Second we unfold how the temporal shape variations can be accurately captured based on both temporal self-similarities and fuzzy log-polar histograms. When the method is evaluated on the popular KTH dataset, an average recognition rate of 94.3% is obtained. Such results have the potential to compare very favorably to those of other investigators published in the literature. Further the approach is amenable for real-time applications due to its low computational requirements.
| Original language | English |
|---|---|
| DOIs | |
| State | Published - 2010 |
| Externally published | Yes |
| Event | 2010 21st British Machine Vision Conference, BMVC 2010 - Aberystwyth, United Kingdom Duration: 31 Aug 2010 → 3 Sep 2010 |
Conference
| Conference | 2010 21st British Machine Vision Conference, BMVC 2010 |
|---|---|
| Country/Territory | United Kingdom |
| City | Aberystwyth |
| Period | 31/08/10 → 3/09/10 |
Fingerprint
Dive into the research topics of 'Toward robust action retrieval in video'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver