Hybrid features for skeleton-based action recognition based on network fusion.

Chen, Z., Pan, J., Yang, X. and Qin, H., 2020. Hybrid features for skeleton-based action recognition based on network fusion. Computer Animation and Virtual Worlds, 31 (4-5), e1952.

Full text available as:

Hybrid+Features+for+Skeleton-based+Action+Recognition+based+on+Network+Fusion.pdf - Accepted Version
Available under License Creative Commons Attribution Non-commercial.
1MB

Copyright to original material in this document is with the original owner(s). Access to this content through BURO is granted on condition that you use it only for research, scholarly or other non-commercial purposes. If you wish to use it for any other purposes, you must contact BU via BURO@bournemouth.ac.uk.

Any third party copyright material in this document remains the property of its respective owner(s). BU grants no licence for further use of that third party material.

DOI: 10.1002/cav.1952

Abstract

© 2020 John Wiley & Sons, Ltd. In recent years, skeleton-based human action recognition has attracted significant attention from researchers and practitioners in graphics, vision, animation, and virtual environments. The most fundamental issue is how to learn an effective and accurate representation from spatiotemporal action sequences to improve recognition performance, and this article aims to address that challenge. In particular, we design a novel method of hybrid feature extraction based on the construction of multistream networks and their organic fusion. First, we train a convolutional neural network (CNN) model to learn CNN-based features, with the raw skeleton coordinates and their temporal differences serving as input signals. An attention mechanism is injected into the CNN model to give more weight to the more effective and important information. Then, we employ long short-term memory (LSTM) to obtain long-term temporal features from action sequences. Finally, we generate the hybrid features by fusing the CNN and LSTM networks, and we classify action types with the hybrid features. Extensive experiments are performed on several large-scale publicly available databases, and promising results demonstrate the effectiveness of our proposed framework.
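
The two input signals named in the abstract (raw skeleton coordinates and their temporal differences) and the final fusion step can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the nested-list data layout (frames × joints × 3 coordinates), and the use of plain concatenation as the fusion operator are all assumptions made for illustration, since the abstract does not specify them.

```python
from typing import List

# Assumed layout: a frame is a list of joints, each joint a list of 3 coordinates.
Frame = List[List[float]]


def temporal_differences(seq: List[Frame]) -> List[Frame]:
    """Frame-to-frame displacement of every joint.

    For T input frames the result has T - 1 frames, each holding
    (x, y, z) at frame t+1 minus (x, y, z) at frame t for every joint.
    """
    return [
        [[c1 - c0 for c0, c1 in zip(j0, j1)] for j0, j1 in zip(f0, f1)]
        for f0, f1 in zip(seq, seq[1:])
    ]


def fuse_by_concatenation(cnn_features: List[float],
                          lstm_features: List[float]) -> List[float]:
    """Hypothetical fusion: join the two feature vectors end to end.

    Concatenation is only one possible choice; the abstract does not
    state which fusion operator the paper uses.
    """
    return list(cnn_features) + list(lstm_features)


# A toy sequence: 3 frames, 2 joints, 3 coordinates per joint.
sequence = [
    [[0.0, 0.0, 0.0], [1.0, 1.0, 1.0]],
    [[1.0, 0.0, 0.0], [1.0, 2.0, 1.0]],
    [[3.0, 0.0, 0.0], [1.0, 2.0, 4.0]],
]
motion = temporal_differences(sequence)

# Placeholder feature vectors standing in for the CNN and LSTM outputs.
hybrid = fuse_by_concatenation([0.1, 0.2], [0.3, 0.4, 0.5])
```

In this sketch, `sequence` and `motion` would be the two streams fed to the CNN, while `hybrid` stands in for the fused feature vector passed to a classifier.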

Item Type:	Article
ISSN:	1546-4261
Uncontrolled Keywords:	action recognition; CNN; human skeleton; hybrid features; LSTM; multistream neural network
Group:	Faculty of Media & Communication
ID Code:	34481
Deposited By:	Symplectic RT2
Deposited On:	02 Sep 2020 13:39
Last Modified:	14 Mar 2022 14:23
