Deep Ensemble Learning for Human Action Recognition in Still Images

Publication Date

2020-01-31

Country of Publication

Egypt

No. of Pages

Main Subjects

Philosophy

Abstract EN

Numerous human actions such as “Phoning,” “PlayingGuitar,” and “RidingHorse” can be inferred by static cue-based approaches even if their motions in video are available considering one single still image may already sufficiently explain a particular action.

In this research, we investigate human action recognition in still images and utilize deep ensemble learning to automatically decompose the body pose and perceive its background information.

Firstly, we construct an end-to-end NCNN-based model by attaching the nonsequential convolutional neural network (NCNN) module to the top of the pretrained model.

The nonsequential network topology of NCNN can separately learn the spatial- and channel-wise features with parallel branches, which helps improve the model performance.

Subsequently, in order to further exploit the advantage of the nonsequential topology, we propose an end-to-end deep ensemble learning based on the weight optimization (DELWO) model.

It contributes to fusing the deep information derived from multiple models automatically from the data.

Finally, we design the deep ensemble learning based on voting strategy (DELVS) model to pool together multiple deep models with weighted coefficients to obtain a better prediction.

More importantly, the model complexity can be reduced by lessening the number of trainable parameters, thereby effectively mitigating overfitting issues of the model in small datasets to some extent.

We conduct experiments in Li’s action dataset, uncropped and 1.5x cropped Willow action datasets, and the results have validated the effectiveness and robustness of our proposed models in terms of mitigating overfitting issues in small datasets.

Finally, we open source our code for the model in GitHub (https://github.com/yxchspring/deep_ensemble_learning) in order to share our model with the community.

American Psychological Association (APA)

Yu, Xiangchun& Zhang, Zhe& Wu, Lei& Pang, Wei& Chen, Hechang& Yu, Zhezhou…[et al.]. 2020. Deep Ensemble Learning for Human Action Recognition in Still Images. Complexity،Vol. 2020, no. 2020, pp.1-23.
https://search.emarefa.net/detail/BIM-1145575

Modern Language Association (MLA)

Yu, Xiangchun…[et al.]. Deep Ensemble Learning for Human Action Recognition in Still Images. Complexity No. 2020 (2020), pp.1-23.
https://search.emarefa.net/detail/BIM-1145575

American Medical Association (AMA)

Yu, Xiangchun& Zhang, Zhe& Wu, Lei& Pang, Wei& Chen, Hechang& Yu, Zhezhou…[et al.]. Deep Ensemble Learning for Human Action Recognition in Still Images. Complexity. 2020. Vol. 2020, no. 2020, pp.1-23.
https://search.emarefa.net/detail/BIM-1145575

Data Type

Journal Articles

Language

English

Notes

Includes bibliographical references

Record ID

BIM-1145575

SaveSaved Print

Arab Citation & Impact Factor "Arcif"

Largest Arabic Database of Citations Analysis for the Arabic Scholarly Journals Issued in Arab World.

e-Marefa Platform for Arabic Textbook.

"Kashif" for Checking Similarity or Plagiarism in the Arabic Researches. know more