Mind the regularized GAP, for human action classification and semi-supervised localization based on visual saliency - Archive ouverte HAL Access content directly
Conference Papers Year : 2018

Mind the regularized GAP, for human action classification and semi-supervised localization based on visual saliency

(1, 2) , (1) , (3) , (2)
1
2
3

Abstract

This work addresses the issue of image classification and localization of human actions based on visual data acquired from RGB sensors. Our approach is inspired by the success of deep learning in image classification. In this paper, we describe our method and how the concept of Global Average Pooling (GAP) applies in the context of semi-supervised class localization. We benchmark it with respect to Class Activation Mapping initiated in (Zhou et al., 2016), propose a regularization over the GAP maps to enhance the results, and study whether a combination of these two ideas can result in a better classification accuracy. The models are trained and tested on the Stanford 40 Action dataset (Yao et al., 2011) describing people performing 40 different actions such as drinking, cooking or watching TV. Compared to the aforementioned baseline, our model improves the classification accuracy by 5.3 percent points, achieves a localization accuracy of 50.3%, and drastically diminishes the computation needed to retrieve the class saliency from the base convolutional model.
Fichier principal
Vignette du fichier
paper-final.pdf (5.87 Mo) Télécharger le fichier
Origin : Files produced by the author(s)
Loading...

Dates and versions

hal-01763103 , version 1 (10-04-2018)

Licence

Attribution - NonCommercial - NoDerivatives - CC BY 4.0

Identifiers

Cite

Marc Moreaux, Natalia Lyubova, Isabelle Ferrané, Frédéric Lerasle. Mind the regularized GAP, for human action classification and semi-supervised localization based on visual saliency. 13th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISAPP 208), Jan 2018, Funchal, Portugal. pp.307-314, ⟨10.5220/0006548303070314⟩. ⟨hal-01763103⟩
79 View
0 Download

Altmetric

Share

Gmail Facebook Twitter LinkedIn More