Mind the regularized GAP, for human action classification and semi-supervised localization based on visual saliency - LAAS - Laboratoire d'Analyse et d'Architecture des Systèmes Accéder directement au contenu
Communication Dans Un Congrès Année : 2018

Mind the regularized GAP, for human action classification and semi-supervised localization based on visual saliency

Résumé

This work addresses the issue of image classification and localization of human actions based on visual data acquired from RGB sensors. Our approach is inspired by the success of deep learning in image classification. In this paper, we describe our method and how the concept of Global Average Pooling (GAP) applies in the context of semi-supervised class localization. We benchmark it with respect to Class Activation Mapping initiated in (Zhou et al., 2016), propose a regularization over the GAP maps to enhance the results, and study whether a combination of these two ideas can result in a better classification accuracy. The models are trained and tested on the Stanford 40 Action dataset (Yao et al., 2011) describing people performing 40 different actions such as drinking, cooking or watching TV. Compared to the aforementioned baseline, our model improves the classification accuracy by 5.3 percent points, achieves a localization accuracy of 50.3%, and drastically diminishes the computation needed to retrieve the class saliency from the base convolutional model.
Fichier principal
Vignette du fichier
paper-final.pdf (5.87 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-01763103 , version 1 (10-04-2018)

Licence

Paternité - Pas d'utilisation commerciale - Pas de modification

Identifiants

Citer

Marc Moreaux, Natalia Lyubova, Isabelle Ferrané, Frédéric Lerasle. Mind the regularized GAP, for human action classification and semi-supervised localization based on visual saliency. 13th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISAPP 208), Jan 2018, Funchal, Portugal. pp.307-314, ⟨10.5220/0006548303070314⟩. ⟨hal-01763103⟩
106 Consultations
9 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More