Learning Group Activity Features Through Person Attribute Prediction

¹Toyota Technological Institute, Japan  ²University of Hyogo, Japan
CVPR 2024

Abstract

This paper proposes Group Activity Feature (GAF) learning, in which features of multi-person activity are learned as a compact latent vector. Unlike prior work, which requires manual annotation of group activities for supervised learning, our method learns the GAF through person attribute prediction without group activity annotations. By training the whole network end-to-end so that the GAF is required for predicting the attributes of people in a group, the GAF is trained to represent the features of multi-person activity. As person attributes, we propose to use a person's action class and appearance features because the former is easy to annotate due to its simplicity, and the latter requires no manual annotation. In addition, we introduce location-guided attribute prediction to disentangle the complex GAF so that the features of each target person are extracted properly. Experimental results validate that our method outperforms state-of-the-art methods quantitatively and qualitatively on two public datasets. Visualization of our GAF also demonstrates that our method learns a GAF that represents fine-grained group activity classes.
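
The idea of location-guided attribute prediction can be illustrated with a toy sketch. It is not the authors' implementation: the dimensions, the single linear head, the random weights, and the names `predict_action`, `GAF_DIM`, and `NUM_ACTIONS` are all assumptions made for illustration. It only shows how one shared group vector, combined with a target person's location, could yield a per-person action prediction.

```python
import math
import random

random.seed(0)

GAF_DIM = 8       # size of the compact group feature (hypothetical)
NUM_ACTIONS = 4   # number of person action classes (hypothetical)


def linear(weights, bias, x):
    """Apply y = W x + b for a list-of-rows weight matrix."""
    return [sum(w * v for w, v in zip(row, x)) + b for row, b in zip(weights, bias)]


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def predict_action(gaf, location, weights, bias):
    """Predict one person's action distribution from the shared GAF and that person's (x, y) location."""
    query = list(gaf) + list(location)  # the location tells the head whose attribute to decode
    return softmax(linear(weights, bias, query))


# Randomly initialized head (assumption); in training, the head and the
# GAF encoder would be optimized jointly so the GAF must carry group information.
in_dim = GAF_DIM + 2
weights = [[random.uniform(-1, 1) for _ in range(in_dim)] for _ in range(NUM_ACTIONS)]
bias = [0.0] * NUM_ACTIONS

gaf = [random.uniform(-1, 1) for _ in range(GAF_DIM)]  # stand-in for the encoder output
locations = [(0.2, 0.5), (0.8, 0.4)]                   # normalized positions of two people

probs = [predict_action(gaf, loc, weights, bias) for loc in locations]
```

Because the same `gaf` is reused for every person and only the location changes, any per-person information the head recovers has to be encoded in the group vector, which is the intuition the abstract describes.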

BibTeX

@InProceedings{Nakatani_2024_CVPR,
        author    = {Nakatani, Chihiro and Kawashima, Hiroaki and Ukita, Norimichi},
        title     = {Learning Group Activity Features Through Person Attribute Prediction},
        booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
        month     = {June},
        year      = {2024},
}