2021
DOI: 10.1007/s11633-020-1258-8
|View full text |Cite
|
Sign up to set email alerts
|

A Comprehensive Review of Group Activity Recognition in Videos

Abstract: Human group activity recognition (GAR) has attracted significant attention from computer vision researchers due to its wide practical applications in security surveillance, social role understanding and sports video analysis. In this paper, we give a comprehensive overview of the advances in group activity recognition in videos during the past 20 years. First, we provide a summary and comparison of 11 GAR video datasets in this field. Second, we survey the group activity recognition methods, including those ba… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
17
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
4
1
1

Relationship

0
6

Authors

Journals

citations
Cited by 29 publications
(17 citation statements)
references
References 88 publications
0
17
0
Order By: Relevance
“…Early work on GAR relies on handcrafted features [13,15,17,18,31,53,66]; yet notable progress has been made in recent years by deep-learning (DL) based approaches [22,41]. We review DL-based methods and refer readers to the comprehensive review of GAR presented in [97]. Early DLbased methods use Convolutional Neural Networks (CNNs) to extract features and then apply recurrent neural networks for temporal modeling [46,58,80,95].…”
Section: Group Activity Recognitionmentioning
confidence: 99%
See 4 more Smart Citations
“…Early work on GAR relies on handcrafted features [13,15,17,18,31,53,66]; yet notable progress has been made in recent years by deep-learning (DL) based approaches [22,41]. We review DL-based methods and refer readers to the comprehensive review of GAR presented in [97]. Early DLbased methods use Convolutional Neural Networks (CNNs) to extract features and then apply recurrent neural networks for temporal modeling [46,58,80,95].…”
Section: Group Activity Recognitionmentioning
confidence: 99%
“…Early DLbased methods use Convolutional Neural Networks (CNNs) to extract features and then apply recurrent neural networks for temporal modeling [46,58,80,95]. Since learning interperson interactions is essential for GAR [97], much of the research explores how to capture the actor relations [4,36,40,72,96]. Several works tackle this problem from a graphbased perspective [40,63,100,101] such as applying Graph Convolutional Networks (GCNs) [49,96].…”
Section: Group Activity Recognitionmentioning
confidence: 99%
See 3 more Smart Citations