Proceedings of the 3rd ACM Conference on International Conference on Multimedia Retrieval 2013
DOI: 10.1145/2461466.2461504

Exploiting language models to recognize unseen actions

Abstract: This paper addresses the problem of human action recognition. Typically, visual action recognition systems need visual training examples for all actions that one wants to recognize. However, the total number of possible actions is staggering as not only are there many types of actions but also many possible objects for each action type. Normally, visual training examples are needed for all actions of this combinatorial explosion of possibilities. To address this problem, this paper is a first attempt to propos…

Cited by 21 publications (11 citation statements)
References 22 publications
“…
Dataset                                  Verbs  Acts  Images  Sen  Des
PPMI (Yao and Fei-Fei, 2010)             2      24    4800    N    N
Stanford 40 Actions (Yao et al, 2011)    33     40    9532    N    N
PASCAL 2012 (Everingham et al, 2015)     9      11    4588    N    N
89 Actions (Le et al, 2013)              36     89    2038    N    N
TUHOI (Le et al, 2014)                   -      2974  10805   N    N
COCO-a (Ronchi and Perona, 2015)         140    162   10000   N    Y
HICO (Chao et al, 2015)                  111    600   47774   Y    N
VerSe (our dataset)                      90     163   3518    Y    Y
…”
Section: Dataset (mentioning)
confidence: 91%
“…Action images in sports (Gupta, Kembhavi, and Davis 2009; Li and Li 2007) are among the earliest datasets introduced for research. Daily activity datasets (Le, Bernardi, and Uijlings 2013) contain common human activities in daily life. The latest version of Pascal VOC (Maji, Bourdev, and Malik 2011) competition includes ten categories of still image actions, with only a subset of people annotated (bounding box + action).…”
Section: Related Work (mentioning)
confidence: 99%
“…Earlier efforts such as Gupta et al [2009], Everingham et al [2010], Yao and Fei-Fei [2010], Yao et al [2011], Le et al [2013] used in-house annotators to label 6-89 human actions (such as "reading," "riding a bike," "playing guitar," or "holding a guitar").…”
Section: Actions and Interactions In Images (mentioning)
confidence: 99%