2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2020
DOI: 10.1109/cvpr42600.2020.01329

Object Relational Graph With Teacher-Recommended Learning for Video Captioning

Citations: cited by 226 publications (99 citation statements)
References: 37 publications
“…Note that many of them were published after our initial submission of the present work in the end of May in 2019. Specifically, compared with ORG-TRL (Zhang et al, 2020), the previous state-of-the-art model on this dataset, BLEU-4, CIDEr, METEOR, and ROUGE-L are improved relatively by 14.9, 15.2, 7.1, and 4.2%, respectively. Our model has the highest overall score as defined in (16).…”
Section: Methods (mentioning)
confidence: 90%
“…Table 1 displays the performance of several models on YouTube2Text. We compare our model with existing methods, including LSTM-E (Pan et al, 2016), h-RNN (Yu et al, 2016), aLSTMs (Gao et al, 2017), SCN (Gan et al, 2017), MTVC (Pasunuru and Bansal, 2017a), ECO (Zolfaghari et al, 2018), SibNet (Liu et al, 2018), POS (Wang et al, 2019a), MARN (Pei et al, 2019), JSRL-VCT (Hou et al, 2019), GRU-EVE (Aafaq et al, 2019), STG-KD (Pan et al, 2020), SAAT (Zheng et al, 2020), and ORG-TRL (Zhang et al, 2020). Our method outperforms all the other methods on all the metrics by a large margin.…”
Section: Methods (mentioning)
confidence: 99%
“…Zhang et al [155] have proposed an Object Relational Graph with Teacher-Recommended Learning (ORG-TRL) system, which consists of an Object-Relational Graph (ORG) based encoder, which can capture more comprehensive interaction features to improve visual representation. In the data preprocessing phase, Inception-Resnet-v2 and C3D are used for visual and spatio-temporal feature extraction purposes.…”
Section: Feature Extraction (mentioning)
confidence: 99%