2017
DOI: 10.1007/s00138-017-0825-7
|View full text |Cite
|
Sign up to set email alerts
|

Generating natural language tags for video information management

Abstract: This exploratory work is concerned with generation of natural language descriptions that can be used for video retrieval applications. It is a step ahead of keyword based tagging as it captures relations between keywords associated with videos. Firstly we prepare hand annotations consisting of descriptions for video segments crafted from a TREC Video dataset. Analysis of this data presents insights into human's interests on video contents. Secondly we develop a framework for creating smooth and coherent descri… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
1
1
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(2 citation statements)
references
References 38 publications
0
2
0
Order By: Relevance
“…The image description is generated by creating visual dependency representation of natural images in [38]. Natural language description generation is also done for video for their retrieval purpose in [39], in which they capture relations between keywords associated with videos. Evaluation of machine translation with human‐generated description is also necessary.…”
Section: Related Workmentioning
confidence: 99%
“…The image description is generated by creating visual dependency representation of natural images in [38]. Natural language description generation is also done for video for their retrieval purpose in [39], in which they capture relations between keywords associated with videos. Evaluation of machine translation with human‐generated description is also necessary.…”
Section: Related Workmentioning
confidence: 99%
“…Image description is generated by creating visual dependency representation of natural images in [25]. Natural language description generation is also done for video for their retrieval purpose in [26], in which they capture relations between keywords associated with videos. Evaluation of machine translation with human generated description is also necessary.…”
Section: B Image Description Generationmentioning
confidence: 99%