2020
DOI: 10.1109/tmm.2019.2924598
|View full text |Cite
|
Sign up to set email alerts
|

The Role of the Input in Natural Language Video Description

Abstract: Natural Language Video Description (NLVD) has recently received strong interest in the Computer Vision, Natural Language Processing (NLP), Multimedia, and Autonomous Robotics communities. The State-of-the-Art (SotA) approaches obtained remarkable results when tested on the benchmark datasets. However, those approaches poorly generalize to new datasets. In addition, none of the existing works focus on the processing of the input to the NLVD systems, which is both visual and textual. In this work, it is presente… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
0
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
4
1

Relationship

1
4

Authors

Journals

citations
Cited by 5 publications
(2 citation statements)
references
References 70 publications
(102 reference statements)
0
0
0
Order By: Relevance
“…Due to the large amounts of data that are accessible on the Internet, Natural Language Processing (NLP) has also experienced a shift towards information extraction and production during the last decade. [29] Additionally, as personal computers have become more accessible to the general public, applications for natural language processing (NLP) have grown more widespread, which has stimulated additional study in the subject. [8] III.OVERVIEW OF NLP "Natural Language Processing (NLP) is an area of computing technology, specifically within the domain of artificial cognition (AI), that is devoted to providing computers with the capacity to grasp written text and spoken language in a manner that is comparable to how human beings do so.…”
Section: Introductionmentioning
confidence: 99%
“…Due to the large amounts of data that are accessible on the Internet, Natural Language Processing (NLP) has also experienced a shift towards information extraction and production during the last decade. [29] Additionally, as personal computers have become more accessible to the general public, applications for natural language processing (NLP) have grown more widespread, which has stimulated additional study in the subject. [8] III.OVERVIEW OF NLP "Natural Language Processing (NLP) is an area of computing technology, specifically within the domain of artificial cognition (AI), that is devoted to providing computers with the capacity to grasp written text and spoken language in a manner that is comparable to how human beings do so.…”
Section: Introductionmentioning
confidence: 99%
“…With the rapid development of the economy and society and the continuous progress of scientific and technological power, video images play a rather important role in information communication [1]. Relevant scientific data show that about 80% of human's access to external information comes from vision, which shows that vision is an important channel for humans to know and transform the world, so the effective transmission and expression of video image information is of great importance [2][3].…”
Section: Introductionmentioning
confidence: 99%