Accurately describing complex traffic scenes is a difficult problem in computer vision. Traffic scenes are highly variable, so image captioning is easily disturbed by illumination changes and object occlusion. To address this problem, we propose an image caption generation model based on an attention mechanism, combining a convolutional neural network (CNN) and a recurrent neural network (RNN) to generate end-to-end descriptions for traffic images. To produce semantic descriptions with a high degree of discrimination, the attention mechanism is applied to the language model. We validate the effectiveness of our method on the Flickr8K, Flickr30K, and MS COCO benchmark datasets; accuracy improves by up to 8.6%, 12.4%, 19.3%, and 21.5% on the different evaluation metrics. Experiments show that our algorithm is robust in four challenging traffic scenarios: illumination change, abnormal weather, road-marked targets, and various kinds of vehicles.
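The core idea above, attending over CNN image features at each decoding step of the language model, can be sketched as follows. This is a minimal illustrative example of soft (additive) attention, not the paper's actual implementation; all dimensions, weight matrices, and variable names are assumptions chosen for clarity.

```python
import numpy as np

rng = np.random.default_rng(0)

# Assumed shapes: 49 spatial regions from a CNN feature map, 512-d features,
# a 256-d RNN decoder state, and a 128-d attention space.
num_regions, feat_dim, hid_dim, att_dim = 49, 512, 256, 128

# CNN encoder output: one feature vector per spatial region of the image.
V = rng.standard_normal((num_regions, feat_dim))
# RNN decoder hidden state at the current word-generation step.
h = rng.standard_normal(hid_dim)

# Learned projections (random stand-ins here, not trained weights).
W_v = rng.standard_normal((feat_dim, att_dim))
W_h = rng.standard_normal((hid_dim, att_dim))
w_a = rng.standard_normal(att_dim)

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

# Additive attention: score each image region against the decoder state,
# normalize the scores into weights, then pool the regions accordingly.
scores = np.tanh(V @ W_v + h @ W_h) @ w_a   # (num_regions,)
alpha = softmax(scores)                     # attention weights, sum to 1
context = alpha @ V                         # (feat_dim,) attended feature

print(alpha.shape, context.shape)
```

The context vector would then be fed, together with the previous word embedding, into the RNN to predict the next caption word; the weights `alpha` indicate which image regions the model focuses on, which is what lets attention emphasize discriminative regions such as vehicles or road markings.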