2022
DOI: 10.1007/s00530-022-00902-0
|View full text |Cite
|
Sign up to set email alerts
|

Image and audio caps: automated captioning of background sounds and images using deep learning

Abstract: Image recognition based on computers is something human beings have been working on for many years. It is one of the most difficult tasks in the field of computer science, and improvements to this system are made when we speak. In this paper, we propose a methodology to automatically propose an appropriate title and add a specific sound to the image. Two models have been extensively trained and combined to achieve this effect. Sounds are recommended based on the image scene and the headings are generated using… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
9

Relationship

1
8

Authors

Journals

citations
Cited by 32 publications
(7 citation statements)
references
References 59 publications
0
6
0
Order By: Relevance
“…It provides important reference and guidance for the research of object detection and recognition methods. Poongodi et al [ 12 ] proposed a method to adapt media titles and add specific sounds to images. To achieve this effect, a computer vision model of image scene recommendation and natural language processing is combined.…”
Section: Related Workmentioning
confidence: 99%
“…It provides important reference and guidance for the research of object detection and recognition methods. Poongodi et al [ 12 ] proposed a method to adapt media titles and add specific sounds to images. To achieve this effect, a computer vision model of image scene recommendation and natural language processing is combined.…”
Section: Related Workmentioning
confidence: 99%
“…( Poongodi et al., 2022a ) In this research, we suggest a way to automatically suggest a suitable title and add a particular sound to the image. To get this result, two models that have been intensively trained together.…”
Section: Related Workmentioning
confidence: 99%
“…The degree of impact is as small as possible [18]. The auxiliary function of the image recognition system adopts a modular design scheme, which improves the scalability of the system and facilitates the access of subsequent algorithms and functions [19]. By analyzing the characteristics of the sensitive image recognition algorithm and user characteristics, it summarizes the needs of the system's auxiliary functions, which include the application startup module, the image recognition API module, and the image storage module.…”
Section: Research On Image Acquisition Innovationmentioning
confidence: 99%