2020 IEEE International Conference on Informatics, IoT, and Enabling Technologies (ICIoT) 2020
DOI: 10.1109/iciot48696.2020.9089557
|View full text |Cite
|
Sign up to set email alerts
|

A Scene-to-Speech Mobile based Application: Multiple Trained Models Approach

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
2
1
1

Relationship

1
3

Authors

Journals

citations
Cited by 4 publications
(3 citation statements)
references
References 29 publications
0
3
0
Order By: Relevance
“…Karkar et al [ 105 ] present the concept of scene to speech (STS). STS recognizes the elements in a captured image or a video clip and speaks, loudly, informative textual content that describes the scene.…”
Section: Resultsmentioning
confidence: 99%
“…Karkar et al [ 105 ] present the concept of scene to speech (STS). STS recognizes the elements in a captured image or a video clip and speaks, loudly, informative textual content that describes the scene.…”
Section: Resultsmentioning
confidence: 99%
“…Regardless these work's importance, they have not focused on mobile devices. In this way, the work [12] created an application responsible for converting video content into audio descriptions, which was implemented on ARM-based processor hardware. The researchers utilized a series of specialized models for fine-grained object classification, each focusing on a specific category.…”
Section: Related Workmentioning
confidence: 99%
“…A mobile device embedded with a smart scanner or camera is assigned to scan or capture the tags for visual marker identification. Non-tag-based systems [7,9,16] do not utilize any visual marker or barcodes. Instead, they process the raw imageries and apply various image feature detection algorithms and machine learning algorithms to recognize the objects.…”
Section: Introductionmentioning
confidence: 99%