2022
DOI: 10.1134/s1054661822030026

Combining Text and Image Analysis Methods for Solving Multimodal Classification Problems

Cited by 5 publications (2 citation statements)
References 11 publications
“…In fact, the recognition memory gap between object names validated with visual vs. nonvisual attributes disappeared when the word (apple) was employed as the test item rather than the image. The [31] adapted paradigm to isolate the processes of text comprehension and image recognition. In the research phase, they discovered that people remembered more information when the photos' object orientations and shapes matched the sentence's indicated object orientations and shapes.…”
Section: Embodiment and Knowledge Representation (mentioning)
confidence: 99%
“…AGI is characterized by an algorithm or set of algorithms capable of performing tasks across multiple domains as a typical human being would [1]. Creating an artificial general intelligence (AGI) requires a qualitative analysis of heterogeneous information, which is unique to humans' ability to make intelligent decisions based on vision, hearing, reading, and other senses [2]. The multimodal nature of data makes it possible to obtain high-quality solutions to problems of analyzing corrupted or visually attacked images, provided that additional, nonvisual information is available.…”
Section: Artificial General Intelligence (AGI) (mentioning)
confidence: 99%