2023
DOI: 10.1145/3583682
|View full text |Cite
|
Sign up to set email alerts
|

Content-based and Knowledge-enriched Representations for Classification Across Modalities: A Survey

Abstract: This survey documents representation approaches for classification across different modalities, from purely content-based methods to techniques utilizing external sources of structured knowledge. We present studies related to three paradigms used for representation, namely a) low-level template-matching methods, b) aggregation-based approaches and c) deep representation learning systems. We then describe existing resources of structure knowledge and elaborate on the need for enriching representations with such… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 186 publications
0
2
0
Order By: Relevance
“…After image pre-processing feature extraction achieved using GLCM global descriptor, GLCM retrieved additional information from its layers to find the precise photos from the repository, their characteristics in terms of parameters such as distance, angle, resolution, level, and symmetric values. These values are retrieved from the pictures using the GLCM technique by image attributes such as Dissimilarity, Correlation, Homogeneity, Contrast, ASM, and Energy (1)(2)(3) . The photos are examined at several angles (0, 45, 90, and 135) to extract the characteristics using the layers.…”
Section: Additional Feature Extractionmentioning
confidence: 99%
See 1 more Smart Citation
“…After image pre-processing feature extraction achieved using GLCM global descriptor, GLCM retrieved additional information from its layers to find the precise photos from the repository, their characteristics in terms of parameters such as distance, angle, resolution, level, and symmetric values. These values are retrieved from the pictures using the GLCM technique by image attributes such as Dissimilarity, Correlation, Homogeneity, Contrast, ASM, and Energy (1)(2)(3) . The photos are examined at several angles (0, 45, 90, and 135) to extract the characteristics using the layers.…”
Section: Additional Feature Extractionmentioning
confidence: 99%
“…First, they are heavily reliant on labeled data, which can be challenging and costly to acquire. Second, they lack transparency and explainability, which limits the dependability of deep CBIR systems (3,4) . The quality of picture searches by using a strategy that uses multiple deep neural networks and K-Nearest Neighbor algorithms in content-based image retrieval (5) .…”
Section: Introductionmentioning
confidence: 99%