2019
DOI: 10.1007/s13735-019-00189-4
|View full text |Cite
|
Sign up to set email alerts
|

ContextNet: representation and exploration for painting classification and retrieval in context

Abstract: In automatic art analysis, models that besides the visual elements of an artwork represent the relationships between the different artistic attributes could be very informative. Those kinds of relationships, however, usually appear in a very subtle way, being extremely difficult to detect with standard convolutional neural networks. In this work, we propose to capture contextual artistic information from fine-art paintings with a specific ContextNet network. As context can be obtained from multiple sources, we… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
26
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 30 publications
(34 citation statements)
references
References 49 publications
0
26
0
Order By: Relevance
“…Instead of aiming to retrieve the images that are most similar to a query image, crossmodal retrieval aims at finding the images most closely related to a provided query text or at finding the best descriptive texts for a query image. Cross-modal image retrieval plays an important role in the context of querying art collections, e.g., [18,19], where it is a challenging task to match images and texts in cultural heritage related collections [48]. In [18], descriptors are learned by minimizing a variant of the triplet loss, where image descriptors and text descriptors are forced to be similar with respect to their dot product.…”
Section: Image Retrieval For Cultural Heritagementioning
confidence: 99%
See 2 more Smart Citations
“…Instead of aiming to retrieve the images that are most similar to a query image, crossmodal retrieval aims at finding the images most closely related to a provided query text or at finding the best descriptive texts for a query image. Cross-modal image retrieval plays an important role in the context of querying art collections, e.g., [18,19], where it is a challenging task to match images and texts in cultural heritage related collections [48]. In [18], descriptors are learned by minimizing a variant of the triplet loss, where image descriptors and text descriptors are forced to be similar with respect to their dot product.…”
Section: Image Retrieval For Cultural Heritagementioning
confidence: 99%
“…In [18], descriptors are learned by minimizing a variant of the triplet loss, where image descriptors and text descriptors are forced to be similar with respect to their dot product. The approach in [19] also addresses cross-modal retrieval using strategies that are similar to the ones used in our work. The authors obtain image descriptors for retrieval on the basis of a CNN (ContextNet) pre-trained for the multi-task classification of four semantic variables.…”
Section: Image Retrieval For Cultural Heritagementioning
confidence: 99%
See 1 more Smart Citation
“…One of the most important concepts related to DL is transfer learning [1,[18][19][20][21][22][23][24][25][26][27][28][29][30]. Popular programming and software development platforms such as Matlab or Python offer a wide range of pre-trained CNN models of different structures and complexity.…”
Section: Related Workmentioning
confidence: 99%
“…Besides finding objects and classifying paintings, art historians are interested in visual similarities and patterns between artworks to gain new knowledge about artistic relations or the meaning and development of motifs. To solve this task without manual effort, researchers proposed algorithms to find visual similarities in art collections automatically [ 17 , 45 , 46 ]. While these approaches show promising results, they are limited to small collections since they require a pairwise comparison of all images and thus, scale quadratically with the dataset size.…”
Section: Related Workmentioning
confidence: 99%