2022 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2022
DOI: 10.1109/wacv51458.2022.00154
|View full text |Cite
|
Sign up to set email alerts
|

Interpretable Semantic Photo Geolocation

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
12
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 10 publications
(14 citation statements)
references
References 25 publications
0
12
0
Order By: Relevance
“…Next, we use an ensemble of hierarchical classification using all three resolutions. However, in agreement with [43], this method does not achieve a consistent improvement than considering only fine partitioning. Moreover, the ensemble increases inference time by almost 9%.…”
Section: A Implementation Details and Hyper-parameter Values A1 Adapt...mentioning
confidence: 81%
See 3 more Smart Citations
“…Next, we use an ensemble of hierarchical classification using all three resolutions. However, in agreement with [43], this method does not achieve a consistent improvement than considering only fine partitioning. Moreover, the ensemble increases inference time by almost 9%.…”
Section: A Implementation Details and Hyper-parameter Values A1 Adapt...mentioning
confidence: 81%
“…We train TransLocator in a unified multi-task framework for simultaneous geo-localization and scene recognition, and thus, our system can be applied to images from all environmental settings. Extensive experiments with TransLocator on four benchmark datasets -Im2GPS [13], Im2GPS3k [14], YFCC4k [50] and YFCC26k [43] shows a significant improvement of 5.5%, 14.1%, 4.9%, 9.9% continent-level accuracy over current state-of-the-art. We also obtain better qualitative results when we test TransLocator on challenging real-world images.…”
Section: Discussionmentioning
confidence: 93%
See 2 more Smart Citations
“…High performance in localisation indicates that the explanations often align with the bounding boxes or segmentation masks provided by human annotators. We consider two localisation metrics, the pointing game [57] and top-k intersection [46]. The pointing game measures whether the pixel with the highest importance is located within the object location.…”
Section: Evaluation Of Explanationsmentioning
confidence: 99%