2020
DOI: 10.48550/arxiv.2010.01305
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Bounding Boxes Are All We Need: Street View Image Classification via Context Encoding of Detected Buildings

Kun Zhao,
Yongkun Liu,
Siyuan Hao
et al.

Abstract: Street view images classification aiming at urban land use analysis is difficult because the class labels (e.g., commercial area), are concepts with higher abstract level compared to the ones of general visual tasks (e.g., persons and cars). Therefore, classification models using only visual features often fail to achieve satisfactory performance. In this paper, a novel approach based on a "Detector-Encoder-Classifier" framework is proposed. Instead of using visual features of the whole image directly as commo… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2021
2021
2021
2021

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 52 publications
0
1
0
Order By: Relevance
“…Their experiments found a strong correlation between the object counts in social media images and building functions. Zhao et al (2020) proposed a 'Detector-Encoder-Classifier' network to firstly detect the building of different categories in GSV images using state-of-the-art object detectors (Ren et al 2015), (Cai and Vasconcelos 2018). Then the detected bounding boxes metadata is sent into the Recurrent Neural Network (RNN) to conduct urban land-use classification.…”
Section: Building Classificationmentioning
confidence: 99%
“…Their experiments found a strong correlation between the object counts in social media images and building functions. Zhao et al (2020) proposed a 'Detector-Encoder-Classifier' network to firstly detect the building of different categories in GSV images using state-of-the-art object detectors (Ren et al 2015), (Cai and Vasconcelos 2018). Then the detected bounding boxes metadata is sent into the Recurrent Neural Network (RNN) to conduct urban land-use classification.…”
Section: Building Classificationmentioning
confidence: 99%