2016
DOI: 10.48550/arxiv.1608.05442
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Semantic Understanding of Scenes through the ADE20K Dataset

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
112
0

Year Published

2018
2018
2021
2021

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 84 publications
(118 citation statements)
references
References 28 publications
0
112
0
Order By: Relevance
“…Fine-tuning: We employed three data sets: ImageNet (ILSVRC 2012) [49] for single-label classification, MS-COCO [34] for object detection, and ADE20K [60] for semantic segmentation. The number of images used was 1% of each data set (roughly, 12 thousand for ImageNet, a thousand for COCO, and 2 hundred for ADE20K).…”
Section: Pre-trainingmentioning
confidence: 99%
“…Fine-tuning: We employed three data sets: ImageNet (ILSVRC 2012) [49] for single-label classification, MS-COCO [34] for object detection, and ADE20K [60] for semantic segmentation. The number of images used was 1% of each data set (roughly, 12 thousand for ImageNet, a thousand for COCO, and 2 hundred for ADE20K).…”
Section: Pre-trainingmentioning
confidence: 99%
“…The training and testing sets consist of about 4998 and 5105 images respectively. ADE-20k: ADE-20k [16] is a challenging dataset that contains 22K densely annotated images with 150 fine-grained semantic concepts. The training and validation sets consist of 20210 and 2000 images respectively.…”
Section: A Benchmarksmentioning
confidence: 99%
“…That means each point shares the different global context since they have the appearance variation locally. Visualization of Error Map: Figure 10 gives error map on both Cityscapes [73] and ADE20k [16] validation datasets using ASSP as GA head baselines. In particular, we use ResNet101 backbone as a strong baseline and LDv2 as the LD module.…”
Section: Visualization Of Local Affinity Map On Sampled Pointsmentioning
confidence: 99%
See 1 more Smart Citation
“…One important direction of visual pattern recognition in agriculture is aerial image semantic segmentation. Different from conventional image semantic segmentation dataset where only RGB based image is available [5,20,19,13], the agricultural data collection process utilizes specific cameras to capture Red, Green and Blue channel(RGB) with an additional near-infrared(NIR) signal channel which can be used in the pattern recognition process [4]. Also, agricultural data is naturally imbalanced.…”
Section: Introductionmentioning
confidence: 99%