2020
DOI: 10.1007/s11263-020-01401-3
|View full text |Cite
|
Sign up to set email alerts
|

Compositional Convolutional Neural Networks: A Robust and Interpretable Model for Object Recognition Under Occlusion

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

1
33
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
5
3
2

Relationship

3
7

Authors

Journals

citations
Cited by 65 publications
(38 citation statements)
references
References 42 publications
1
33
0
Order By: Relevance
“…In a broader context, our work builds on and extends a recent line of work that follows an approximate analysis-by-synthesis approach to computer vision [49], which formulates vision as an inverse rendering process on the level of neural network features. Several recent works demonstrate that approximate analysis-by-synthesis induces a largely enhanced generalization in out-of-distribution situations such as when objects are partially occluded in image classification [21][22][23]57] and object detection [50], when images are modified through adversarial patches [20], or when objects are viewed from unseen 3D poses [49]. Our work enables the learning of models for approximate analysis-by-synthesis with minimal supervision.…”
Section: Related Workmentioning
confidence: 99%
“…In a broader context, our work builds on and extends a recent line of work that follows an approximate analysis-by-synthesis approach to computer vision [49], which formulates vision as an inverse rendering process on the level of neural network features. Several recent works demonstrate that approximate analysis-by-synthesis induces a largely enhanced generalization in out-of-distribution situations such as when objects are partially occluded in image classification [21][22][23]57] and object detection [50], when images are modified through adversarial patches [20], or when objects are viewed from unseen 3D poses [49]. Our work enables the learning of models for approximate analysis-by-synthesis with minimal supervision.…”
Section: Related Workmentioning
confidence: 99%
“…In fact, the issue of object detection under the influence of occlusion is a challenging task which negatively affects the robustness of most detection algorithms [47]. While current approaches aim to tackle this problem by applying a compositional neural network structure in combination with an occluder model [47][48][49][50], the majority of approaches focus on the problem of partial occlusion and would, therefore, be of limited suitability for this study. Moreover, during the tracking process, the negative effect of object occlusion can be reduced to some extent, by applying a predictive model like the KF algorithm, which internally interprets the CNN detections as noisy measurement information.…”
Section: Pig Detection and Trackingmentioning
confidence: 99%
“…[18] additionally predicts the segmentation masks of occluders. [21] integrates compositional models and deep convolutional neural networks into a unified model which is more robust to partial occlusions.…”
Section: Related Workmentioning
confidence: 99%