2020
DOI: 10.1109/access.2020.2972055
|View full text |Cite
|
Sign up to set email alerts
|

Joint Probabilistic People Detection in Overlapping Depth Images

Abstract: Privacy-preserving high-quality people detection is a vital computer vision task for various indoor scenarios, e.g. people counting, customer behavior analysis, ambient assisted living or smart homes. In this work a novel approach for people detection in multiple overlapping depth images is proposed. We present a probabilistic framework utilizing a generative scene model to jointly exploit the multi-view image evidence, allowing us to detect people from arbitrary viewpoints. Our approach makes use of meanfield… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
15
0

Year Published

2020
2020
2021
2021

Publication Types

Select...
7
1

Relationship

1
7

Authors

Journals

citations
Cited by 16 publications
(15 citation statements)
references
References 36 publications
(49 reference statements)
0
15
0
Order By: Relevance
“…PrimaryCaps are reshaped and vectorized. Thus, using the ReLU and tanh activation functions, it is simulated to a fully connected neural network and calculated as in equations ( 11) and (12). Here 1 and 2 are the results of two fully connected layers, 3 and 4 are the weight matrices of these layers, and 3 and 4 offset values.…”
Section: ) Capsule Attention Modulementioning
confidence: 99%
“…PrimaryCaps are reshaped and vectorized. Thus, using the ReLU and tanh activation functions, it is simulated to a fully connected neural network and calculated as in equations ( 11) and (12). Here 1 and 2 are the results of two fully connected layers, 3 and 4 are the weight matrices of these layers, and 3 and 4 offset values.…”
Section: ) Capsule Attention Modulementioning
confidence: 99%
“…In previous work [13] we re-cast the problem of people detection and tracking with multiple depth sensors as an inverse problem, employing an approximately differentiable scene model to detect people from arbitrary viewpoints. Following these ideas we introduced a probabilistic framework [1] based on a discrete scene configuration space. For stochastic inference a variational mean-field approximiation is used to jointly exploit the multi-view information in order to estimate the marginal probability distribution of people present in the scene.…”
Section: Sequence Of Temporal Framesmentioning
confidence: 99%
“…This has three major implications: (i) the sensors capture the scene from the top-view to reduce occlusions in crowded scenes; (ii) since the resulting field of view of a single sensor is quite limited, sensor networks need to be employed to cover a larger area; (iii) position changes of people lead to drastically varying appearances as a result of the vertical top-view. In previous work [1] we propose a probabilistic framework which uses a generative scene model to leverage the full image evidence from all sensor views. For the final approximation of the probability distribution of people present in the scene a mean-field variational inference optimization is employed.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…As the second step, the head-shoulder descriptor (HSD) was constructed to jointly encode depth difference information and local geometric information. FUJIMOTO et al [ 18 ] used the roundness and size of a height-continuous region to describe the human upper-back shape. They utilized implicitly included information in the missing region to alleviate the influences of partial loss of depth information caused by occlusions.…”
Section: Introductionmentioning
confidence: 99%