2022
DOI: 10.3390/electronics11030440
|View full text |Cite
|
Sign up to set email alerts
|

Person Localization Model Based on a Fusion of Acoustic and Visual Inputs

Abstract: PLEA is an interactive, biomimetic robotic head with non-verbal communication capabilities. PLEA reasoning is based on a multimodal approach combining video and audio inputs to determine the current emotional state of a person. PLEA expresses emotions using facial expressions generated in real-time, which are projected onto a 3D face surface. In this paper, a more sophisticated computation mechanism is developed and evaluated. The model for audio-visual person separation can locate a talking person in a crowde… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
2
1

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(1 citation statement)
references
References 31 publications
(30 reference statements)
0
1
0
Order By: Relevance
“…These sensors are used as a part of sensing modalities to analyze different information spaces including vision, sound, touch, etc. Based on the number of used sensing modalities these inputs are then fused in a multimodal approach [5].…”
Section: Introductionmentioning
confidence: 99%
“…These sensors are used as a part of sensing modalities to analyze different information spaces including vision, sound, touch, etc. Based on the number of used sensing modalities these inputs are then fused in a multimodal approach [5].…”
Section: Introductionmentioning
confidence: 99%